INTRODUCTION
Research  Initiation  Program  -  1990 

AFOSR  has  provided  funding  for  follow-on  research  efforts  for  the  participants  in  the 
Summer  Faculty  Research  Program.  Initially,  this  program  was  conducted  by  AFOSR  and 
popularly  known  as  the  Mini-Grant  Program.  Since  1983  the  program  has  been  conducted  by 
the  Summer  Faculty  Research  Program  (SFRP)  contractor  and  is  now  called  the  Research 
Initiation  Program  (RIP).  Funding  is  provided  to  establish  RIP  awards  to  about  half  the  number 
of  participants  in  the  SFRP. 

Participants  in  the  1990  SFRP  competed  for  funding  under  the  1990  RIP.  Participants 
submitted  cost  and  technical  proposals  to  the  contractor  by  1  November  1990,  following  their 
participation  in  the  1990  SFRP. 

Evaluation of these proposals was made by the contractor. Evaluation criteria consisted
of:


1.  Technical  excellence  of  the  proposal 

2.  Continuation  of  the  SFRP  effort 

3.  Cost  sharing  by  the  university 

The  list  of  proposals  selected  for  award  was  forwarded  to  AFOSR  for  approval  of  funding. 
Those  approved  by  AFOSR  were  funded  for  research  efforts  to  be  completed  by  31  December 
1991. 


The  following  summarizes  the  events  for  the  evaluation  of  proposals  and  award  of  funding 
under  the  RIP. 

A.  RIP proposals were submitted to the contractor by 1 November 1990. The
proposals were limited to $20,000 plus cost sharing by the universities. The
universities were encouraged to cost share, since the program is intended to
establish a long-term relationship between the Air Force and the university.

B.  Proposals  were  evaluated  on  the  criteria  listed  above  and  the  final  award  approval 
was  given  by  AFOSR  after  consultation  with  the  Air  Force  Laboratories. 

C.  Subcontracts  were  negotiated  with  the  universities.  The  period  of  performance  of 
the  subcontract  was  between  October  1990  and  December  1991. 

Copies  of  the  final  reports  are  presented  in  Volumes  I  through  IV  of  the  1990  Research 
Initiation  Program  Report.  There  were  a  total  of  92  RIP  awards  made  under  the  1990  program. 
PROGRAM STATISTICS


Total SFRP Participants                     165
Total RIP Proposals submitted by SFRP       129
Total RIP Proposals submitted by GSRP         7
Total RIP Proposals submitted               136
Total RIPs funded to SFRP                    88
Total RIPs funded to GSRP                     4
Total RIPs funded                            92
Total RIP Proposals submitted by HBCUs        9
Total RIP Proposals funded to HBCUs           2




LABORATORY  PARTICIPATION 


Laboratory    Participants    Submitted      Funded

AAMRL         10              6              5
WRDC/APL      11              9              7
ATL           11              9 (1 GSRP)     8 (1 GSRP)
AEDC          7               10 (3 GSRP)    6 (GSRP)
WRDC/AL       9               8 (1 GSRP)     6
ESMC          0               0              0
ESD           1               1              0
ESC           11              8              5
ETL           3               2              1
WRDC/FDL      11              11             6
FJSRL         8               6              4
AFGL          11              9 (1 GSRP)     6
HRL           14              12             8
WRDC/ML       11              8              6
OEHL          4               2              1
AL            7               5              4
RADC          12              9              6
SAM           17              14             8
WL            6               7 (1 GSRP)     5 (1 GSRP)
WHMC          1               0              0


LIST  OF  PARTICIPATING  UNIVERSITIES 


Alabama, University of - 4
Alfred University - 1
Arizona State University - 1
Brigham Young University - 1
Butler University - 1
California State University - 1
Cincinnati, University of - 1
Colorado-Colorado Springs, Univ. of - 2
Denver, University of - 1
Duke University - 1
Fairleigh Dickinson University - 1
Florida Institute of Tech. - 1
Florida, University of - 1
Houston, University of - 1
Houston-Victoria, Univ. of - 1
Idaho State University - 1
Indiana University - 1
Indiana-Purdue University - 1
Kansas State University - 1
Kent State University - 1
Kentucky, University of - 2
Lowell, University of - 1
MIT - 1
Maine, University of - 2
Marshall University - 1
Maryland-Baltimore, University of - 1
Memphis State University - 2
Michigan Tech. University - 1
Minnesota, University of - 1
Mississippi State University - 3
Missouri-Kansas City, Univ. of - 1
Missouri-Rolla, University of - 1
Morehouse College - 1
New Orleans, University of - 1
New York, City College of - 1
New York-Buffalo, State Univ. of - 1
North Dakota State University - 2
North Texas, University of - 2
Northwestern University - 2
Ohio State University - 2
Oklahoma State University - 1
Pennsylvania State University - 1
Puerto Rico, University of - 1
Purdue Calumet University - 1
Rensselaer Polytechnic Inst. - 1
Rhode Island, University of - 1
Scranton, University of - 2
South Dakota, University of - 1
South Florida, University of - 1
Southern Illinois University - 1
Southwest Texas State Univ. - 1
Syracuse University - 1
Texas A&M University - 1
Texas Tech. University - 1
Texas-Arlington, Univ. of - 1
Texas-Austin, Univ. of - 1
Texas-San Antonio, Univ. of - 2
Trinity University - 1
Tufts University - 1
Utah State University - 2
Utica College - 1
Vanderbilt University - 3
Virginia Poly. Instit. State Univ. - 1
Washington State University - 2
Wellesley College - 1
West Texas State University - 1
West Virginia University - 1
Worcester Polytechnic Institute - 1
Wright State University - 4
Wyoming, University of - 1

Total  -  92 
AERO PROPULSION AND POWER DIRECTORATE


(Wright-Patterson  Air  Force  Base) 

Dr. Muhammad Choudhry
West Virginia University
Specialty: Electrical Engineering

Dr. Donald Dareing
University of Florida
Specialty: Mechanical Engineering

Dr. Paul Hedman
Brigham Young University
Specialty: Chemical Engineering

Dr. K. Sankara Rao
North Dakota State University
Specialty: Electrical Engineering

Dr. Larry Roe
Virginia Poly. Instit. State University
Specialty: Mechanical Engineering

Dr. Kaveh Tagavi
University of Kentucky
Specialty: Mechanical Engineering

Dr. Richard Tankin
Northwestern University
Specialty: Mechanical Engineering


ARMAMENT DIRECTORATE
(Eglin Air Force Base)

Dr. Charles Camp
Memphis State University
Specialty: Civil Engineering

Dr. Arnold Carden
University of Alabama
Specialty: Metallurgy

Dr. Charles Fosha
University of Colorado
Specialty: Electrical Engineering

Dr. John George
University of Wyoming
Specialty: Applied Mathematics

Dr. Kevin Moore
Idaho State University
Specialty: Electrical Engineering

Dr. William Siuru
University of Colorado
Specialty: Mechanical Engineering

Dr. Kenneth Sobel
City College of New York
Specialty: Electrical Engineering

Mr. Randy Gove (GSRP)
University of Alabama
Specialty: Physics




ARMSTRONG  LABORATORY 
(Brooks  Air  Force  Base) 


Dr.  Phillip  Bishop 
University  of  Alabama 
Specialty:  Exercise  Physiology 

Dr.  Robert  Blystone 
Trinity  University 
Specialty:  Zoology 

Dr.  Bruno  Breitmeyer 
University  of  Houston 
Specialty:  Experimental  Psychology 

Dr.  Vito  DelVecchio 
University  of  Scranton 
Specialty:  Biochemistry 


Dr.  Pushpa  Gupta 
University  of  Maine 
Specialty:  Mathematics 

Dr.  Ramesh  Gupta 

University  of  Maine 

Specialty:  Mathematical  Statistics 

Dr.  John  Szarek 
Marshall  University 
Specialty:  Pharmaceutical 

Dr.  Steven  Waller 
University  of  South  Dakota 
Specialty:  Pharmacology 


ARNOLD  ENGINEERING  DEVELOPMENT  CENTER 


(Arnold  Air  Force  Base) 

Dr.  Carlyle  Moore 
Morehouse  College 
Specialty:  Physics 

Dr.  Olin  Norton 
Mississippi  State  University 
Specialty:  Mechanical  Engineering 

Dr.  Richard  Peters 
Vanderbilt  University 
Specialty:  Electrical  Engineering 


Dr.  Chun  Su 

Mississippi  State  University 
Specialty:  Physics 

Mr.  Ben  Abbott  (GSRP) 
Vanderbilt  University 
Specialty:  Computer  Engineering 

Mr.  Theodore  Bapty  (GSRP) 
Vanderbilt  University 
Specialty:  Computer  Engineering 




AVIONICS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 


Dr. Michael Breen
Alfred University
Specialty: Mathematics

Dr. R. H. Cofer
Florida Institute of Tech.
Specialty: Electrical Engineering

Dr. Lawrence Hall
University of South Florida
Specialty: Computer Science

Dr. Kevin Kirby
Wright State University
Specialty: Computer Science

Dr. Richard Miers
Indiana Univ. - Purdue Univ.
Specialty: Physics

Mr. Eric Byrne (GSRP)
Kansas State University
Specialty: Computer Science


CREW SYSTEMS DIRECTORATE
(Wright-Patterson Air Force Base)

Dr. Richard Backs
Wright State University
Specialty: Psychology

Mr. John Duncan
Kent State University
Specialty: Technology

Dr. Martin Hagan
Oklahoma State University
Specialty: Electrical Engineering

Dr. Ashok Krishnamurthy
Ohio State University
Specialty: Electrical Engineering

Dr. Amit Patra
University of Puerto Rico
Specialty: Mechanical Engineering


ELECTRONIC  TECHNOLOGY  LABORATORY 


Dr.  Ashok  K.  Goel 
Michigan  Tech.  University 
Specialty:  Electrical  Engineering 




ENGINEERING  AND  SERVICES  CENTER 
(Tyndall  Air  Force  Base) 


Dr.  William  Bannister 

University  of  Lowell 

Specialty:  Organic  Chemistry 

Dr.  Perry  McNeill 

University  of  North  Texas 

Specialty:  Education 

Dr.  Joseph  Dreisbach 

University  of  Scranton 

Specialty:  Chemistry 

Dr.  George  Veyera 

University  of  Rhode  Island 

Specialty:  Civil  Engineering 

Dr.  Michael  McFarland 

Utah  State  University 

Specialty:  Biological  Engineering 

FLIGHT  DYNAMICS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 

Dr.  Marvin  Hamstad 

University  of  Denver 

Specialty:  Solid  Mechanics 

Dr.  Yulian  Kin 

Purdue  Calumet 

Specialty:  Mechanical  Engineering 

Dr.  Chin  Hsu 

Washington  State  University 

Specialty:  Electrical  Engineering 

Dr.  Byung-Lip  Lee 

Pennsylvania  State  University 

Specialty:  Materials  Science 

Dr.  David  Hui 

Worcester  Poly.  Institute 

Specialty:  Aerospace  Engineering 

Dr.  Lawrence  Zavodney 

Ohio  State  University 

Specialty:  Mechanical  Engineering 

FRANK  J.  SEILER  RESEARCH  LABORATORY 
(United  States  Air  Force  Academy) 

Dr.  Theodore  Burkey 

Memphis  State  University 

Specialty:  Chemistry 

Dr.  Ephrahim  Garcia 

State  Univ.  of  New  York-Buffalo 

Specialty:  Aerospace  Engineering 

Dr.  Richard  Carlin 

University  of  Alabama 

Specialty:  Chemistry 

Dr.  Thomas  Posbergh 

University  of  Minnesota 

Specialty:  Electrical  Engineering 


GEOPHYSICS  DIRECTORATE 
(Hanscom  Air  Force  Base) 

Dr.  Chia-Bo  Chang 
Texas  Tech.  University 
Specialty:  Meteorology 


Dr.  Craig  Rasmussen 
Utah  State  University 
Specialty:  Physics 


Dr.  George  Jumper 
Worcester  Poly.  Institute 
Specialty:  Mechanical  Engineering 


Dr.  Glenn  Stark 
Wellesley  College 
Specialty:  Physics 


Dr.  Charles  Lishawa 
Utica  College 

Specialty:  Physical  Chemistry 


Dr.  John  Wills 
Indiana  University 
Specialty:  Physics 


HUMAN  RESOURCES  DIRECTORATE 

(Brooks,  Williams  and  Wright-Patterson  Air  Force  Base) 


Dr.  Margaret  Batschelet 
University  of  Texas-San  Antonio 
Specialty:  English 

Dr.  James  Dykes 

University  of  Texas-San  Antonio 

Specialty:  Psychology 

Dr.  Verlin  Hinsz 

North  Dakota  State  University 

Specialty: Psychology

Dr.  Delayne  Hudspeth 
University  of  Texas-Austin 
Specialty:  Education 


Dr.  Gillray  Kandel 
Rensselaer  Poly.  Institute 
Specialty:  Experimental  Psychology 

Dr.  William  Moor 
Arizona  State  University 
Specialty:  Industrial  Engineering 

Dr.  Joan  Rentsch 
Wright  State  University 
Specialty:  Industrial  Psychology 

Dr.  Stanley  Stephenson 
Southwest  Texas  State  University 
Specialty:  Psychology 




MATERIALS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 


Dr.  John  Connolly 
University  of  Missouri-Kansas 
Specialty:  Chemistry 

Dr.  David  Grossie 
Wright  State  University 
Specialty:  Chemistry 

Dr.  Prasad  Kadaba 
University  of  Kentucky 
Specialty:  Physics 


Dr.  Joseph  Lambert 
Northwestern  University 
Specialty:  Chemistry 

Dr.  Martin  Schwartz 
University  of  North  Texas 
Specialty:  Physical  Chemistry 

Dr.  Hai-Lung  Tsai 
University  of  Missouri-Rolla 
Specialty:  Mechanical  Engineering 


OCCUPATIONAL  AND  ENVIRONMENTAL  HEALTH  DIRECTORATE 
(Brooks  Air  Force  Base) 

Dr.  Miguel  Medina 
Duke  University 
Specialty:  Water  Resources 


ROCKET  PROPULSION  DIRECTORATE 
(Edwards  Air  Force  Base) 


Dr.  Shannon  Lieb 

Butler University

Specialty:  Physical  Chemistry 

Dr.  Thomas  Pollock 
Texas  A&M  University 
Specialty:  Materials  Science 


Dr.  Hung  Vu 

California  State  University 
Specialty:  Applied  Mechanics 

Dr.  Trevor  Williams 
University  of  Cincinnati 
Specialty:  Control  Theory 
ROME LABORATORIES
(Griffiss  Air  Force  Base) 

Dr.  Gary  Craig 

Syracuse  University 

Specialty:  Electrical  Engineering 

Dr.  Daniel  Ryder 

Tufts  University 

Specialty:  Chemical  Engineering 

Dr.  Frances  Harackiewicz 

Southern  Illinois  University 

Specialty:  Electrical  Engineering 

Dr.  Behrooz  Shirazi 

Southern  Methodist  University 
Specialty:  Computer  Science 

Dr.  Shietung  Peng 

University  of  Maryland-Baltimore 
Specialty:  Computer  Science 

Dr.  Wayne  Smith 

Mississippi  State  University 
Specialty:  Computer  Science 

WEAPONS  DIRECTORATE 
(Kirtland  Air  Force  Base) 

Dr.  Gene  Carlisle 

West  Texas  State  University 

Specialty: Inorganic Chemistry

Dr.  Peter  Walsh 

Fairleigh  Dickinson  University 
Specialty: Physics

Dr.  William  Cofer 

Washington  State  University 

Specialty:  Civil  Engineering 

Mr.  Michael  Houts  (GSRP) 
Massachusetts Inst. of Tech.
Specialty:  Nuclear  Engineering 

Dr.  Johanna  Schruben 

University of Houston-Victoria

Specialty:  Mathematics 
MINI-GRANT RESEARCH REPORTS


Each entry lists: Technical Report Number, Title, Mini-Grant Number, Professor

Volume I

Rome Laboratories

1   Resource Management in a Heterogeneous Distributed Systems
    210-11MG-107
    Dr. Gary Craig

2   Analysis of Magnetically Controllable Microstrip Antennas on Ferrite Substrates
    210-11MG-120
    Dr. Frances Harackiewicz

3   Architectural Design and Simulation of a Parallel Signal Processor for
    Adaptive Space-Time Processing
    210-11MG-103
    Dr. Shietung Peng

4   Thermal Processing of Pb-Modified Bi-Sr-Ca-Cu-O Superconducting Thin Films
    Prepared by the Metalorganic Deposition (MOD) Method
    210-11MG-0U
    Dr. Daniel Ryder

5   Parallel Architectures for AI and Dynamic Knowledge-Base Systems
    210-11MG-064
    Dr. Behrooz Shirazi

6   Markov Models for Simulating Error Patterns on Data Communications Links
    210-11MG-012
    Dr. Wayne Smith

Arnold Engineering Development Center

7   Performance Measurement and Optimization of Parallel Systems
    210-11MG-106
    Mr. Ben Abbott (GSRP)

8   An Environment for Reconfigurable Parallel Processing Software
    210-11MG-104
    Mr. Theodore Bapty (GSRP)

9   Feasibility of Measuring Pulsed X-Ray Spectra using Photoactivation of
    Nuclear Isomers
    210-11MG-056
    Dr. Carlyle Moore

10  Modeling of Burning Particles in the Plume of a Flare
    210-11MG-122
    Dr. Olin Norton

11  Image Sequence Noise Reduction using Morphological Filters
    210-11MG-105
    Dr. Richard Peters

12  Laser-Induced Fluorescence of Hydroxyl
    210-11MG-039
    Dr. Chun Fu Su

Engineering and Services Center

13  Non-Volatile Precursors to Olefinic Chlorofluorocarbons [NVP-OCFCs] as
    Alternative Fire Extinguishing Agents with Reduced Global Environmental Impacts
    210-11MG-113
    Dr. William Bannister

14  Synthesis of Potential Intermediates of Microbial Biodegradation Pathways of
    Nitroaromatic Compounds
    210-11MG-094
    Dr. Joseph Dreisbach

15  Propanotrophic Biodegradation of Trichloroethylene (TCE) in Continuous Flow
    Reactors
    210-11MG-067
    Dr. Michael McFarland

16  Submicron Antennas for Solar Energy Conversion
    210-11MG-045
    Dr. Perry McNeill

17  The Microstructure of Compacted Moist Sand and its Effect on Stress Transmission
    210-11MG-073
    Dr. George Veyera

Frank J. Seiler Research Laboratory

18  Photoacoustic Calorimetry, ESR, and DSC Studies of NTO
    210-11MG-055
    Dr. Theodore Burkey


19  Chemical Reactivity and Thermodynamic Stability of Alkali Metals Deposited from
    Chloroaluminate Ambient-Temperature Molten Salt Electrolytes
    210-11MG-034
    Dr. Richard Carlin

20  Issues in the Application and Design of Reaction Mass Actuators
    210-11MG-111
    Dr. Ephrahim Garcia

21  A Control Formulation for the Active and Passive Damping of Flexible Structures
    210-11MG-117
    Dr. Thomas Posbergh


Volume II
Phillips Laboratory
Geophysics Directorate

22  PBL Disturbances over the Desert Southwest
    210-11MG-060
    Dr. Chia-Bo Chang

23  Effect of Magnus Moments on Missile Aerodynamic Performance
    210-11MG-126
    Dr. George Jumper

24  Simulation of the Spectra of Diatomic Ions and Molecules
    210-11MG-118
    Dr. Charles Lishawa

25  Plasma Transport in the Equatorial Ionosphere during the Great Magnetic
    Storm of March, 1989
    210-11MG-109
    Dr. Craig Rasmussen

26  Resonance-Enhanced Multiphoton Ionization of Atmospheric Molecules
    210-11MG-038
    Dr. Glenn Stark

27  Non Uniform Clouds
    210-11MG-027
    Dr. John Wills

Rocket Propulsion Directorate

28  Molecular Dynamics Study of Rocket Propellant Adhesion
    210-11MG-032
    Dr. Shannon Lieb


29  Integration of Prototype Reaction Wheel Torquers into the Astrex Facility
    210-11MG-116
    Dr. Thomas Pollock

30  Control System Design and Optimal Selection of Noisy Actuators and Sensors
    for Flexible Structures
    210-11MG-089
    Dr. Hung Vu

31  Sensor/Actuator Placement for Flexible Spacecraft
    210-11MG-124
    Dr. Trevor Williams

Advanced Weapons Survivability Directorate,
Lasers and Imaging Directorate, and
Space and Missile Technology Directorate

32  Second-Harmonic Generation in Corona-Poled Polymer Films
    210-11MG-007
    Dr. Gene Carlisle

33  Localization Limiters for the Microplane Concrete Material Model
    210-11MG-035
    Dr. William Cofer

34  No Report Submitted
    210-11MG-004
    Mr. Michael Houts (GSRP)

35  Misalignment Parameters of a Double Aperture Telescope Obtained as a Function
    of the Ratio of Optical Transfer Functions Without and With Diversity
    210-11MG-133
    Dr. Johanna Schruben

36  Applications of Compact Toroid Plasmas
    210-11MG-082
    Dr. Peter Walsh

Volume III
Wright Laboratory
Armament Directorate

37  Finite Element Analysis of Runway Failure Due to Blast Loading
    210-11MG-110
    Dr. Charles Camp

38  Analysis of the Penetration of Reinforced Concrete Slabs
    210-11MG-125
    Dr. Arnold Carden


39  Report Not Publishable at this Time
    210-11MG-036
    Dr. Charles Fosha

40  Methods which Accelerate Convergence in Iterative CFD Solvers
    210-11MG-074
    Dr. John George

41  Final Report for the Research Initiation Program
    210-11MG-083
    Mr. Randy Gove (GSRP)

42  Neural Networks for Guidance, Navigation, and Control of Exoatmospheric
    Interceptors
    210-11MG-046
    Dr. Kevin Moore

43  Fire Control System for a Laser Aimed Machine Gun
    210-11MG-075
    Dr. William Siuru

44  Robust Eigenstructure Assignment with Application to Missile Control
    210-11MG-026
    Dr. Kenneth Sobel

Aero Propulsion and Power Directorate

45  Evaluation of MOS-Controlled Thyristor (MCT) at 270 V DC for Static and
    Dynamic Loads
    210-11MG-086
    Dr. Muhammad Choudhry

46  Non Newtonian Effects of Powder Lubricant Slurries in Hydrostatic and
    Hydrodynamic Bearings
    210-11MG-014
    Dr. Donald Dareing

47  Investigation of the Combustion Characteristics of Confined Coannular
    Swirling Jets with a Sudden Expansion
    210-11MG-013
    Dr. Paul Hedman

48  Aircraft HVDC Power System - Stability Analysis
    210-11MG-015
    Dr. K. Sankara Rao

49  Dynamic Temperature Measurements in Reacting Flows
    210-11MG-001
    Dr. Larry Roe

50  No Report Submitted
    210-11MG-098
    Dr. Kaveh Tagavi

51  Droplet Distributions from the Breakup of a Cylindrical Liquid Jet
    210-11MG-033
    Dr. Richard Tankin

Avionics Directorate

52  Function Decomposition and Machine-Learning Systems
    210-11MG-041
    Dr. Michael Breen

53  A Formal Process Model for Software Re-engineering: The Analysis Phase
    210-11MG-068
    Mr. Eric Byrne (GSRP)

54  Probability Modeling in Automatic Target Recognition
    210-11MG-131
    Dr. R. H. Cofer

55  The Enhancement of Connectionist Methods for Recognizing Airplanes from
    Radar Returns
    210-11MG-006
    Dr. Lawrence Hall

56  Report Not Publishable at this Time
    210-11MG-135
    Dr. Kevin Kirby

57  Fiber Laser Preamplifier for Laser Radar Detectors
    210-11MG-134
    Dr. Richard Miers

Electronic Technology Laboratory

58  Computer Simulation of Small-Geometry High-Speed High-Density Integrated
    Circuit Performance Indicators
    210-11MG-003
    Dr. Ashok K. Goel

Flight Dynamics Directorate

59  Studies on the Improvement of the Accuracy of Acoustic Emission Source
    Location for Smart Structures Applications
    210-11MG-077
    Dr. Marvin Hamstad

60  H-Infinity Control Design--A New Approach
    210-11MG-017
    Dr. Chin Hsu

61  Stress Wave Propagation Through the Thickness of Graphite/Epoxy Laminated
    Plates using PVDF Sensors
    210-11MG-090
    Dr. David Hui

62  Accelerated Fatigue Test Procedure for the Structural Polycarbonate Component
    of the F-16 Canopy Composite Material
    210-11MG-005
    Dr. Yulian Kin

63  Fatigue Fracture Behavior of Cord-Reinforced Rubber Composites
    210-11MG-088
    Dr. Byung-Lip Lee

64  The Efficacy of Constrained-Layer Damping Treatment to Suppress Parametric and
    Autoparametric Resonances in Nonlinear and Internally Resonant Nonlinear
    Structures
    210-11MG-097
    Dr. Lawrence Zavodney

Materials Directorate

65  AM1 Molecular Orbital Calculations of the Conformational Properties of
    Odd-electron Rigid Rod Polymer Model Species
    210-11MG-002
    Dr. John Connolly

66  Structural Analysis of Potential NLO Chromophores
    210-11MG-095
    Dr. David Grossie

67  Eddy Current and Dielectric Spectroscopy Characterization of Metals and
    Ceramics - Application to NDE
    210-11MG-085
    Dr. Prasad Kadaba

68  Silicon/Tin Polymers for Enhanced Third Order Nonlinear Optical Properties
    210-11MG-028
    Dr. Joseph Lambert

69  Conformational Structure and Dynamics of Perfluoropolyalkylether Lubricants
    210-11MG-020
    Dr. Martin Schwartz

70  Modeling of the Formation of Macrosegregation During Casting Solidification
    210-11MG-057
    Dr. Hai-Lung Tsai


Volume IV

Aerospace Medicine Directorate

71  Empirical Prediction of Physiological Response to Prolonged Work in
    Encapsulating Protective Clothing
    210-11MG-048
    Dr. Phillip Bishop

72  Growth Dynamics of RAW 264.7 Mouse Macrophage Cells
    210-11MG-076
    Dr. Robert Blystone

73  Visual-Field Differences in Perception and Attention
    210-11MG-022
    Dr. Bruno Breitmeyer

74  Report Not Publishable at this Time
    210-11MG-029
    Dr. Vito DelVecchio

75  An Investigation of Dioxin Half-life Estimation in Veterans of Project
    Ranch Hand
    210-11MG-065
    Dr. Pushpa Gupta

76  Repeated Measures Design with Missing Observations
    210-11MG-058
    Dr. Ramesh Gupta

77  Pulmonary Effects of Hyperbaric Oxygenation
    210-11MG-008
    Dr. John Szarek


78  The Four-Artery Occlusion Model of Loss of Consciousness: Effects of Single
    and Multiple Episodes of Transient Cerebral Ischemia on Circle of Willis
    Blood Pressure, Neurochemistry and Behavior
    210-11MG-108
    Dr. Steven Waller

Crew Systems Directorate

79  Sensitivity of Metabolic, Respiratory, and Performance Measures to Cognitive
    Demands in Dual-Task Performance
    210-11MG-091
    Dr. Richard Backs

80  Analysis and Modeling of Pilot Tasks and Decisions for the Pilot's Associate
    Pilot-Vehicle Interface using Hypermedia and a Constraints Based Approach
    210-11MG-072
    Mr. John Duncan

81  Effects of Delays in Networked Flight Simulators
    210-11MG-114
    Dr. Martin Hagan

82  Speaker Normalization using Neural Networks
    210-11MG-019
    Dr. Ashok Krishnamurthy

83  Modelling the Effects of Additional Head Mass on Head/Neck Dynamics
    210-11MG-049
    Dr. Amit Patra

Human Resources Directorate

84  An Intelligent Tutoring System to Encourage Qualitative Reasoning in Basic
    Writing Students
    210-11MG-040
    Dr. Margaret Batschelet

85  Training a Complex Spatial Skill and Target Identification in both
    Single-Task and Dual-Task Modes
    210-11MG-016
    Dr. James Dykes

86  Computer Mediated Intellectual Teamwork
    210-11MG-044
    Dr. Verlin Hinsz

87  Job Survey Research Productivity Tool
    210-11MG-051
    Dr. DeLayne Hudspeth


88  Visually Significant Intraocular Auto Fluorescence
    210-11MG-119
    Dr. Gillray Kandel

89  Development of Key Variables for Multiship Simulation Benefit/Cost Analysis
    210-11MG-100
    Dr. William Moor

90  Shared Cognitive Representations of Teams
    210-11MG-090
    Dr. Joan Rentsch

91  Using Survival Analysis to Help Make Training Decisions
    210-11MG-009
    Dr. Stanley Stephenson

Occupational and Environmental Health Directorate

92  Mathematical Modeling and Decision-Making for Air Force Contaminant
    Migration Problems
    210-11MG-084
    Dr. Miguel Medina


FINITE  ELEMENT  ANALYSIS  OF  RUNWAY 
FAILURE  DUE  TO  BLAST  LOADING 


Charles  V.  Camp 
Principal  Investigator 

MEMPHIS  STATE  UNIVERSITY 
Herff  College  of  Engineering 
Department  of  Civil  Engineering 
Memphis,  TN  38152 


December  20, 1991 


Introduction 


Accurate analysis and prediction of damage to runway pavements due to blast loadings
from buried explosives is a complicated and difficult problem. A reliable model
capable of accurately evaluating runway damage has to account for three distinct but
interacting phenomena. First, during detonation of the explosive, suitable estimates of
physical characteristics of the blast, such as the magnitude, the velocity, and the shape
of the initial pressure wave, need to be determined. In the second phase of the model,
the characteristics of the pressure wave are applied to the underlying pavement subgrade.
As the blast wave spreads through the soil, the subgrade transmits a pressure to
the underside of the runway pavement. In the final phase of the model, the transmitted
pressure is applied as a dynamic distributed load on the concrete pavement. The structural
response of the pavement is the most definitive mechanism in damage predictions.

The blast pressure loading is of a highly transient impulsive nature and produces a
complex dynamic response from the concrete pavement. A level of damage for the runway
system may be assessed from results of a nonlinear dynamic structural analysis of
the concrete pavement. Although not listed as one of the three mechanisms in the damage
model, the dynamic interaction between the soil subgrade and the structure is a
critical component in the dynamic response of the pavement and probably the least
understood.

Many researchers have developed simple single-degree-of-freedom (SDOF) and
multiple-degree-of-freedom (MDOF) models for predicting the response of reinforced
concrete structures to dynamic loads. Application of such models for predicting the
damage of unreinforced concrete runways due to blast effects has had limited success. In
most cases, the approximations used in formulating the simple model contribute to
inaccurate results and an unrealistic structural response.
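As an illustration of the kind of simplified model discussed above, the following is a minimal sketch of an SDOF system driven by an impulsive load. It is not taken from this report: the triangular pulse shape, the Newmark average-acceleration integrator, the function names, and all numerical values (non-dimensional placeholders) are assumptions chosen only to show how little of the physics such a model carries.

```python
import math


def triangular_pulse(t, peak, duration):
    """Illustrative load: jumps to `peak` at t = 0, decays linearly to zero."""
    if 0.0 <= t < duration:
        return peak * (1.0 - t / duration)
    return 0.0


def sdof_response(mass, stiffness, damping_ratio, load, dt, n_steps):
    """Integrate m*u'' + c*u' + k*u = p(t) from rest.

    Uses the Newmark average-acceleration scheme (beta = 1/4, gamma = 1/2).
    Returns the list of displacements at t = 0, dt, ..., n_steps*dt.
    """
    omega = math.sqrt(stiffness / mass)          # natural circular frequency
    c = 2.0 * damping_ratio * mass * omega       # viscous damping coefficient
    beta, gamma = 0.25, 0.5

    u, v = 0.0, 0.0
    a = (load(0.0) - c * v - stiffness * u) / mass
    k_eff = stiffness + gamma / (beta * dt) * c + mass / (beta * dt ** 2)

    history = [u]
    for i in range(1, n_steps + 1):
        t = i * dt
        p_eff = (
            load(t)
            + (mass / (beta * dt ** 2) + gamma / (beta * dt) * c) * u
            + (mass / (beta * dt) + (gamma / beta - 1.0) * c) * v
            + ((1.0 / (2.0 * beta) - 1.0) * mass
               + dt * (gamma / (2.0 * beta) - 1.0) * c) * a
        )
        u_new = p_eff / k_eff
        v_new = (gamma / (beta * dt) * (u_new - u)
                 + (1.0 - gamma / beta) * v
                 + dt * (1.0 - gamma / (2.0 * beta)) * a)
        a_new = ((u_new - u) / (beta * dt ** 2)
                 - v / (beta * dt)
                 - (1.0 / (2.0 * beta) - 1.0) * a)
        u, v, a = u_new, v_new, a_new
        history.append(u)
    return history


# Placeholder, non-dimensional values (assumptions, not pavement data).
pulse = lambda t: triangular_pulse(t, peak=100.0, duration=0.01)
displacements = sdof_response(mass=1.0, stiffness=1.0, damping_ratio=0.0,
                              load=pulse, dt=0.001, n_steps=3000)
print("peak displacement:", max(displacements))
```

A single mass, spring, and damper cannot represent cracking, stress wave propagation, or the soil/structure interaction, which is consistent with the limited success reported for such models.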

To develop a more reasonable model for the behavior of a runway subjected to a blast
loading, the basic equations of motion for a continuum should be considered. Such an
approach provides a fundamental characterization of the physical nature of the structure
and how it responds to external loads. Constitutive relationships that describe
the response of a material for a range of physical effects are coupled with the equations
of motion to form a complete response mechanism.
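For reference, a standard textbook form of these relations is sketched below. The notation is an assumption introduced here and not necessarily that of the report: $u_i$ is displacement, $\sigma_{ij}$ stress, $\varepsilon_{kl}$ small strain, $\rho$ mass density, $b_i$ body force per unit volume, and $C_{ijkl}$ a stiffness tensor. The linear elastic law is shown only as the simplest example of a constitutive relationship; concrete under blast loading would require a nonlinear one.

```latex
% Balance of linear momentum for a continuum (indicial notation, summation implied)
\rho\,\ddot{u}_i = \sigma_{ij,j} + b_i

% Simplest constitutive relationship: linear elasticity with small strains
\sigma_{ij} = C_{ijkl}\,\varepsilon_{kl},
\qquad
\varepsilon_{kl} = \tfrac{1}{2}\left(u_{k,l} + u_{l,k}\right)
```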

The resulting formulation may be very complex and generally has few direct solutions.
However, during the past few decades, researchers and engineers have used powerful
numerical tools to solve these equations for a wide variety of problems in virtually all
fields of science. The most robust of these numerical devices is the finite element
method (FEM).
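In general terms, a finite element discretization in space reduces the continuum problem to a system of ordinary differential equations in time. The generic semi-discrete form is sketched below; the symbols are assumptions introduced here for illustration ($\mathbf{d}$ is the vector of nodal displacements, $\mathbf{M}$ and $\mathbf{C}$ are the mass and damping matrices, and $\mathbf{f}_{\mathrm{int}}$, $\mathbf{f}_{\mathrm{ext}}$ are the internal and external nodal force vectors).

```latex
% Generic semi-discrete equations of motion after finite element discretization
\mathbf{M}\,\ddot{\mathbf{d}} + \mathbf{C}\,\dot{\mathbf{d}}
  + \mathbf{f}_{\mathrm{int}}(\mathbf{d}) = \mathbf{f}_{\mathrm{ext}}(t)

% Special case of a linear elastic material with stiffness matrix K
\mathbf{f}_{\mathrm{int}}(\mathbf{d}) = \mathbf{K}\,\mathbf{d}
```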
In this study, the primary objective is to review available FEM programs in an attempt
to identify systems which are most appropriate for modelling the dynamic response of
concrete runways subjected to blast loading.

Research  Methodology 

The objectives of this project are twofold: first, to investigate the applicability of FEM
systems to predict the damage of runway pavements due to buried explosives, and second,
to recommend areas where future research will have the most significant impact.
Upon completion of the review, a selected FEM program determined to be appropriate
for analysis of runway pavements subjected to dynamic loads is obtained and used to
solve a simplified concrete pavement problem. From this analysis some measure of the
operational performance may be ascertained. The project is divided into two phases:

Phase  One 

A survey of available FEM analysis programs is conducted to identify formulations
which are capable of modelling problems similar to the runway damage problem. Information
is gathered on each analysis package from the primary vendor. In addition
to vendor-supplied information, several experts in the fields of FEM modelling and the
behavior of concrete materials are consulted.

The authors and/or distributors of the analysis software provided basic information on
the fundamental science incorporated in their product: for example, the level of approximation
of the governing equations and the type of material constitutive relationships
used in the formulation. In particular, each analysis program is examined to determine
how the dynamic response and failure of concrete are modelled.

Another important criterion used for this evaluation is the accessibility of the basic code to
modification. Such a feature is advantageous and efficient for developing advanced
material models. In runway damage predictions, the ability to incorporate and investigate
different types of formulations to model the soil/structure interaction problem is
an important consideration. Since reliable runway damage predictions must accurately
model complex impulsive elasto-plastic material responses, fracture mechanics, and
stress wave propagation, insight into how these mechanisms are treated is very important.

The ADINA System - Based on information supplied by the vendor and conversations
with Dr. J. Tedesco, Professor of Civil Engineering at Auburn University, Auburn, Alabama,
Dr. J. Walczak of ADINA R&D, Inc., and Dr. K. Bathe, Professor of Mechanical
Engineering at the Massachusetts Institute of Technology, Cambridge, Massachusetts,
the ADINA computer program system is selected as the most appropriate finite element
analysis tool for preliminary runway damage predictions. The ADINA system
has one of the most sophisticated concrete material models available and has the ability
to analyze high-strain-rate phenomena. Another important feature of the ADINA program
is the capability to incorporate user-supplied loading and material modules into
the general solution procedure. The ability to develop supplemental or new material
models and incorporate these modules into the ADINA system is a critical feature for
advanced studies of the runway damage problem.

ADINA  Concrete  Model  -  The  general  ADINA  concrete  material  model  may  be  used 
in  both  two-  and  three-dimensional  analysis  of  structural  systems  characterized  by 
large  displacements  and  rotations.  The  basic  concrete  behavior  model  is  described  by 
several  characteristics:  a  nonlinear  stress-strain  relationship,  tension  and  compression 
failure envelopes, and a model to control post-cracking and crushing of concrete.

In the ADINA concrete model, the general multiaxial stress-strain relationship is developed
from a uniaxial stress-strain behavior. A typical uniaxial stress-strain relationship
may be defined in terms of three strain phases: strain at ultimate tension cutoff,
minimum crushing strain, and ultimate compressive strain. In a multiaxial stress condition,
the stress-strain relationship is different depending on whether the material is
being loaded or unloaded. In the complex material failure model for compressive failure,
the form of the failure envelope is very sensitive to many user-defined parameters.
In most cases, the failure model parameters must be obtained from experimental data
for the material and type of structure under consideration.

ADINA Distributed Loading - The ADINA system has modules to represent loading
for concentrated, pressure, centrifugal and mass proportional, and user-supplied loading
situations. The wide range of available loading configurations allows flexibility in
selecting the most accurate way to represent the blast load on the concrete runway. The
three-dimensional and the user-defined loading modules are examined in the second
phase of this study. In large displacement analysis, the pressure loading may be a
function of the deformation of the structure. From observations, it is apparent that
at some time after detonation, the runway slab separates from the underlying subgrade.
At this point, the pressure loading is suspended. In addition, some form of centrifugal
or mass proportional loading arising from the acceleration of the slab may be
important in the analysis of the runway damage after the structure has lost contact with
the subgrade.

Phase  Two 

The second phase of this project will be completed in the summer of 1992. There are
several reasons for requesting a six-month extension of the project. First and foremost
was the difficulty Memphis State University and ADINA R&D, Inc. had in negotiating
a special license of the ADINA system for university and educational use. After several
months of negotiation, a settlement was reached and the ADINA software was delivered
in September 1991. However, due to technical problems with the SUN SPARCstation version of
ADINA, final installation of the system was not completed until early December 1991.
Therefore, phase two of the project will begin in early January 1992.

To establish some measure of the operational character of the ADINA system, a simplified
test problem will be analyzed. To maintain correlation with the actual runway
problem, the dynamic response of a simply supported circular concrete plate subjected
to a step concentrated load will be modelled in several cases. In the first
case, the value of the step load will be less than the static load that would cause the
plate to fail. In successive cases, the value of the step load will be increased until the
concrete plate fails. This series of test problems will provide common benchmarks for
the evaluation of the performance and the operational properties of the ADINA system.
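
One reason a step load below the static failure load is a meaningful first case can be seen in a much simpler idealization. The sketch below is not the ADINA plate model: it assumes a single-degree-of-freedom, undamped, linear elastic system and integrates it with the Newmark average-acceleration scheme. The function name and the numerical values are illustrative assumptions. For such a system a suddenly applied load produces a peak displacement of about twice the static displacement.

```python
import math


def newmark_step_response(m, k, p0, dt, n_steps, beta=0.25, gamma=0.5):
    """Displacement history of an undamped SDOF system m*a + k*u = p0 (t >= 0),
    starting from rest, using the Newmark method (hypothetical helper)."""
    u, v = 0.0, 0.0
    a = (p0 - k * u) / m                      # initial acceleration from equilibrium
    k_eff = k + m / (beta * dt * dt)          # effective stiffness (no damping)
    history = [u]
    for _ in range(n_steps):
        # Effective load built from the known state at the start of the step
        p_eff = p0 + m * (u / (beta * dt * dt) + v / (beta * dt)
                          + (0.5 / beta - 1.0) * a)
        u_new = p_eff / k_eff
        a_new = ((u_new - u) / (beta * dt * dt) - v / (beta * dt)
                 - (0.5 / beta - 1.0) * a)
        v = v + dt * ((1.0 - gamma) * a + gamma * a_new)
        u, a = u_new, a_new
        history.append(u)
    return history


# Illustrative values only: unit mass, natural period of 1 s, unit step load
mass = 1.0
stiffness = (2.0 * math.pi) ** 2
load = 1.0
response = newmark_step_response(mass, stiffness, load, dt=0.005, n_steps=400)
static_displacement = load / stiffness
print("peak dynamic / static displacement:", max(response) / static_displacement)
```

Under these assumptions, a step load larger than roughly half the static capacity could already overstress a linear system, which is why a sequence of increasing step loads is a natural benchmark.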

An important aspect of the operational evaluation is the practicality of the pre- and
post-processing graphical user interfaces. Well designed and written pre- and post-processing
routines can greatly enhance the utility of large, powerful numerical models. The
ADINA system pre- and post-processing routines will be assessed on their level of
graphical and functional sophistication. As a primitive measure of the program's portability,
the minimum computer configuration capable of executing an analysis using
the ADINA system will be listed.

Anticipated  Outcomes  -  Future  Developments 

The  results  of  this  study  will  provide  a  source  of  important  information  on  dynamic 
modelling  of  concrete  pavements  and  directly  influence  the  direction  of  future  research 
efforts  in  predicting  runway  damage.  Accurate  assessment  of  the  structural  response 
of  concrete  pavements  will  provide  an  important  tool  for  researchers  and  developers 
interested  in  evaluating  the  performance  of  runway  designs. 

Time  Schedule  (revised,  12/91) 


[Schedule chart: timeline marked at January, March, June, and September 1991 and at
January, March, and June 1992.]

Research Background


This project is coordinated by the Herff College of Engineering at Memphis State University,
with the faculty of the Department of Civil Engineering conducting the investigation.
The principal investigator for this project is Dr. Charles V. Camp, Assistant Professor
of Civil Engineering. Professor Camp was a 1990 USAF Summer Faculty Research
Fellow at Eglin AFB where he was involved in the analysis of runway damage
using a simple model approach.

The  Department  of  Civil  Engineering  is  in  the  process  of  building  a  powerful  network 
of  workstation  computers  for  the  Structural  Mechanics  section.  This  facility  is  devoted 
to  the  analysis  of  structural  systems  and  computational  mechanics.  Currently  the 
structural mechanics computer laboratory consists of an upgraded SUN IPC workstation,
a SUN SPARCstation 1+, and a SUN SPARCstation 2.

The research experience and capabilities of the MSU Civil Engineering faculty, combined
with the contributions of graduate and undergraduate students, are sufficient to
initiate and complete the proposed research within the project time period. The staff
and resources of the MSU Civil Engineering Department will provide additional support
in preparing the intermediate reports and the final project documents.

ABSTRACT


ANALYSIS  OF  THE  PENETRATION  OF  REINFORCED  CONCRETE  SLABS 


The work performed under this grant was divided into three sections:
a literature survey, a study of the static and dynamic properties of
concrete, and the use of the hydrocode HULL to perform a computer
analysis of the penetration of reinforced concrete.

The literature survey consisted of three months' work by Mr. Wanstall
searching the Government Documents section of our library.
Pertinent  references  are  included  with  abstracts  in  the  Final 
Report.  There  is  some  good  work  ongoing  at  several  research 
institutions  and  there  is  need  for  additional  work. 

Mr.  J.  G.  Lee  has  selected  as  his  dissertation  research  topic  the 
modification  of  the  dynamic  properties  of  concrete  after  exposure  to 
moderate  hydrostatic  compression.  An  abstract  has  been  submitted  to 
the  1992  HVIS  Symposium  for  consideration  of  the  inclusion  of  a 
paper  in  the  conference.  Mr.  Lee  has  developed  a  method  for 
exposing  concrete  cylinders  to  about  8  kb  hydrostatic  pressure  and 
then  measuring  the  passage  of  an  elastic  wave  through  the  material. 
The modifications in acoustic impedance and wave speed are then
measured.

We have modeled the reinforced concrete slabs in a 2D Eulerian Hull
analysis.  In  order  to  model  the  reinforcement,  we  have  placed  an 
equivalent  amount  of  steel  around  the  concrete  slab  to  provide 
confinement.  We  ran  one  case  with  steel  layers  on  entrance  and  exit 
side.  During  penetration  the  exit  side  steel  detached  from  the 
concrete.  We  then  modeled  the  reinforcement  as  a  continuous  shell 
around  the  slab,  48  cm  thick  and  98  cm  in  outer  radius.  This  solved 
the  detachment  problem.  We  also  provided  a  hole  in  the  entrance 
side  steel  so  that  no  work  is  done  on  initial  penetration  of  that 
layer.  We  left  no  radial  gap  between  the  penetrator  and  entrance 
layer  steel. 

We  do  not  obtain  results  in  our  computer  analysis  that  match  what  we 
were  shown  on  the  test  range  at  Eglin  AFB.  We  do  not  see  in  the 
computer  analysis  significant  bulging  of  the  entrance  side  steel 
layer.  The  exit  side  steel  layer  does  bulge  but  the  geometries  do 
not  match  the  field  data.  We  followed  the  event  all  the  way  to 
failure  of  the  steel  exit  layer.  Nor  do  we  see  the  pressure  pulse 
in  front  of  the  penetrator  rise  after  the  initial  impact  pressure 
begins to decay. Our analyses show a persistent, monotonic drop in
pressure  that  goes  to  zero.  Velocities  of  300  and  400  m/s  have  been 
tried  and  time  periods  of  up  to  5000  microsec  have  been  followed. 
Furthermore,  the  failure  model  of  concrete  that  we  have  used  does 
not  show  the  fracture  locations  that  we  have  found  documented  in  the 
literature.  This  work  points  the  direction  to  further  research. 

FINAL REPORT


ANALYSIS  OF  THE  PENETRATION  OF  REINFORCED  SLABS 


INTRODUCTION 

This is a final report showing the results of our work on this
project. Professor Carden and Mr. Wanstall spent the summer of 1990 at
Eglin AFB performing work on this same topic. Much of the summer
was spent learning a new computer system, the Hull hydrocode, and
beginning to learn how to model the problem in 2D Eulerian format.
We were taken to a test range and shown a large number of reinforced
concrete slabs that had been penetrated by fairly low velocity
projectiles.

The steel reinforcing mesh of the failed slabs seen on the range was
bowed out on both entrance and exit sides. It was our understanding
of  the  problem  that  at  low  impact  velocities,  the  shock  pressures 
would  be  low,  but  the  acoustic  wave  reflections  combined  with  the 
volume  change  in  the  concrete  (due  to  the  displacement  of  the 
concrete  by  the  presence  of  a  solid  rod)  accounted  for  the  build  up 
of  pressure  in  the  bulk  of  the  concrete  in  the  vicinity  of  the 
penetrator.  This  pressure  exerted  itself  against  the  steel 
reinforcement  and  pushed  the  reinforcement  outward,  both  on  the 
entrance  and  exit  sides.  The  reinforcement  was  bent  outward  like  a 
membrane under internal pressure. This led to an estimate of the
pressure.  But  the  build  up  of  pressure  in  the  concrete  also  builds 
up  pressure  on  the  front  of  the  penetrator.  At  a  velocity  of  300 
m/s,  the  time  for  penetration  of  a  one  meter  thick  slab  is  of  the 
order  of  5  to  15  ms.  It  is  this  build  up  of  pressure  that  extracts 
momentum  from  the  penetrator.  Without  the  steel  reinforcement,  such 
a  build  up  of  pressure  would  not  be  possible  since  the  pressure  on 
the  free  surfaces  of  the  concrete  is  essentially  zero.  We  see  the 
effect  of  the  reinforcing  rods  (if  spaced  sufficiently  close 
together  in  a  3D  mesh)  as  a  confinement  for  the  concrete.  Such 
mesh,  for  maximum  benefit,  must  be  on  both  entrance  and  exit  sides 
of  the  slab.  Consequently,  long  run  times  for  the  hydrocode  are 
necessary  to  observe  the  penetration  event.  Long  run  times  and 
small  cell  size  require  large  computing  facilities  and  patience. 
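
The penetration time quoted above can be checked with simple kinematics. An undecelerated 300 m/s projectile would cross a 1 m slab in about 3.3 ms, so times of 5 to 15 ms imply substantial deceleration. The sketch below assumes constant deceleration between an entry and an exit velocity, which is an illustrative simplification and not the hydrocode result; the function name and the exit velocities are hypothetical.

```python
def transit_time(thickness_m, v_entry, v_exit):
    """Time (s) to cross a slab of the given thickness, assuming constant
    deceleration from v_entry to v_exit (both in m/s)."""
    if v_entry <= 0.0 or v_exit < 0.0 or v_exit > v_entry:
        raise ValueError("require v_entry > 0 and 0 <= v_exit <= v_entry")
    # With constant deceleration the mean velocity is the average of the two
    return 2.0 * thickness_m / (v_entry + v_exit)


# Illustrative exit velocities for a 1 m slab struck at 300 m/s
for v_out in (300.0, 150.0, 0.0):
    t_ms = 1000.0 * transit_time(1.0, 300.0, v_out)
    print(f"exit velocity {v_out:5.1f} m/s -> transit time {t_ms:.2f} ms")
```

Even a projectile that just comes to rest at the exit face under constant deceleration needs under 7 ms, so the upper end of the quoted range corresponds to a mean velocity of roughly 67 m/s, that is, to deceleration concentrated early in the event.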

We have performed these hydrocode analyses on an IBM RS 6000 with 32
MB of core memory and two hard disks, one of 360 MB and another of
1.2 GB. We have used OTIHULL version 4 for the analysis.

Object and Scope. The purpose of this work was to perform hydrocode
analyses of thick reinforced concrete slabs impacted by low velocity
projectiles. In the overall plan, the literature on concrete dynamic
and static mechanical properties and techniques for measuring these
properties was reviewed. This literature survey suggested that we
needed to perform some dynamic property tests, and we have built and
calibrated a facility to do that.

LITERATURE SURVEY.


The results of the survey of the NTIS unclassified data base of
concrete properties and test methods are included in an appendix. We
found a number of articles of which we were unaware and a few that
we deem significant. The work by Rajendran (Dayton Univ.) and
Nicolas (WPAFB) has some interesting innovations for formulating
dynamic constitutive equations. The work by Malvern shows results
from large split Hopkinson pressure bar (SHPB) experiments on large
samples of concrete. From these SHPB tests, stress-strain functions
for plane strain compression can be obtained at strain rates of up
to about 2000 per sec. In general, the SHPB does not give
information  concerning  fracture  and  the  results  are  limited  to 
strains  of  up  to  about  10  percent.  No  information  concerning 
failure  under  shearing  stresses  (Mode  II,  III  fracture)  was  found. 
Little  information  concerning  the  heterogeneity  of  concrete  and  its 
influence  on  wave  propagation  and  attenuation  was  found.  It  is 
known  that  ordinary  concrete  has  a  significant  pore  volume. 
Pressures  above  some  threshold  will  begin  to  close  these  pores  and 
raise  the  density.  No  references  could  be  found  that  elucidate  this 
fact  and  its  significance  to  dynamic  loading.  The  only  published 
Hugoniot  on  concrete  was  that  produced  at  General  Motors  Research 
Institute  several  decades  ago. 


DEVELOPMENT  OF  A  DYNAMIC  TEST  METHOD  FOR  CONCRETE 

We  have  seen  the  need  to  measure  and  record  the  dynamic  properties 
of  concrete  for  elastic  waves.  If  one  models  the  low  velocity 
impact  of  thick  reinforced  concrete  slabs,  one  observes  that  the 
elastic  wave  reflects  between  the  reinforcement  planes  about  twenty 
times.  While  spherical  expansion  attenuates  the  particle  velocity 
and  pressure,  the  reflection  from  a  high  acoustic  impedance  surface 
(the  steel  reinforcement  plane)  causes  an  increase  in  the  pressure. 
Furthermore,  the  displacement  of  the  concrete  caused  by  the 
insertion  of  the  projectile  causes  an  increase  in  pressure.  For 
unconfined  concrete,  modeled  as  a  cylinder  under  pressure,  the 
maximum  pressure  is  governed  by  the  tensile  strength  (static 
pressure)  or  the  maximum  shearing  strength  (for  dynamic  loading). 
The  equation  of  motion  is: 

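$$\frac{d\sigma_r}{dr} = \frac{\sigma_\theta - \sigma_r}{r} + \rho\,\ddot{r}$$

where $\sigma_r$ is the radial stress, $\sigma_\theta$ is the hoop stress, and $\rho\,\ddot{r}$ is the
acceleration term. The $(\sigma_\theta - \sigma_r)$ term is twice the maximum shearing stress for the state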
of  stress.  For  dynamic  loading  of  a  cylinder,  with  significant 
acceleration  terms,  both  stresses  would  be  negative  on  the  inside 
surface  of  a  cylinder.  The  hoop  stress  will  be  tensile  on  the 
outside  surface.  This  was  pointed  out  by  G.  W.  Taylor  in  an  article 
on the fracturing of bombs. Twice tau max is the difference between
the  two  principal  stresses.  For  the  metals,  fracture  begins  on  the 
outside  surface.  We  address  this  model  only  to  point  out  that  there 
is  a  limit  to  the  pressure  gradient  in  concrete  determined  by  the 
geometry,  the  boundary  conditions,  the  acceleration  terms  and  the 
strength. For low velocity impact of unreinforced concrete, this
pressure gradient is quite low. But for reinforced concrete, the
boundary value of
pressure  is  whatever  is  limited  by  the  confinement  properties  of  the 
reinforcement.  Therefore,  the  reinforcement  allows  a  much  greater 
pressure  region  in  front  of  the  penetrator  after  some  initial  period 
of  time  to  allow  this  pressure  to  build  up.  The  time  depends  upon 
the  dimensions  of  the  slab,  the  size  of  the  penetrator,  and  the 
geometry  of  the  reinforcement. 

We determined that a clearer understanding was needed of the
elastic propagation (and reflection) of impact waves in concrete,
especially as affected by the changes wrought in the concrete by the
pressure excursions. We have therefore modified a drop tower so that
a 2 in. diameter concrete cylinder mounted on a steel load bar is
impacted from the top by a 2 in. diameter steel bar at velocities
of up to 10 m/s. Because of the impedance difference between steel and concrete,
the  wave  passing  through  the  concrete  reflects  at  the  next 
concrete-steel  interface.  It  reflects  back  and  forth  until  the 
transmitted  wave  in  the  load  bar  reaches  a  stabilized  value.  The 
intensity  of  the  first  reflection  is  a  measure  of  the  impedance 
mismatch,  and  the  time  period  between  the  reflections  is  a  measure 
of  the  propagation  velocity.  Prior  exposure  to  high  hydrostatic 
pressure  causes  a  permanent  change  in  density.  The  compressive 
elastic  modulus  also  increases  due  to  the  increase  in  density.  If 
these  do  not  change  proportionally,  there  is  a  change  in  the 
propagation  velocity,  and  there  will  be  an  increase  in  acoustic 
impedance. A dissertation by Mr. J. G. Lee is to be published which
will show these methods and data. We show in the Appendix some of the
data taken for a steel-on-steel impact where the steel specimen
has an area mismatch, not an impedance mismatch.
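
The quantities measured in this test follow from one-dimensional elastic wave relations, sketched below. This is a minimal illustration assuming thin-bar theory, equal cross-sectional areas, and typical handbook-style material values that are assumptions and not data from this project. The function names are hypothetical.

```python
import math


def bar_wave_speed(modulus_pa, density_kg_m3):
    """Elastic bar wave speed c = sqrt(E / rho), in m/s."""
    return math.sqrt(modulus_pa / density_kg_m3)


def acoustic_impedance(modulus_pa, density_kg_m3):
    """Acoustic impedance Z = rho * c = sqrt(E * rho), in Pa*s/m."""
    return math.sqrt(modulus_pa * density_kg_m3)


def stress_reflection(z_incident, z_next):
    """Reflected / incident stress amplitude at an interface (equal areas)."""
    return (z_next - z_incident) / (z_next + z_incident)


def stress_transmission(z_incident, z_next):
    """Transmitted / incident stress amplitude at an interface (equal areas)."""
    return 2.0 * z_next / (z_next + z_incident)


def wave_speed_from_reflections(specimen_length_m, time_between_reflections_s):
    """Wave speed inferred from the round-trip time across the specimen."""
    return 2.0 * specimen_length_m / time_between_reflections_s


# Assumed illustrative properties (not measured values from this work)
E_STEEL, RHO_STEEL = 200e9, 7850.0
E_CONCRETE, RHO_CONCRETE = 30e9, 2400.0

z_steel = acoustic_impedance(E_STEEL, RHO_STEEL)
z_concrete = acoustic_impedance(E_CONCRETE, RHO_CONCRETE)
print("concrete -> steel reflection:", stress_reflection(z_concrete, z_steel))
print("concrete wave speed (m/s):", bar_wave_speed(E_CONCRETE, RHO_CONCRETE))
```

In these relations, a wave travelling from concrete into the stiffer steel reflects with the same sign, so compression is reinforced. If modulus and density rise by the same factor, the wave speed is unchanged while the impedance rises by that factor, which is the proportionality case mentioned in the text.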


HYDROCODE  ANALYSIS  OF  THE  IMPACT  OF  REINFORCED  CONCRETE 

We have modeled the concrete slab as a 2D Eulerian cylinder, 48 cm
thick,  96  cm  in  radius.  The  penetrator  was  modeled  as  a  rigid 
island  4  cm  in  radius  and  50  cm  long.  At  Eglin  AFB  we  ran  several 
analyses  out  to  250  microsec.  We  never  achieved  more  than  about  20 
percent  penetration.  It  became  apparent  that  we  should  continue  the 
problem until complete penetration was achieved. Furthermore, we
could not model a 3D reinforcement mesh in a 2D calculation. We
therefore calculated the "equivalent" thickness of steel represented
as a continuous sheet. At first the steel was only on the exit
side.  But  this  sheet  separated  from  the  concrete  when  the  reflected 
stress  in  the  concrete  was  tensile.  We  solved  this  by  making  the 
steel  continuous  around  the  whole  concrete  slab.  Next  we  opened  a 
hole  in  the  entrance  side  steel  sheet  so  that  the  penetration  of  the 
entrance  side  steel  sheet  did  not  influence  the  problem 
significantly.  We  have  several  significant  observations. 

1. The mesh size affects the results. We tried a 0.1 cm cell size,
but that gives us about 1E6 cells and our machine will not run
the problem in core. We later tried mixed cells, but at the
interfaces where the cell sizes change (0.2 to 0.3 to 0.4 cm) we see
discontinuities in pressure and density.

2. The station data show a monotonic decrease in pressure after the
initial pressure pulse (after about 30 microsec). Even
though the penetrator bulges the steel and produces a fracture (at
about 5000 microsec), the pressure in front of the penetrator does
not rise. We are not sure why this is so.

3. The steel sheet on the entrance side shows a very small amount
of "bulging". We believe that the displaced volume of the concrete
caused by the insertion of an 8 cm dia x 50 cm long penetrator
should cause the pressures to rise, just as we believe that the
pressure should rise when the penetrator approaches the exit side
steel sheet. Our output shows the pressures are inordinately low.

4.  Our  current  material  library  on  concrete  does  not  contain  a 
fracture model and the constitutive equation does not give realistic
results.

5. The steel sheet on the exit side shows bulging and fracture but
the  region  of  distortion  is  limited  to  about  3  diameters  of  the 
penetrator.  In  the  field  observations  of  failed  slabs,  the 
distortion  of  the  reinforcing  steel  extended  at  least  ten  diameters 
of  the  penetrator. 

6.  In  our  computer  analyses  the  concrete  flows  past  the  penetrator 
and  does  not  scrape  or  join  the  lateral  surfaces.  This  is 
consistent  with  the  observation  of  recovered  penetrators. 

We include some of the density contours and station data for several
of our runs.
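
The sensitivity of problem size to cell size noted in item 1 can be illustrated with a simple count for a uniform 2D axisymmetric grid. The sketch below covers only the concrete slab (96 cm radius, 48 cm thick); the full computational grid, which must also hold the penetrator, the steel, and surrounding space, is not specified here, so the total cell count of a real run would be larger. The function name is hypothetical.

```python
def cell_count(radius_cm, thickness_cm, cell_cm):
    """Number of square cells of side cell_cm covering a 2D axisymmetric
    region of the given radius and thickness (uniform grid assumed)."""
    radial_cells = round(radius_cm / cell_cm)
    axial_cells = round(thickness_cm / cell_cm)
    return radial_cells * axial_cells


# Slab-only counts for the uniform cell sizes mentioned in the text
for size in (0.1, 0.2, 0.4):
    print(f"cell size {size} cm -> {cell_count(96.0, 48.0, size):,} slab cells")
```

Halving the cell size quadruples the number of cells in two dimensions, and because the stable time step of an explicit calculation also shrinks with the cell size, the run time grows faster still.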


SUGGESTIONS  FOR  FUTURE  WORK 

The  results  of  this  work  suggest  that  more  work  needs  to  be 
performed  to  find  an  analysis  that  conforms  to  the  observations  of 
field  tests.  We  suggest  the  following: 

All of our runs have been with an Eulerian mesh. We have not
evaluated the differences that would result from using a Lagrangian
mesh. We know that cell size has a significant influence on the
results. We suggest a 0.1 cm cell size and an out-of-core
calculation. This will greatly increase the time for the
calculation. Furthermore, some defaults in the Hull main program
have to be changed to get the program to continue after long run
times.

We  have  obtained  another  material  library  for  concrete.  We  suggest 
that  the  results  of  two  identical  runs  be  compared  with  these  two 
sets  of  materials  data. 

We also suggest that the Hull program be used to evaluate low
velocity impact of steel-concrete-steel cylinders such as we are
currently using in our experimental work. It is not known whether
any hydrocode will properly treat elastic impact.

We  suggest  that  a  dynamic  constitutive  equation  be  developed.  This 
function  must  be  history  dependent.  Increases  in  density  are  partly 
recoverable  and  partly  permanent.  Evidently  the  work  at  the 
University  of  Dayton  is  addressing  some  of  these  questions. 

ABSTRACT

Several solution methods have been investigated which yield an
exact solution (to machine accuracy) of the backward Euler
implicit solution method of the Navier-Stokes equations. These
methods include the generalized minimum residual (GMRES) [11, 19],
the conjugate gradient (CG) [6], and the conjugate gradient squared
(CGS) [14] methods. These methods have been coded and applied first
to a prototypical model and then to many versions of
actual flow code. A version of the CGS method was found to be the
most robust and to give the best convergence properties on the
computational fluid dynamics (CFD) class of problems. The method
was applied to Whitfield's numerically computed Jacobian, and
compared well with conventional methods. We have applied the
approximate derivative concept to the CGS solution method, and
after the solution is well formed (after the time marched solution
is very close to a steady state), the solution works well. There
are problems in obtaining quadratic convergence, the expected
convergence for Newton's method. During the early unformed
solution, the Newton iteration does not converge with a full Newton
step. This means relaxation parameters which limit the step size
must be determined to optimize the convergence in this early phase.


If  proper  preconditioning  were  obtained,  the  convergence  in  the 
early  steps  should  improve. 


I.  INTRODUCTION: 


The numerical solution of the Navier-Stokes
equations over complex multiple bodies [1,3,9] is usually based on
a backward Euler implicit solution method that proceeds in time
(or pseudo-time) to a steady or unsteady state solution [15]. A
final optimum solution method does not currently exist, and some
user input will always be needed to decide the combination of 'best' algorithms
for a particular configuration and boundary conditions. Often, a
steady state solution is sought of the motion around the particular
configuration. In this process, it is not so important to obtain a
time accurate solution. The numerical solution method involves
several complex techniques which will now be described.

1. A transformation from physical space to (rectangular) computational
space is introduced. This allows proper application of the correct
boundary conditions to the body.

2. Solving the resulting system of equations. The class of methods
that are currently most effective are backward Euler implicit
schemes, utilizing some version of an LU splitting of the operator.
The resulting nonlinear system is solved by Newton's method
[4,5,7]. This research has added the CGS solution method to this
class of solution methods; it can iteratively attain any amount of
precision at each step in the solution process. This allows a much
larger CFL number than that permitted by the approximate LU
factorization (the divergence in the standard LU factorization
occurs at about CFL=10, according to Barth [2]).


II.  OBJECTIVES  OF  THIS  RESEARCH  EFFORT: 

The  problem  of  speeding  the  convergence  of  the  flow  solver  is  a 
complex  problem  due  to  the  strong  nonlinearities  and  the  large 
dimensionality  of  this  class  of  problems. 

The main thrust of this research was to:

1. include the CGS method in a variety of CFD code developed by
Dave Whitfield of Mississippi State and, among others, his Air
Force colleagues Dave Belk, Bruce Simpson, and Kirk Vanden
[5,12,16].

2. apply the CGS solution method to a variety of realistic flow
and boundary conditions, both for steady and unsteady state
solutions.

3. obtain quadratic convergence using the conjugate gradient CGS
algorithm. This has produced limited success, without any
preconditioning, converging only after the solution is near to
a steady state configuration. Much additional work is needed
here to determine the limits of the direct application of
Newton's method to obtaining CFD steady-state solutions.
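
The difficulty with full Newton steps far from the solution, and the effect of limiting the step size, can be seen on a scalar model problem. The sketch below is not the flow solver: it assumes the test function f(x) = atan(x) and a simple step-halving rule as the relaxation strategy, both chosen only for illustration. The function name is hypothetical.

```python
import math


def newton(f, dfdx, x0, max_iter=50, tol=1e-12, damped=True):
    """Newton iteration for f(x) = 0; returns the list of iterates.
    With damped=True the step is halved until |f| decreases."""
    x = x0
    iterates = [x]
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < tol:
            break
        step = -fx / dfdx(x)                  # full Newton step
        relaxation = 1.0
        if damped:
            # Limit the step size until the residual is reduced
            while abs(f(x + relaxation * step)) >= abs(fx) and relaxation > 1e-8:
                relaxation *= 0.5
        x = x + relaxation * step
        iterates.append(x)
    return iterates


def atan_slope(x):
    """Derivative of atan(x)."""
    return 1.0 / (1.0 + x * x)


full_steps = newton(math.atan, atan_slope, 3.0, max_iter=5, damped=False)
limited_steps = newton(math.atan, atan_slope, 3.0, damped=True)
print("full Newton iterates:  ", full_steps)
print("step-limited iterates: ", limited_steps)
```

From the same starting point the full-step iteration moves away from the root, while the step-limited iteration first takes shortened steps and then converges rapidly once it is close, mirroring the behavior described for the early, unformed solution.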

III. ORGANIZATION:

When using the conventional LU splitting and the resulting error,
the final solution is not as accurate as it could be if an 'exact'
iterative solver is used. As was pointed out earlier, the numerical
experimental limit on the CFL number using approximate LU factorization
is about 10. The use of solvers such as the conjugate
gradient squared (CGS) method, which require for solution only
multiplication of the resulting Jacobian by an arbitrary vector, is
a promising alternative to LU solution methods and allows a large
CFL number in most problems. A less complex version of this
technique, the CG method, has had excellent success in complex oil
reservoir simulations, and in many other large scale simulations.
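
The property that CGS needs only products of the matrix with a vector can be made concrete with a short sketch. The code below is a plain, unpreconditioned textbook form of the CGS iteration, not the version coded for this research; the function names and the small nonsymmetric test system are illustrative assumptions.

```python
def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(ai * bi for ai, bi in zip(a, b))


def cgs(matvec, b, x0=None, tol=1e-10, max_iter=100):
    """Unpreconditioned conjugate gradient squared iteration for A x = b.
    Only matvec(v) = A v is required. Returns (x, iterations used)."""
    n = len(b)
    x = [0.0] * n if x0 is None else list(x0)
    r = [bi - ai for bi, ai in zip(b, matvec(x))]
    r_shadow = list(r)                        # fixed shadow residual
    b_norm = dot(b, b) ** 0.5 or 1.0
    p = [0.0] * n
    q = [0.0] * n
    rho_old = 1.0
    for k in range(max_iter):
        if dot(r, r) ** 0.5 <= tol * b_norm:
            return x, k
        rho = dot(r_shadow, r)
        if rho == 0.0:
            raise ArithmeticError("CGS breakdown: rho is zero")
        if k == 0:
            u = list(r)
            p = list(u)
        else:
            beta = rho / rho_old
            u = [ri + beta * qi for ri, qi in zip(r, q)]
            p = [ui + beta * (qi + beta * pi) for ui, qi, pi in zip(u, q, p)]
        v = matvec(p)
        sigma = dot(r_shadow, v)
        if sigma == 0.0:
            raise ArithmeticError("CGS breakdown: sigma is zero")
        alpha = rho / sigma
        q = [ui - alpha * vi for ui, vi in zip(u, v)]
        uq = [ui + qi for ui, qi in zip(u, q)]
        x = [xi + alpha * wi for xi, wi in zip(x, uq)]
        a_uq = matvec(uq)
        r = [ri - alpha * wi for ri, wi in zip(r, a_uq)]
        rho_old = rho
    return x, max_iter


# Small nonsymmetric system used only as an illustration
A = [[4.0, 1.0], [2.0, 3.0]]
rhs = [6.0, 8.0]


def apply_a(vec):
    """Matrix-vector product with the illustrative matrix A."""
    return [dot(row, vec) for row in A]


solution, iterations = cgs(apply_a, rhs)
print("solution:", solution, "iterations:", iterations)
```

Because the matrix enters only through `apply_a`, the same routine could in principle be driven by any procedure that returns the product of a Jacobian with a vector, without forming or factoring the Jacobian. Each CGS iteration costs two such products.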

Any improvement in the computational efficiency of a typical flow
code should have immediate impact on the entire class of CFD
numerical methods. The inclusion of the CGS 'exact' solution method
in a large variety of CFD codes, as provided by this research, is a
major step in this direction.

By solving the problem more accurately at each step, the resulting
trek to steady state can be done with a higher degree of accuracy.
The same technique yields more accurate solutions to unsteady
(time-accurate) problems. An important advantage of conjugate
gradient class solvers is that the computations can be easily
vectorized on CRAY style computers. Because the preliminary part of
this research was to establish that the CGS conjugate gradient
methods were viable and produced useful results, the codes were not
vectorized.

The application of these solution methods to CFD problems is
complicated by the highly nonlinear interactions, which cause many
problems [1]. The main class of methods we are looking at involves
using the conjugate gradient method to accelerate existing code. The
idea is that oscillations caused by complex eigenvalues can greatly
slow convergence. As the problem complex eigenvalue propagates
through the system of nonlinear equations, the new residual is
minimized using a fixed set of k residuals from past iterations. The
determination of the parameter k is critical, as it determines the
number of eigenvalues that will be 'filtered' out of the iteration.
The generalized minimum residual concept is due to Saad and Schultz
[11]. A nonlinear version of the generalized minimum residual method
is being studied for inclusion in existing code [7]. The conjugate
gradient method will theoretically produce a solution to the equation
in a finite number of steps, while preserving the sparsity structure
of the matrix. Thus the conjugate gradient method is, in a sense, an
exact solution method.
IV. RESULTS:


This work established that conjugate gradient methods such as the
CGS method are effective and can give 'exact' solutions to the
Newton iteration needed to solve the backward Euler method.

The codes that have been modified to run the CGS solver are:

the Redcoon and Juli codes, which use common blocks to pass data.

several versions of the Ruby code, which passes data in the
subroutine arguments.

a time-accurate version of Ruby, which produces time-accurate
solutions at each iteration.

Several of these codes have been ported to 386 PCs with extended
memory. The preliminary testing was done on simple configurations on
the smaller, slower PCs, which saved much computer time on the CRAY.
The final versions were run on the CRAY.

Systems built with the approximate derivatives generated by Whitfield
were solved with no problem. When the approximate derivatives were
used in the CGS method to calculate the Jacobian approximately, we had
trouble converging. Since we were using the original preconditioner,
instead of a numerically generated preconditioner as was done in
Whitfield's version, it is felt that the difficulties can be
overcome by designing a numerical preconditioner. A similar
approach was considered in Obayashi [10].

All of these modified codes are stored on the mass storage device on
the Eglin CRAY and are available to Air Force personnel.
V. RECOMMENDATIONS:


While the application of the CGS conjugate gradient solution method
to a real flow solver has produced results on every problem it was
applied to, the full vectorization of the CGS method is still to be
done. This will increase the speed of the calculations on the CRAY
machine.

The study of the best preconditioners to use is only beginning, and
the more work done here, the fewer iterations will be needed. The
use of preconditioners that do not need the Jacobian is being
investigated and looks promising.

The  selection  of  proper  preconditioners  and  more  work  on  the 
calculation  of  the  numerical  derivatives  in  the  CGS  method  will 
lead  to  quadratic  convergence.  This  will  require  a  special 
application  of  the  solver,  one  which  will  require  a  closer 
connection  to  the  standard  Newton  method  which  does  have  quadratic 
convergence. 

The use of conjugate gradient solution methods is a promising step
toward advancing computational fluid dynamics (CFD) simulations.

Domain decomposition methods are easily exploited by the CGS
method. These concepts (see [8]) should prove useful in CFD with
complicated multi-body geometries.

The  ideas  of  vector  sequence  convergence  [13]  are  extremely 
important,  and  require  further  study. 
Executive Summary


The Executive Summary provides a brief synopsis of the content of the Final Report
(Contract No. S-210-11MG-083).

This research was motivated by the need to interpret the Jones and Mueller
matrices appropriately and to develop algorithms that extract polarization
properties from these matrices.

It is well known that a polarization element can be characterized by either a Jones
matrix or a Mueller matrix. In fact, Mueller matrices are now routinely measured in
many laboratories. However, the polarization properties described by a Jones or
Mueller matrix are not apparent. The interpretation of the Jones and Mueller matrices
is therefore crucial, but it is not yet clear. Nevertheless, two useful quantities,
diattenuation and retardance, were introduced for this purpose by several authors.
They quantify, respectively, the amplitude and phase properties of homogeneous
polarization elements. However, the definition and meaning of these quantities are not
evident for inhomogeneous polarization elements, that is, polarization elements
having nonorthogonal eigenpolarizations. The importance of inhomogeneous
polarization elements cannot be overemphasized. First, combinations of polarization
elements are, in general, inhomogeneous. Also, skew ray paths in optical systems tend
to be slightly inhomogeneous. Understanding inhomogeneous polarization elements is
also important in understanding depolarizing elements, since it is necessary to
separate the inhomogeneous part of the Mueller matrix from the depolarizing part in
data reduction for depolarizing elements.

The aim of this research is to interpret Jones and nondepolarizing Mueller matrices,
particularly those describing inhomogeneous polarization elements. The research
consists of analysis of the exponential representation of Jones matrices, analysis of
inhomogeneous polarization elements, and the data reduction of Jones and Mueller
matrices.

The following is an outline of the work contained in this final report. When the
Jones matrix is expressed in the exponential form
$\mathbf{J} = \exp(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3)$,
the real and imaginary parts of the d-coefficients have simple physical meanings:
the real parts correspond to diattenuation and the imaginary parts correspond to
retardance. By applying the Baker-Campbell-Hausdorff identity, it is shown that this
representation has several interesting order-independent properties and is very
useful in addressing deeper questions.

From the polar decomposition theorem of matrix theory, any polarization element,
homogeneous or inhomogeneous, is equivalent to a cascade of a diattenuator and a
retarder, so the diattenuation and retardance of the polarization element can be
unambiguously defined. We introduce a new unitary invariant, called inhomogeneity,
which measures the nonorthogonality of the eigenpolarizations. Explicit expressions for
diattenuation and retardance have been derived in terms of the unitary invariants: the
eigenvalues and the inhomogeneity. The resulting expressions are suitable for
performing data reduction on Jones and nondepolarizing Mueller matrices. Finally, it is
shown that two Jones matrices which share the same eigenvalues and inhomogeneity are
related by a unitary transformation or, in physical terms, that one polarization
element can be transformed into the other through the addition of a pair of orthogonal
retarders, one on either side of the element. We term this the unitary equivalence
theorem of Jones matrices. Based upon this theorem, the classification of Jones
matrices by unitary invariants is established. The generalization of these results to
include depolarizing elements is now in progress, and some preliminary results have
been found.

The achievements of this research are the following. First, several interesting
properties of the exponential representation of Jones matrices are found. Second, the
definitions of diattenuation and retardance are unambiguously generalized to include
inhomogeneous polarization elements. Third, a measure of inhomogeneity is
introduced. Finally, a new classification scheme for Jones matrices is established.

The remainder of this final report consists of four parts. The first is a draft copy
of our paper in preparation for the Journal of the Optical Society, "Generalized
diattenuation and retardance for inhomogeneous polarization elements". Sections two
and three contain studies on the Jones and Mueller calculus, which include the results
on the exponential representation of the Jones matrix. As these studies are completed,
further manuscripts will be prepared for refereed journals. The last section contains
the materials from a presentation regarding this research given at Eglin Air Force
Base in December 1991.




Table  of  Contents 


1. Generalized diattenuation and retardance for
inhomogeneous polarization elements

2.  Studies  on  the  Jones  calculus 

3.  Studies  on  the  Mueller  calculus 

4.  Presentation  materials 




Generalized  diattenuation  and  retardance  for 
inhomogeneous  polarization  elements 


Shih-Yau  Lu  and  Russell  A.  Chipman 
Department  of  Physics 
University  of  Alabama  in  Huntsville 
Huntsville,  Al  35899 




ABSTRACT 


Two useful quantities, diattenuation and retardance, which characterize the
amplitude and phase properties of a polarization element, are generalized to
inhomogeneous polarization elements, i.e., polarization elements with nonorthogonal
eigenpolarizations. Explicit expressions for them are obtained that are suitable for
performing data reduction on Jones and nondepolarizing Mueller matrices. In addition,
the classification of polarization elements by their unitary invariants is discussed.
1. INTRODUCTION


In this paper a polarization element (PE) denotes an optical element or a cascade
of optical elements that can modify the polarization state of incident light. We treat
only nondepolarizing elements. PEs can be classified into two types, homogeneous and
inhomogeneous [1,2], based upon the orthogonality of their eigenpolarizations. This
terminology was introduced by Shurcliff, who used it to classify polarizers [2]. A
homogeneous PE has two orthogonal eigenpolarizations. Most polarizers and retarders
are homogeneous. Homogeneous and isotropic interfaces, such as lenses and mirrors, are
homogeneous with linear eigenpolarizations. An inhomogeneous PE has nonorthogonal
eigenpolarizations. In many cases, an inhomogeneous PE results from a cascade of
several different homogeneous PEs. Also, skew ray paths through optical systems tend
to be slightly inhomogeneous. An inhomogeneous PE that has only one eigenpolarization
is called degenerate [3]. It is common to characterize a PE by either a Jones matrix
or a Mueller matrix. The Jones and Mueller matrices explicitly show how incident Jones
and Stokes vectors map into transmitted Jones and Stokes vectors. However, the
diattenuation and retardance described by a Jones or a Mueller matrix are not readily
apparent. The interpretation of Jones and Mueller matrices is therefore crucial, but
it is not yet clear. Thus it is highly desirable to define the diattenuation and
retardance properties of a PE and to express them explicitly in terms of the matrix
elements of the Jones or Mueller matrix.

In general, a PE not only modifies the polarization state of the incident light but,
consequently, also changes the intensity and phase of the incident light. Two useful
quantities, diattenuation and retardance, that reflect the intensity and phase
properties of a homogeneous PE are summarized in Ref. 1. According to Ref. 1, the
diattenuation $\mathcal{D}$ and the retardance $\mathcal{R}$ of a homogeneous PE are
respectively defined as

$$ \mathcal{D} = \frac{\bigl|\,|\xi_q|^2 - |\xi_r|^2\,\bigr|}{|\xi_q|^2 + |\xi_r|^2}, \qquad 1 \ge \mathcal{D} \ge 0, \tag{1} $$

$$ \mathcal{R} = |\delta_q - \delta_r|, \qquad \pi \ge \mathcal{R} \ge 0, \tag{2} $$

where $\xi_q = |\xi_q|\exp(i\delta_q)$ and $\xi_r = |\xi_r|\exp(i\delta_r)$ denote the
eigenvalues of the Jones matrix of this PE. Throughout this paper we use $\mathcal{D}$
and $\mathcal{R}$ to denote diattenuation and retardance. It should be noted that
$|\xi_q|^2$ and $|\xi_r|^2$ are the maximum and minimum intensity transmittances of the
homogeneous PE. Eqs. (1) and (2) imply that diattenuation is a measure of the
dependence of the PE's intensity transmittance upon the incident polarization state,
and retardance is a measure of the dependence of the PE's optical path length upon the
incident polarization state. In addition, the diattenuation defined in Eq. (1) is equal
to the degree of polarization (DOP) produced by the homogeneous PE when the incident
light is unpolarized [1]. Thus diattenuation can be regarded as a generalization of the
polarizance, which was also introduced by Shurcliff [2] as a performance factor for a
polarizer. A homogeneous PE of zero diattenuation is called a (homogeneous) retarder,
and a homogeneous PE of zero retardance is called a (homogeneous) diattenuator.
However, Eqs. (1) and (2) cannot be applied to inhomogeneous PEs. The main purpose of
this paper is to generalize the definitions of diattenuation and retardance to include
inhomogeneous PEs and to develop algorithms to calculate them from these matrices.
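
As a minimal sketch of how Eqs. (1) and (2) might be evaluated numerically,
the following assumes the two eigenvalues of a homogeneous PE are already
known as complex numbers. The function name and the folding of the phase
difference into the interval from 0 to pi are assumptions made here, not
part of the paper.

```python
import cmath
import math

def homogeneous_d_and_r(xi_q, xi_r):
    """Diattenuation, Eq. (1), and retardance, Eq. (2), from two eigenvalues."""
    t_q, t_r = abs(xi_q) ** 2, abs(xi_r) ** 2
    diattenuation = abs(t_q - t_r) / (t_q + t_r)
    delta = abs(cmath.phase(xi_q) - cmath.phase(xi_r))
    # Assumption: fold the phase difference into [0, pi], the range of Eq. (2).
    retardance = min(delta, 2 * math.pi - delta)
    return diattenuation, retardance

# A partial polarizer (amplitude transmittances 1 and 0.5) and a quarter-wave retarder.
d_pol, r_pol = homogeneous_d_and_r(1.0, 0.5)
d_qwp, r_qwp = homogeneous_d_and_r(cmath.exp(0.25j * math.pi),
                                   cmath.exp(-0.25j * math.pi))
```

The partial polarizer has diattenuation 0.6 and no retardance, while the
quarter-wave retarder has no diattenuation and a retardance of pi/2.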

The outline of the approach in this paper is as follows. By applying the polar
decomposition theorem of matrix theory, any PE, homogeneous or inhomogeneous, is
shown to be equivalent to a cascade of a diattenuator and a retarder, so the
diattenuation and retardance of the PE can be unambiguously defined. Moreover, we
introduce a new unitary invariant, called inhomogeneity, which measures how far the PE
departs from homogeneity. Geometrically, it relates to the nonorthogonality of the
eigenpolarizations. Explicit expressions for diattenuation and retardance in terms of
the unitary invariants, the eigenvalues and the inhomogeneity, are obtained. Finally,
the unitary equivalence theorem is readily formulated.

In fact, Gil and Bernabeu [4] also used the polar decomposition theorem to define the
polarizing and retardation parameters for nondepolarizing Mueller matrices. Though the
approach in this paper is similar to theirs, the results of this paper are more
useful. First, only two parameters, diattenuation and retardance, instead of seven are
used to characterize the polarization properties of a PE. We also obtain the
expressions for diattenuation and retardance in terms of the unitary invariants of the
Jones matrix. In addition, a parameter that is an indicator of inhomogeneity is
introduced. Compared with other classification schemes [3,4,5], the classification
discussed in this paper has the advantage that it is based upon the unitary invariants
of Jones matrices.

2. DIATTENUATION AND RETARDANCE FOR AN INHOMOGENEOUS PE - An Application
of the Polar Decomposition Theorem

From the polar decomposition theorem, any PE is equivalent to a cascade of a
diattenuator and a retarder. Furthermore, the diattenuation and retardance of a PE can
be defined, respectively, as the corresponding quantities of the diattenuator and
retarder in its polar decomposition.
Consider an arbitrary PE whose Jones matrix is denoted by $\mathbf{J}$. According to
the polar decomposition theorem [4,6,7], the Jones matrix can be written as

$$ \mathbf{J} = \mathbf{J}_R \mathbf{J}_D \tag{3} $$

or

$$ \mathbf{J} = \mathbf{J}_D' \mathbf{J}_R, \tag{4} $$

where $\mathbf{J}_D$ and $\mathbf{J}_D'$ are nonnegative definite Hermitian matrices and
$\mathbf{J}_R$ is unitary. Moreover, $\mathbf{J}_D$ and $\mathbf{J}_D'$ are respectively
given by [7]

$$ \mathbf{J}_D = \sqrt{\mathbf{J}^\dagger \mathbf{J}} \tag{5} $$

and

$$ \mathbf{J}_D' = \sqrt{\mathbf{J} \mathbf{J}^\dagger}, \tag{6} $$

where the dagger denotes the Hermitian conjugate. In general, $\mathbf{J}_D$ and
$\mathbf{J}_D'$ are not identical, but their eigenvalues are identical. If the
determinant of $\mathbf{J}$ is not zero, then $\mathbf{J}_R$ is uniquely given by

$$ \mathbf{J}_R = \mathbf{J} \mathbf{J}_D^{-1} = \mathbf{J}_D'^{-1} \mathbf{J} \tag{7} $$

or, equivalently,

$$ \mathbf{J}_R = \frac{\mathbf{J} + |\mathrm{Det}\,\mathbf{J}|\,(\mathbf{J}^\dagger)^{-1}}{\mathrm{Tr}\,\mathbf{J}_D}, \tag{8} $$

where Det and Tr denote the determinant and trace, respectively. $\mathbf{J}_R$ is not
unique when the determinant of $\mathbf{J}$ is zero, but it will be pointed out later
that no ambiguity exists. Since $\mathbf{J}_D$ and $\mathbf{J}_D'$ correspond to
diattenuators, and $\mathbf{J}_R$ corresponds to a retarder, any PE can be interpreted
as a cascade of a retarder and a diattenuator.
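
The decomposition can be checked numerically with a short sketch. It
assumes a nonsingular 2x2 Jones matrix stored as nested lists of complex
numbers, and it computes the matrix square root in Eq. (5) through the
Cayley-Hamilton identity for 2x2 matrices rather than through a general
eigen-decomposition; that shortcut and all function names are choices made
here for illustration.

```python
import math

def dagger(m):
    """Hermitian conjugate of a 2x2 matrix stored as nested lists."""
    return [[m[j][i].conjugate() for j in range(2)] for i in range(2)]

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def polar_decompose(jones):
    """Return (J_R, J_D) with jones = J_R J_D, as in Eqs. (3), (5) and (7)."""
    g = matmul(dagger(jones), jones)                    # J^dagger J
    det_jd = abs(jones[0][0] * jones[1][1] - jones[0][1] * jones[1][0])
    if det_jd == 0.0:
        raise ValueError("Det J = 0: J_R is not unique")
    tr_jd = math.sqrt((g[0][0] + g[1][1]).real + 2.0 * det_jd)
    eye = [[1.0, 0.0], [0.0, 1.0]]
    # Cayley-Hamilton for 2x2: J_D = (J^dagger J + det(J_D) I) / Tr(J_D)
    j_d = [[(g[i][k] + det_jd * eye[i][k]) / tr_jd for k in range(2)]
           for i in range(2)]
    # and likewise J_D^{-1} = (Tr(J_D) I - J_D) / det(J_D)
    j_d_inv = [[(tr_jd * eye[i][k] - j_d[i][k]) / det_jd for k in range(2)]
               for i in range(2)]
    return matmul(jones, j_d_inv), j_d

# Hypothetical inhomogeneous Jones matrix used only as a test case.
J = [[1.0 + 0.0j, 0.5 + 0.0j], [0.0j, 0.8j]]
J_R, J_D = polar_decompose(J)
```

For this example J_R comes out unitary and the product J_R J_D reproduces
J to round-off, which is the content of Eqs. (3) and (7).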

Since a retarder does not change the DOP or intensity of incident light, the
diattenuation of the PE is equal to the diattenuation of $\mathbf{J}_D$ (or
$\mathbf{J}_D'$). The intensity transmittance of the PE can be written as

$$ T = (\mathbf{J}\hat{\mathbf{E}})^\dagger (\mathbf{J}\hat{\mathbf{E}}) = \hat{\mathbf{E}}^\dagger \mathbf{J}^\dagger \mathbf{J} \hat{\mathbf{E}}, \tag{9} $$

where the Jones vector with a caret is normalized. The eigenvalues of $\mathbf{J}_D$
(or $\mathbf{J}_D'$) are the square roots of the PE's maximum and minimum intensity
transmittances, $T_{\max}$ and $T_{\min}$. Therefore, the appropriate definition of the
diattenuation for the PE is

$$ \mathcal{D} = \frac{T_{\max} - T_{\min}}{T_{\max} + T_{\min}}, \qquad 1 \ge \mathcal{D} \ge 0. \tag{10} $$

This definition is a generalization of Eq. (1), and Eq. (10) reduces to Eq. (1) for
homogeneous PEs. Let $\mathbf{E}_{\max}$ and $\mathbf{E}_{\min}$ be the incident
polarization states which yield the maximum and minimum intensity transmittance. Since
they are the eigenpolarizations of $\mathbf{J}_D$, it follows that

$$ \mathbf{E}_{\max}^\dagger \mathbf{E}_{\min} = 0 \tag{11} $$

and

$$ (\mathbf{J}\mathbf{E}_{\max})^\dagger (\mathbf{J}\mathbf{E}_{\min}) = 0. \tag{12} $$

Thus the diattenuation defined in Eq. (10) is always equal to the DOP of the output
light when the light incident on the PE is unpolarized.

Similarly, the retardance of the PE is defined to be the retardance of
$\mathbf{J}_R$, and it can be written as

$$ \mathcal{R} = 2\cos^{-1}\!\left( \tfrac{1}{2} \left| \mathrm{Tr}\,\mathbf{J}_R \right| \right), \qquad \pi \ge \mathcal{R} \ge 0, \tag{13} $$

in which $\mathbf{J}_R$ is given in Eq. (7) or (8). When the determinant of
$\mathbf{J}$ is zero, the diattenuation is unity and we define the retardance to be
zero. It can be shown that a Jones matrix with zero determinant describes a PE whose
output light is always in a fixed, fully polarized state regardless of the incident
polarization state. Such PEs are called polarizers. Thus polarizers are PEs with unit
diattenuation.

In the above paragraphs we have given unambiguous definitions of the diattenuation
and retardance in the general case. In the following we derive their explicit
expressions.
A. Homogeneous and inhomogeneous polarization elements


Again, let $\mathbf{J}$ be the Jones matrix of the PE under consideration. Its
eigenpolarizations $\mathbf{E}_q$ and $\mathbf{E}_r$ satisfy the characteristic
equations

$$ \mathbf{J}\mathbf{E}_q = \xi_q \mathbf{E}_q, \qquad \mathbf{J}\mathbf{E}_r = \xi_r \mathbf{E}_r, \tag{14} $$

where $\xi_q = |\xi_q|\exp(i\delta_q)$ and $\xi_r = |\xi_r|\exp(i\delta_r)$ are the
eigenvalues of the Jones matrix. For an inhomogeneous PE, understandably, the inner
product of the eigenpolarizations is an interesting quantity. We define the parameter
$\eta$ as

$$ \eta = \left| \hat{\mathbf{E}}_q^\dagger \hat{\mathbf{E}}_r \right|, \qquad 1 > \eta \ge 0. \tag{15} $$

For a homogeneous PE, $\eta$ is zero. Thus $\eta$ can be regarded as an indicator of
inhomogeneity and, for convenience, we call it the inhomogeneity. Its geometrical
meaning becomes transparent on the Poincaré sphere. Let $\mathbf{s}_q$ and
$\mathbf{s}_r$ be the eigenpolarizations represented on the Poincaré sphere, and let
$\chi_{qr}$ be half of the angle subtended between them (Fig. 1). It can be shown [8]
that

$$ \eta = \cos\chi_{qr}. \tag{16} $$

Thus $\cos^{-1}\eta$ can be regarded as the "angle" between the eigenpolarizations.
$\chi_{qr}$ is $\pi/2$ for a homogeneous PE and is less than $\pi/2$ for an
inhomogeneous PE. Note that, from Eq. (16), $\eta$ is equal to the "similarity factor"
of the eigenpolarizations introduced in Ref. 9.

Since $T_{\max}$ and $T_{\min}$ are the extreme values of the intensity transmittance,
by differentiating Eq. (9) it can be found that they satisfy the following quadratic
equation

$$ (1-\eta^2)\,T^2 - \left[\, |\xi_q|^2 + |\xi_r|^2 - \eta^2 (\xi_q\xi_r^* + \xi_q^*\xi_r) \right] T + (1-\eta^2)\,|\xi_q|^2 |\xi_r|^2 = 0, \tag{18} $$

i.e.
$$ T_{\max}\, T_{\min} = |\xi_q|^2 |\xi_r|^2, \tag{19} $$

$$ T_{\max} + T_{\min} = \frac{|\xi_q|^2 + |\xi_r|^2 - \eta^2 (\xi_q\xi_r^* + \xi_q^*\xi_r)}{1-\eta^2}. \tag{20} $$

From Eqs. (10), (19) and (20), the expression for the diattenuation is

$$ \mathcal{D} = \left\{ 1 - \frac{4(1-\eta^2)^2 |\xi_q|^2 |\xi_r|^2}{\left[\, |\xi_q|^2 + |\xi_r|^2 - \eta^2 (\xi_q\xi_r^* + \xi_q^*\xi_r) \right]^2} \right\}^{1/2}. \tag{21} $$

The combination of Eqs. (8), (13), (19) and (20) yields the expression for the
retardance:

$$ \mathcal{R} = 2\cos^{-1}\!\left\{ \left[ \frac{(1-\eta^2)\,(|\xi_q| + |\xi_r|)^2}{(|\xi_q| + |\xi_r|)^2 - \eta^2 \left( 2|\xi_q||\xi_r| + \xi_q\xi_r^* + \xi_q^*\xi_r \right)} \right]^{1/2} \cos\frac{\delta_q - \delta_r}{2} \right\}. \tag{22} $$

When the inhomogeneity is zero, Eqs. (21) and (22), as expected, reduce to Eqs. (1)
and (2), respectively. From Eq. (21), the diattenuation cannot be zero except for a
retarder. From Eq. (22), the retardance is zero only for a diattenuator or a
polarizer. In other words, inhomogeneous PEs have both nonzero diattenuation and
nonzero retardance, except for inhomogeneous polarizers, whose retardance is defined
to be zero.
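
A small numerical sketch may help show how inhomogeneity changes the two
quantities. It evaluates Eqs. (21) and (22) as reconstructed from the
quadratic (18), for a nondegenerate PE with nonzero determinant; the function
name, the guards against round-off, and the sample values are assumptions
made for illustration.

```python
import cmath
import math

def d_and_r_from_invariants(xi_q, xi_r, eta):
    """Evaluate Eqs. (21) and (22) for a nondegenerate PE with Det J != 0."""
    a_q, a_r = abs(xi_q), abs(xi_r)
    cross = 2.0 * (xi_q * xi_r.conjugate()).real      # xi_q xi_r* + xi_q* xi_r
    s = 1.0 - eta ** 2
    denom = a_q ** 2 + a_r ** 2 - eta ** 2 * cross
    diattenuation = math.sqrt(
        max(0.0, 1.0 - 4.0 * s ** 2 * a_q ** 2 * a_r ** 2 / denom ** 2))
    half_phase = 0.5 * (cmath.phase(xi_q) - cmath.phase(xi_r))
    ratio = s * (a_q + a_r) ** 2 / (
        (a_q + a_r) ** 2 - eta ** 2 * (2.0 * a_q * a_r + cross))
    # abs() and min() only guard against sign and round-off; they are assumptions.
    argument = min(1.0, math.sqrt(ratio) * abs(math.cos(half_phase)))
    retardance = 2.0 * math.acos(argument)
    return diattenuation, retardance

# Same real eigenvalues, first with orthogonal and then with
# nonorthogonal eigenpolarizations.
d0, r0 = d_and_r_from_invariants(1.0 + 0j, 0.5 + 0j, 0.0)
d1, r1 = d_and_r_from_invariants(1.0 + 0j, 0.5 + 0j, 0.6)
```

With eta = 0 the result is the homogeneous value of Eq. (1) and zero
retardance; with eta = 0.6 both the diattenuation and the retardance
increase even though the eigenvalues are unchanged and real.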

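B. Degenerate polarizing elements

In the case that the Jones matrix $\mathbf{J}$ describes a degenerate PE, we need to
redefine $\eta$ as

$$ \left| (\mathbf{J} - \xi_q \mathbf{I})\,\hat{\mathbf{F}}_q \right| = \eta, \qquad \eta \ne 0, \tag{23} $$

where $\mathbf{I}$ is the identity matrix, $\xi_q\,(= \xi_r)$ is the only eigenvalue,
and $\hat{\mathbf{F}}_q$ is an arbitrary Jones vector normal to the eigenpolarization
$\mathbf{E}_q$. It is shown in Appendix A that $\mathbf{J}$ can be transformed into

$$ \begin{pmatrix} \xi_q & \eta \\ 0 & \xi_q \end{pmatrix} \tag{24} $$

by a unitary change of basis. From the representation in Eq. (24), $\eta$ can be
regarded as the "coupling" between the two orthogonal polarization states
$\mathbf{E}_q$ and $\mathbf{F}_q$.

With $\eta$ given above, $T_{\max}$ and $T_{\min}$ are given by

$$ T_{\max} = |\xi_q|^2 + \frac{\eta^2}{2} + \eta \sqrt{|\xi_q|^2 + \frac{\eta^2}{4}}, \tag{25} $$

$$ T_{\min} = |\xi_q|^2 + \frac{\eta^2}{2} - \eta \sqrt{|\xi_q|^2 + \frac{\eta^2}{4}}. \tag{26} $$

And then we have

$$ \mathcal{D} = \frac{\eta \sqrt{4|\xi_q|^2 + \eta^2}}{2|\xi_q|^2 + \eta^2}, \tag{27} $$

$$ \mathcal{R} = 2\cos^{-1}\!\left( \frac{|\xi_q|}{\sqrt{|\xi_q|^2 + \eta^2/4}} \right). \tag{28} $$

If $\eta$ is zero, the Jones matrix is proportional to the identity matrix and, as
expected, Eqs. (27) and (28) give zero diattenuation and zero retardance.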
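
The degenerate case is simple enough to evaluate directly. The following
sketch assumes the canonical form of Eq. (24) with a nonzero eigenvalue and
takes the diattenuation from Eq. (10); the function name and sample values
are illustrative only.

```python
import math

def degenerate_d_and_r(xi_q, eta):
    """Evaluate Eqs. (25)-(28) for the Jones matrix [[xi_q, eta], [0, xi_q]]."""
    a2 = abs(xi_q) ** 2
    root = math.sqrt(a2 + eta ** 2 / 4.0)
    t_max = a2 + eta ** 2 / 2.0 + eta * root            # Eq. (25)
    t_min = a2 + eta ** 2 / 2.0 - eta * root            # Eq. (26)
    diattenuation = (t_max - t_min) / (t_max + t_min)   # Eq. (10)
    # min() guards against round-off pushing the argument above 1.
    retardance = 2.0 * math.acos(min(1.0, abs(xi_q) / root))   # Eq. (28)
    return t_max, t_min, diattenuation, retardance

t_max, t_min, d, r = degenerate_d_and_r(1.0, 0.5)
```

For this example the product of T_max and T_min equals the squared modulus
of the determinant, as it must, and setting eta to zero returns zero
diattenuation and zero retardance.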

C. Classification of polarization elements

It is important to realize that both the eigenvalues and $\eta$ of a PE are unitary
invariants, and so are the diattenuation and retardance. The following theorem
indicates that the eigenvalues and $\eta$ are also the only three unitary invariants a
Jones matrix can have.

The Unitary Equivalence Theorem: Two PEs have the same eigenvalues and $\eta$ if and
only if they are unitarily equivalent, i.e., there exists a unitary matrix
$\mathbf{U}$ such that

$$ \mathbf{J}_2 = \mathbf{U}^\dagger \mathbf{J}_1 \mathbf{U}, \tag{29} $$

where $\mathbf{J}_1$ and $\mathbf{J}_2$ are their Jones matrices.

A proof is given in Appendix B. The transform in Eq. (29) is the same as a change of
basis, so two unitarily equivalent Jones matrices in fact represent the same PE in
different bases. Thus PEs can be classified into sets of unitarily equivalent PEs with
the same parameters $(\xi_q, \xi_r; \eta)$. The order of $\xi_q$ and $\xi_r$ is
irrelevant. The set of unitarily equivalent PEs with $(\xi_q, \xi_r; \eta)$ will be
denoted by PE$(\xi_q, \xi_r; \eta)$. It should be noted that the PEs of the set
PE$(\xi_q, \xi_r; \eta)$ also have the same diattenuation and retardance. The
classification scheme discussed here is simple and elegant.

Let $\Sigma|\mathbf{J}|^2 = |j_{11}|^2 + |j_{12}|^2 + |j_{21}|^2 + |j_{22}|^2$, where
$j_{mn}$ $(m,n = 1,2)$ are the matrix elements of the Jones matrix of the PE. It is
easy to see that $(\Sigma|\mathbf{J}|^2, \mathrm{Tr}\,\mathbf{J}, \mathrm{Det}\,\mathbf{J})$
is another important set of unitary invariants, which will be useful in the following
discussion.

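3. DATA REDUCTION

When the Jones matrix $\mathbf{J}$ of a PE is known, the eigenvalues and $\eta$ are
given by

$$ \xi_q,\ \xi_r = \tfrac{1}{2}\left[ \mathrm{Tr}\,\mathbf{J} \pm \sqrt{(\mathrm{Tr}\,\mathbf{J})^2 - 4\,\mathrm{Det}\,\mathbf{J}} \right], \tag{30} $$

$$ \eta = \left[ \frac{\Sigma|\mathbf{J}|^2 - \tfrac{1}{2}\left( |\mathrm{Tr}\,\mathbf{J}|^2 + \left| (\mathrm{Tr}\,\mathbf{J})^2 - 4\,\mathrm{Det}\,\mathbf{J} \right| \right)}{\Sigma|\mathbf{J}|^2 - \tfrac{1}{2}\left( |\mathrm{Tr}\,\mathbf{J}|^2 - \left| (\mathrm{Tr}\,\mathbf{J})^2 - 4\,\mathrm{Det}\,\mathbf{J} \right| \right)} \right]^{1/2}, \tag{31} $$

or, for a degenerate PE,

$$ \eta = |j_{12}| + |j_{21}|. \tag{32} $$

The diattenuation and retardance can be obtained from Eqs. (21) and (22) or, for a
degenerate PE, from Eqs. (27) and (28). The following equations provide another way to
calculate the diattenuation and retardance:

$$ \mathcal{D} = \sqrt{ 1 - \frac{4\,|\mathrm{Det}\,\mathbf{J}|^2}{\left( \Sigma|\mathbf{J}|^2 \right)^2} }, \tag{33} $$

$$ \mathcal{R} = 2\cos^{-1}\!\left[ \frac{\left| \mathrm{Tr}\,\mathbf{J} + \dfrac{\mathrm{Det}\,\mathbf{J}}{|\mathrm{Det}\,\mathbf{J}|}\,\mathrm{Tr}\,\mathbf{J}^\dagger \right|}{2\sqrt{\Sigma|\mathbf{J}|^2 + 2\,|\mathrm{Det}\,\mathbf{J}|}} \right]. \tag{34} $$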
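
The data reduction above can be sketched as a single routine. It assumes a
nondegenerate 2x2 Jones matrix with nonzero determinant, given as nested
lists; the degenerate case, where Eq. (32) replaces Eq. (31), and the
polarizer case are not handled. The function name and the test matrix are
assumptions made for illustration.

```python
import cmath
import math

def reduce_jones(jones):
    """Eigenvalues, eta, D and R of a nondegenerate Jones matrix, Det J != 0."""
    (a, b), (c, d) = jones
    tr = complex(a + d)
    det = complex(a * d - b * c)
    norm2 = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2   # sum |j_mn|^2
    disc = tr * tr - 4.0 * det
    root = cmath.sqrt(disc)
    xi_q, xi_r = 0.5 * (tr + root), 0.5 * (tr - root)               # Eq. (30)
    num = norm2 - 0.5 * (abs(tr) ** 2 + abs(disc))
    den = norm2 - 0.5 * (abs(tr) ** 2 - abs(disc))
    eta = math.sqrt(max(0.0, num / den))                            # Eq. (31)
    diattenuation = math.sqrt(
        max(0.0, 1.0 - 4.0 * abs(det) ** 2 / norm2 ** 2))           # Eq. (33)
    unit = det / abs(det)
    cos_half = abs(tr + unit * tr.conjugate()) / (
        2.0 * math.sqrt(norm2 + 2.0 * abs(det)))
    retardance = 2.0 * math.acos(min(1.0, cos_half))                # Eq. (34)
    return xi_q, xi_r, eta, diattenuation, retardance

# Hypothetical test matrix with eigenvalues 1 and 0.5 and inhomogeneity 0.6.
xi_q, xi_r, eta, diatt, ret = reduce_jones([[1.0, -0.375], [0.0, 0.5]])
```

The test matrix was built to have eigenvalues 1 and 0.5 with eta = 0.6, and
the routine recovers those values together with the diattenuation and
retardance that Eqs. (21) and (22) give for the same invariants.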


For each nondepolarizing Mueller matrix, the corresponding Jones matrix can
readily be obtained, up to an unimportant total phase, from the following
equations [11]:

$$ |j_{11}| = \sqrt{\frac{m_{11} + m_{12} + m_{21} + m_{22}}{2}}, \tag{35} $$

$$ |j_{12}| = \sqrt{\frac{m_{11} - m_{12} + m_{21} - m_{22}}{2}}, \tag{36} $$

$$ |j_{21}| = \sqrt{\frac{m_{11} + m_{12} - m_{21} - m_{22}}{2}}, \tag{37} $$

$$ |j_{22}| = \sqrt{\frac{m_{11} - m_{12} - m_{21} + m_{22}}{2}}, \tag{38} $$

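$$ \phi_{12} - \phi_{11} = \tan^{-1}(\,\cdots), \tag{39} $$

$$ \phi_{21} - \phi_{11} = \tan^{-1}(\,\cdots), \tag{40} $$

$$ \phi_{22} - \phi_{11} = \tan^{-1}(\,\cdots), \tag{41} $$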

in which $\phi_{mn}$ $(m,n = 1,2)$ denote the phases of $j_{mn}$. Thus the data
reduction for Mueller matrices is the same as for Jones matrices, and Eqs. (30)-(34)
are still applicable. From the definition and property of diattenuation, the following
equation also gives the diattenuation:

$$ \mathcal{D} = \frac{\sqrt{m_{12}^2 + m_{13}^2 + m_{14}^2}}{m_{11}}. \tag{42} $$
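
The amplitude relations, Eqs. (35)-(38), use only the upper-left 2x2 block
of the Mueller matrix and can be sketched as follows. The function name is
an assumption, and the test case is the standard Mueller matrix of an ideal
horizontal linear polarizer rather than data from this report.

```python
import math

def jones_amplitudes(mueller):
    """|j_mn| from the upper-left 2x2 block of a Mueller matrix, Eqs. (35)-(38)."""
    m11, m12 = mueller[0][0], mueller[0][1]
    m21, m22 = mueller[1][0], mueller[1][1]
    amp = lambda v: math.sqrt(max(0.0, v) / 2.0)   # max() guards round-off only
    return [[amp(m11 + m12 + m21 + m22), amp(m11 - m12 + m21 - m22)],
            [amp(m11 + m12 - m21 - m22), amp(m11 - m12 - m21 + m22)]]

# Ideal horizontal linear polarizer (a standard textbook Mueller matrix).
M = [[0.5, 0.5, 0.0, 0.0],
     [0.5, 0.5, 0.0, 0.0],
     [0.0, 0.0, 0.0, 0.0],
     [0.0, 0.0, 0.0, 0.0]]
amps = jones_amplitudes(M)
```

Only the amplitude of j11 is nonzero here, as expected for a polarizer that
passes the horizontal component and blocks the vertical one.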


4. CONCLUSION


The definitions of diattenuation and retardance have been generalized to include
inhomogeneous PEs by applying the polar decomposition theorem. Their explicit
expressions in terms of the unitary invariants, the eigenvalues and $\eta$, have been
obtained. These expressions can readily be used to perform data reduction on Jones and
nondepolarizing Mueller matrices. The new unitary invariant $\eta$ that we introduced
has proved to be useful. Our results also include the classification of PEs by their
unitary invariants. Diattenuation, retardance and inhomogeneity are useful quantities
that characterize the polarization properties of a PE. To treat depolarizing elements,
modification and further generalization of the approach in this paper are needed.

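APPENDIX A: A Proof of the Property of Degenerate Polarizing Elements

According to the Cayley-Hamilton theorem [7], we have

$$ (\mathbf{J} - \xi_q \mathbf{I})^2 = 0. \tag{43} $$

Thus, for an arbitrary vector $\mathbf{E}$, $(\mathbf{J} - \xi_q \mathbf{I})\mathbf{E}$
is always parallel to $\mathbf{E}_q$. By properly choosing the absolute phase of
$\hat{\mathbf{F}}_q$, the following equation will hold:

$$ (\mathbf{J} - \xi_q \mathbf{I})\,\hat{\mathbf{F}}_q = \eta\,\hat{\mathbf{E}}_q. \tag{44} $$

$\hat{\mathbf{E}}_q$ and $\hat{\mathbf{F}}_q$ are orthonormal Jones vectors, so the
matrix $[\hat{\mathbf{E}}_q, \hat{\mathbf{F}}_q]$, with columns equal to
$\hat{\mathbf{E}}_q$ and $\hat{\mathbf{F}}_q$, is a unitary matrix. By some vector and
matrix manipulation, we find

$$ [\hat{\mathbf{E}}_q, \hat{\mathbf{F}}_q]^\dagger\, \mathbf{J}\, [\hat{\mathbf{E}}_q, \hat{\mathbf{F}}_q] = \begin{pmatrix} \xi_q & \eta \\ 0 & \xi_q \end{pmatrix}. \tag{45} $$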

APPENDIX B: A Proof of the Unitary Equivalence Theorem

Since the Jones matrix of any degenerate PE can be transformed into Eq. (24),
this theorem obviously holds for degenerate PEs. Thus we only have to consider PEs
with two eigenpolarizations. Since the sufficient condition is quite obvious, we give
a proof of the necessary condition only.

Necessary condition: Without loss of generality we assume $\xi_{q1} = \xi_{q2}$ and
$\xi_{r1} = \xi_{r2}$, where the $\xi$'s are the eigenvalues of the two PEs. Let
$(\mathbf{E}_{q1}, \mathbf{E}_{r1})$ and $(\mathbf{E}_{q2}, \mathbf{E}_{r2})$ be the
eigenpolarizations of the two PEs. Since the two PEs have the same $\eta$, by properly
choosing the relative phases we can have

$$ \mathbf{E}_{q1}^\dagger \mathbf{E}_{r1} = \mathbf{E}_{q2}^\dagger \mathbf{E}_{r2}. \tag{46} $$

There exists a matrix $\mathbf{U}$ such that

$$ \mathbf{E}_{q2} = \mathbf{U}\mathbf{E}_{q1}, \qquad \mathbf{E}_{r2} = \mathbf{U}\mathbf{E}_{r1}. \tag{47} $$

From Eq. (46), $\mathbf{U}$ is unitary. From Eq. (47), it is the desired unitary
transform matrix in Eq. (29).
§I On Homogeneous Retarders and Diattenuators:

An n×n matrix is called normal when it has n orthogonal eigenvectors. A
homogeneous polarization element has a normal Jones matrix. Specifically, a
homogeneous retarder (HR) is proportional to a unitary matrix and a
homogeneous diattenuator (HD) is proportional to a nonnegative Hermitian
matrix. We find that some theorems on unitary and Hermitian matrices can readily
be applied to HRs and HDs. A few examples of such applications are listed
below. Each matrix theorem is followed by its application to polarization
elements.

(i) Theorem: A matrix M is normal if and only if
$$ M M^\dagger = M^\dagger M. $$
Application: A polarization element J is homogeneous if and only if
$$ J J^\dagger = J^\dagger J. $$

(ii) Theorem: Any 2x2 Hermitian matrix H can be expanded as
$$ H = c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3, $$
where the c's are real.
Application: The most general form for the HD is
$$ J_{HD} = a\,(c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3), $$
where a can be complex, the c's are real, and
$$ c_0^2 \ge c_1^2 + c_2^2 + c_3^2. $$

(iii) Theorem: For any unitary matrix U we can find a Hermitian matrix H such that
$$ U = \exp(iH). $$
Application: The most general form for the HR is
$$ J_{HR} = a \exp\left[ i\,(c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3) \right], $$
where a can be complex and the c's are real.

(iv) Theorem: The product of unitary matrices is unitary.
Application: The cascade of HRs is still an HR.

(v) Theorem: Any matrix M can be expressed in the form, the polar representation,
$$ M = HU = UH' $$
with
$$ H = (M M^\dagger)^{1/2}, \qquad H' = (M^\dagger M)^{1/2}, $$
$$ U = H^{-1} M, \qquad U = M H'^{-1} \qquad (\text{if } \det(M) \ne 0), $$
where the U's and H's are unitary and nonnegative Hermitian matrices,
respectively.
Application: Any Jones matrix J can be expressed in the form
$$ J = J_{HD}\, J_{HR} = J'_{HR}\, J'_{HD}, $$
which means any Jones matrix can be realized by only one HR and one HD.
Not all retarders and diattenuators are homogeneous. The most general form for the retarder is

$J_R = A \begin{bmatrix} e^{i\alpha} & 0 \\ 0 & e^{i\beta} \end{bmatrix} A^{-1}$

and for the diattenuator is

$J_D = A \begin{bmatrix} \gamma_1 & 0 \\ 0 & \gamma_2 \end{bmatrix} A^{-1}$

where A is an arbitrary nonsingular matrix, $\alpha$ and $\beta$ are real, $\gamma_1$ and $\gamma_2$ are
positive  real.  Note  that  these  forms  can  not  represent  the  "diattenuators"  which 


are  similar  to 


§II On the Couplance of Inhomogeneous Polarization Elements

In the following discussion, the polarization element has the Jones matrix

$J = c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3$

and two normalized eigenpolarization states $E_q$ and $E_r$.

We learned that, for homogeneous polarization elements, the maximum coupling occurs when the input state E is

$E = (E_q + e^{i\delta}E_r)/\sqrt{2}$  (2-1)

where $\delta$ is an arbitrary real number, and the couplance C is

$C = |c_1^2 + c_2^2 + c_3^2| / (|c_0^2| + |c_1^2 + c_2^2 + c_3^2|).$  (2-2)

The  objective  of  this  note  is  to  discuss  the  maximum  coupling  input  state  and 
the  couplance  of  the  inhomogeneous  polarization  elements. 

Suppose J is an inhomogeneous polarization element with two eigenpolarization states. We let

$J_0 = c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3$

which is the "effective" part of the polarization element, and thus

$J = c_0\sigma_0 + J_0.$

$J_0$ has the same eigenpolarization states as J, i.e.,

$J_0 E_q = \xi E_q, \quad J_0 E_r = -\xi E_r$

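where $\xi = (c_1^2 + c_2^2 + c_3^2)^{1/2}$. Since $JE = c_0E + J_0E$, the maximum coupling must occur when

$E^\dagger(J_0E) = 0$  (2-3)

and $|J_0E|$ is at its maximum. Fortunately these two conditions can be satisfied simultaneously. We can express E as

$E = aE_q + bE_r$  (2-4)

and easily get

$J_0E = \xi(aE_q - bE_r).$  (2-5)

It follows from Eqs. (2-3), (2-4) and (2-5) that

$[aE_q + bE_r]^\dagger[\xi(aE_q - bE_r)] = 0$

and then

$\xi\{|a|^2 - |b|^2 - 2i\,\mathrm{Im}(a^* b\, E_q^\dagger E_r)\} = 0.$

Suppose $\xi \ne 0$; otherwise $J = c_0\sigma_0$ and no coupling can occur. We have $|a| = |b|$ and $\mathrm{Arg}(a^*b) = \pi - \mathrm{Arg}(E_q^\dagger E_r)$ or $-\mathrm{Arg}(E_q^\dagger E_r)$.

Besides, since $|aE_q - bE_r|^2 = |a|^2 + |b|^2 - 2\,\mathrm{Re}(a^* b\, E_q^\dagger E_r)$, to make $|J_0E|$ maximum we have to choose $\mathrm{Arg}(a^*b) = \pi - \mathrm{Arg}(E_q^\dagger E_r)$. Therefore, the maximum coupling input state E is

$E = (E_q - e^{i\alpha}E_r)/(2 - 2|E_q^\dagger E_r|)^{1/2}$ with $\alpha = -\mathrm{Arg}(E_q^\dagger E_r)$,

so that $a^*b \propto -e^{i\alpha}$ has the argument $\pi - \mathrm{Arg}(E_q^\dagger E_r)$ required above.

The relative phase can be arbitrary only in the homogeneous case. When the maximum coupling occurs, the fractional intensity in the input state is

$I_E = (c_0E)^\dagger(c_0E) = |c_0^2|$

and the fractional intensity in the orthogonal state is

$I_{E\perp} = (J_0E)^\dagger(J_0E) = k|c_1^2 + c_2^2 + c_3^2|$

where $k = (1 + |E_q^\dagger E_r|)/(1 - |E_q^\dagger E_r|) \ge 1$. Finally we get the couplance C:

$C = I_{E\perp}/(I_E + I_{E\perp}) = k|c_1^2 + c_2^2 + c_3^2| / (|c_0^2| + k|c_1^2 + c_2^2 + c_3^2|).$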

We  conclude  that,  for  any  polarization  element,  homogeneous  or  not,  with 
two  eigenpolarization  states,  the  maximum  coupling  occurs  when  input  state  is 
an  equal  mixture  of  the  eigenpolarization  states. 
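
The maximum-coupling state derived above can be checked numerically. The following is a minimal sketch, not part of the original note: the matrix `J0`, the eigenvectors `Eq` and `Er`, and all helper names are hypothetical choices made for illustration. It assumes the relative phase that maximizes $|J_0E|$ is $-\mathrm{Arg}(E_q^\dagger E_r)$, verifies the orthogonality condition (2-3) and the factor k, and compares against a coarse scan over input states.

```python
import cmath
import math

def inner(u, v):
    """Hermitian inner product u^dagger v of two 2-component vectors."""
    return u[0].conjugate() * v[0] + u[1].conjugate() * v[1]

def matvec(m, v):
    """Apply a 2x2 matrix to a 2-component vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]

def normalize(v):
    n = math.sqrt(inner(v, v).real)
    return [v[0] / n, v[1] / n]

def coupled_intensity(j0, e):
    """Intensity of J0 e found in the state orthogonal to the unit vector e."""
    out = matvec(j0, e)
    return inner(out, out).real - abs(inner(e, out)) ** 2

# Hypothetical example: traceless J0 with eigenvalues +1 and -1 (xi = 1)
# and non-orthogonal eigenvectors Eq and Er.
xi = 1.0
J0 = [[1 + 0j, 1 + 0j], [0j, -1 + 0j]]
Eq = [1 + 0j, 0j]
Er = normalize([cmath.exp(0.7j) * 1, cmath.exp(0.7j) * -2])  # arbitrary phase

eta = inner(Eq, Er)
alpha = -cmath.phase(eta)  # assumed relative phase of the maximum-coupling state
E = normalize([Eq[i] - cmath.exp(1j * alpha) * Er[i] for i in range(2)])

J0E = matvec(J0, E)
overlap = inner(E, J0E)            # condition (2-3): should vanish
coupled = inner(J0E, J0E).real     # should equal k * |xi|^2
k = (1 + abs(eta)) / (1 - abs(eta))

# Coarse scan over input states (cos t, e^{ip} sin t).
scan_max = max(
    coupled_intensity(
        J0,
        [complex(math.cos(t * math.pi / 60)),
         cmath.exp(1j * p * math.pi / 30) * math.sin(t * math.pi / 60)],
    )
    for t in range(61)
    for p in range(60)
)
```

In this example no scanned state couples more intensity into its orthogonal state than the state built from the two eigenpolarizations with equal weights.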


§III On the "Reversibility" of Jones Matrices

A polarization element or Jones matrix is said to be reversible if it can be realized in at least one way that works for light traveling in both directions. For convenience, we use a hat, as in $\hat{J}$, to denote the Jones matrix seen by light traveling in the opposite direction. Clearly, if

$J = J_1 J_2 \cdots J_n$

then

$\hat{J} = \hat{J}_n \cdots \hat{J}_2 \hat{J}_1.$
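Let $J_L(\xi_x, \xi_y)$ and $J_C(\alpha)$ represent the Jones matrices which have the following forms and properties:

$J_L(\xi_x, \xi_y) = \begin{bmatrix} \xi_x & 0 \\ 0 & \xi_y \end{bmatrix}, \quad J_L E_x = \xi_x E_x, \quad J_L E_y = \xi_y E_y,$

$J_C(\alpha) = \begin{bmatrix} \cosh\alpha & i\sinh\alpha \\ -i\sinh\alpha & \cosh\alpha \end{bmatrix}, \quad J_C(\alpha)E_R = e^{\alpha}E_R, \quad J_C(\alpha)E_L = e^{-\alpha}E_L,$

where $\xi_x$, $\xi_y$ and $\alpha$ are all complex. Besides, we define $J_C(\infty)$ to be an ideal RHC polarizer and $J_C(-\infty)$ an ideal LHC polarizer. $J_L$ is just a cascade of LD(0°) and LR(0°). Obviously $J_L$ behaves in the same way in reverse, i.e.,

$\hat{J}_L = J_L.$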

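$J_C$ is a cascade of CD and CR, and it can be realized by optically active material. We do not have to discuss the "reversibility" of optically active material, because $J_C$ can also be realized by

$J_C = \mathrm{QWLR}(90°)\,[J_L(e^{\alpha}, e^{-\alpha})\ \text{at}\ 45°]\,\mathrm{QWLR}(0°)$

$= \begin{bmatrix} 1 & 0 \\ 0 & -i \end{bmatrix} \begin{bmatrix} \cosh\alpha & \sinh\alpha \\ \sinh\alpha & \cosh\alpha \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & i \end{bmatrix} = \begin{bmatrix} \cosh\alpha & i\sinh\alpha \\ -i\sinh\alpha & \cosh\alpha \end{bmatrix}.$

By this realization $J_C$ is reversible, i.e.,

$\hat{J}_C = \mathrm{QWLR}(0°)\,[J_L(e^{\alpha}, e^{-\alpha})\ \text{at}\ -45°]\,\mathrm{QWLR}(90°)$

$= \begin{bmatrix} 1 & 0 \\ 0 & i \end{bmatrix} \begin{bmatrix} \cosh\alpha & -\sinh\alpha \\ -\sinh\alpha & \cosh\alpha \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & -i \end{bmatrix} = \begin{bmatrix} \cosh\alpha & i\sinh\alpha \\ -i\sinh\alpha & \cosh\alpha \end{bmatrix} = J_C.$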
We think that $J_L$ and $J_C$ are the most elementary and important reversible Jones matrices. They are very useful in the discussion of the "reversibility" of Jones matrices.

Any Jones matrix J can be expressed in the form

$J = J_C(\alpha) J_L(\xi_x, \xi_y) J_C(\beta),$

since we can solve the following four equations

$J_{11} + J_{22} = (\xi_x + \xi_y)\cosh(\alpha + \beta)$  (3-1a)

$J_{11} - J_{22} = (\xi_x - \xi_y)\cosh(\alpha - \beta)$  (3-1b)

$J_{12} + J_{21} = -i(\xi_x - \xi_y)\sinh(\alpha - \beta)$  (3-1c)

$J_{12} - J_{21} = i(\xi_x + \xi_y)\sinh(\alpha + \beta)$  (3-1d)

to get the four unknowns $\alpha$, $\beta$, $\xi_x$ and $\xi_y$; at least one set of solutions exists. From Eq. (3-1c), the Jones matrix J with $J_{12} + J_{21} = 0$ has the symmetrical expression

$J = J_C(\alpha) J_L(\xi_x, \xi_y) J_C(\alpha)$

and thus

$\hat{J} = \hat{J}_C(\alpha)\hat{J}_L(\xi_x, \xi_y)\hat{J}_C(\alpha) = J_C(\alpha) J_L(\xi_x, \xi_y) J_C(\alpha) = J.$

It is found that $J_{12} + J_{21} = 0$ is the sufficient condition for J to be reversible.
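
Equations (3-1a) to (3-1d) can be checked numerically. The sketch below is an illustration only: it assumes the forms $J_L = \mathrm{diag}(\xi_x, \xi_y)$ and $J_C(\alpha) = [[\cosh\alpha, i\sinh\alpha], [-i\sinh\alpha, \cosh\alpha]]$, and the parameter values and function names are hypothetical.

```python
import cmath

def mul(a, b):
    """2x2 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def j_l(x, y):
    """Assumed linear element J_L(xi_x, xi_y)."""
    return [[x, 0], [0, y]]

def j_c(alpha):
    """Assumed circular element J_C(alpha)."""
    c, s = cmath.cosh(alpha), cmath.sinh(alpha)
    return [[c, 1j * s], [-1j * s, c]]

# Arbitrary complex test parameters (hypothetical values).
alpha, beta = 0.3 + 0.2j, -0.5 + 0.1j
x, y = 1.2 - 0.4j, 0.7 + 0.9j

J = mul(mul(j_c(alpha), j_l(x, y)), j_c(beta))

# Left side minus right side of Eqs. (3-1a)-(3-1d).
residuals = [
    J[0][0] + J[1][1] - (x + y) * cmath.cosh(alpha + beta),
    J[0][0] - J[1][1] - (x - y) * cmath.cosh(alpha - beta),
    J[0][1] + J[1][0] + 1j * (x - y) * cmath.sinh(alpha - beta),
    J[0][1] - J[1][0] - 1j * (x + y) * cmath.sinh(alpha + beta),
]

# Symmetric cascade J_C(alpha) J_L J_C(alpha): off-diagonal sum should vanish.
J_sym = mul(mul(j_c(alpha), j_l(x, y)), j_c(alpha))
sym_offdiag = J_sym[0][1] + J_sym[1][0]
```

All four residuals and `sym_offdiag` vanish to rounding error for these parameters, consistent with the symmetric cascade satisfying $J_{12} + J_{21} = 0$.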

We can go further. Clearly the most general forms of the reversible Jones matrices are

$\cdots J_L'' J_C' J_L J_C' J_L'' \cdots$

and

$\cdots J_C'' J_L' J_C J_L' J_C'' \cdots.$

As $(J_L)_{12} + (J_L)_{21} = 0$ and $(J_C)_{12} + (J_C)_{21} = 0$, it follows that

$(J_C' J_L J_C')_{12} + (J_C' J_L J_C')_{21} = 0$

and

$(J_L' J_C J_L')_{12} + (J_L' J_C J_L')_{21} = 0.$

Just by repeating the same argument it is not difficult to get that

$(\cdots J_L'' J_C' J_L J_C' J_L'' \cdots)_{12} + (\cdots J_L'' J_C' J_L J_C' J_L'' \cdots)_{21} = 0$

and

$(\cdots J_C'' J_L' J_C J_L' J_C'' \cdots)_{12} + (\cdots J_C'' J_L' J_C J_L' J_C'' \cdots)_{21} = 0.$

Therefore $J_{12} + J_{21} = 0$ is also the necessary condition for J to be reversible.

To get more physical insight we have to go back to the Pauli matrices.

Without doubt $\sigma_0$ and $\sigma_1$ are reversible. Since $\sigma_2$ is a HWLR(45°), it is not difficult to understand that it is irreversible and $\hat{\sigma}_2 = -\sigma_2$. $\sigma_3$ is a HWCR and $\sigma_3 = iJ_C(i\pi/2)$. Since $J_C$ is reversible, so is $\sigma_3$. The properties of the Pauli matrices are summarized below:

$\hat{\sigma}_0 = \sigma_0, \quad \hat{\sigma}_1 = \sigma_1, \quad \hat{\sigma}_2 = -\sigma_2, \quad \hat{\sigma}_3 = \sigma_3.$

Suppose the Jones matrix J has the following expression

$J = c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3,$

then we have

$\hat{J} = c_0\hat{\sigma}_0 + c_1\hat{\sigma}_1 + c_2\hat{\sigma}_2 + c_3\hat{\sigma}_3 = c_0\sigma_0 + c_1\sigma_1 - c_2\sigma_2 + c_3\sigma_3.$

The same conclusion follows: J is reversible if and only if J does not contain $\sigma_2$. As $\sigma_2$ is the only irreversible one of the Pauli matrices, the result is quite plausible.
§IV On the Decomposition: $M = \frac{1}{2}(H''U'' + U''H'')$

This note mainly concerns the decomposition

$M = \frac{1}{2}(H''U'' + U''H''),$  (4-1)

for an arbitrary complex matrix M, where H" and U" are a nonnegative definite Hermitian matrix and a unitary matrix, respectively. I would like to call the decomposition of Eq. (4-1) the "symmetric polar decomposition (SPD)", in contrast to the polar decomposition of Eq. (4-2). It is found that not all matrices have the SPD. Besides, H" and U" can be unique only for nonsingular matrices. For convenience, we will use the capital H to denote a nonnegative definite Hermitian matrix, the capital U for a unitary matrix, and $\mathrm{diag}[d_1, \ldots, d_n]$ for a diagonal matrix with diagonal elements $d_1, \ldots, d_n$.

The Polar Decomposition Theorem will play a crucial role in the following discussion, so we restate it below. In addition, several useful Propositions are also cited without proof.

The Polar Decomposition Theorem. ([1], [2]) For any matrix M there exist unique H, H' and a (not necessarily unique) U such that

$M = HU = UH'$  (4-2)

where H and H' are given by

$H = \sqrt{MM^\dagger}, \quad H' = \sqrt{M^\dagger M},$  (4-3)

and U can be determined by solving the linear equation of Eq. (4-2) with H and H' given by Eq. (4-3). Furthermore, when M is nonsingular, U is uniquely given by

$U = H^{-1}M = (\sqrt{MM^\dagger})^{-1}M.$  (4-4)

Proposition 1. ([1]) A matrix M is normal if and only if H and U in Eq. (4-2) commute.

Corollary 1. If M is normal, then it can be written as

$M = HU = UH = \frac{1}{2}(HU + UH)$

where H and U are given by the Polar Decomposition Theorem.

Proposition 2. If $H_1$ is a nonnegative definite Hermitian matrix, then so are $U_1H_1U_1^{-1}$ and $(\omega H_1 + U_1H_1U_1^{-1})$, where $U_1$ is a unitary matrix and $\omega > 0$.
Proposition 3. Suppose $U_2$ is a unitary matrix and $H_2$ is a nonnegative definite Hermitian matrix. If the equation

$X + U_2XU_2^{-1} = H_2$

has a solution X, then there exists a solution X which is Hermitian.

Proposition 4. ([1]) The matrix equation

$AX + XB = C$

has a unique solution if and only if the matrices A and -B have no eigenvalue in common.


If M is normal, the SPD problem is already answered by Corollary 1.

(A) M is normal and nonsingular. By the property of normal matrices, M can be diagonalized by a unitary matrix $U_3$, i.e.,

$M = U_3\,\mathrm{diag}[\rho_1e^{i\phi_1}, \ldots, \rho_ne^{i\phi_n}]\,U_3^{-1}.$  (4-5)

Let

$H'' = U_3\,\mathrm{diag}[\rho_1, \ldots, \rho_n]\,U_3^{-1}$  (4-6)

and

$U'' = U_3\,\mathrm{diag}[e^{i\phi_1}, \ldots, e^{i\phi_n}]\,U_3^{-1}.$  (4-7)

Thus M has the following expression

$M = H''U'' = U''H'' = \frac{1}{2}(H''U'' + U''H'')$

which is the desired SPD.

(B) M is normal but singular. M can be written as

$M = U_4\,\mathrm{diag}[\rho_1e^{i\phi_1}, \ldots, \rho_me^{i\phi_m}, 0, \ldots, 0]\,U_4^{-1}.$

Similarly, let

$H'' = U_4\,\mathrm{diag}[\rho_1, \ldots, \rho_m, 0, \ldots, 0]\,U_4^{-1}$  (4-8)

and

$U'' = U_4\,\mathrm{diag}[e^{i\phi_1}, \ldots, e^{i\phi_m}, e^{i\chi_{m+1}}, \ldots, e^{i\chi_n}]\,U_4^{-1}$  (4-9)

where $0 \le \chi_{m+1}, \ldots, \chi_n < 2\pi$. It follows that

$M = H''U'' = U''H'' = \frac{1}{2}(H''U'' + U''H'').$

Note that, because M is singular, U" is not unique.


Here we consider the most general case. Suppose the SPD of Eq. (4-1) holds for an arbitrary M. First we rewrite Eq. (4-1) as

$M = \{\frac{1}{2}(H'' + U''H''U''^{-1})\}U''.$  (4-10)

Since we suppose H" is a nonnegative definite Hermitian matrix, $\frac{1}{2}(H'' + U''H''U''^{-1})$ is also a nonnegative definite Hermitian matrix by Proposition 2. Applying the Polar Decomposition Theorem, it follows that U" can be determined by the linear equation

$\sqrt{MM^\dagger}\,U'' = M$  (4-11)

or by Eq. (4-4) when M is nonsingular. Note that U" is the same as the U of the polar decomposition. After we get U", H" can be found from the linear equation

$\frac{1}{2}(H'' + U''H''U''^{-1}) = \sqrt{MM^\dagger}$  (4-12)

or, equivalently, from

$H''U'' + U''H'' = 2M.$  (4-13)

Since Eq. (4-12) or (4-13) might not have a nonnegative definite Hermitian solution H", the SPD is not always possible. In order to know whether there is a nonnegative definite Hermitian solution or not, we have to, in general, examine Eq. (4-12) or (4-13) case by case. To my knowledge there is no general principle to follow.

The above-mentioned results can be readily applied to Jones matrices by the following correspondence:

M  <->  J

H"  <->  $J_{HD}$

U"  <->  $J_{HR}$

Since Jones matrices have very low dimension (in fact, 2×2), the SPD problem is much simpler and $J_{HD}$ can be written in closed form:

$J_{HD} = (J + \det(J_{HR})\,J^\dagger)/\mathrm{Tr}(J_{HR}).$  (4-14)

A simple computer program has been written to compute the SPD for real J, and it is found that the SPD is valid only for a restricted class of matrices.
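
The original program is not reproduced in this note. The following is a minimal sketch of how such a computation might look for a nonsingular 2×2 matrix, under stated assumptions: U" is taken from the polar decomposition as in Eq. (4-4), H" from Eq. (4-14), and the SPD is reported as found only if H" is nonnegative definite. The function names, the closed-form 2×2 square root, and the two test matrices are illustrative choices, not taken from the source.

```python
import math

def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def det(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

def trace(a):
    return a[0][0] + a[1][1]

def inv(a):
    d = det(a)
    return [[a[1][1] / d, -a[0][1] / d], [-a[1][0] / d, a[0][0] / d]]

def sqrt_pd(a):
    """Square root of a 2x2 Hermitian positive definite matrix, using
    sqrt(A) = (A + sqrt(det A) I) / sqrt(tr A + 2 sqrt(det A))."""
    s = math.sqrt(det(a).real)
    t = math.sqrt(trace(a).real + 2 * s)
    return [[(a[i][j] + (s if i == j else 0)) / t for j in range(2)]
            for i in range(2)]

def spd(j, tol=1e-9):
    """Return (H'', U'') for nonsingular j, or None when Eq. (4-14)
    does not yield a nonnegative definite H''."""
    j = [[complex(v) for v in row] for row in j]
    jd = dagger(j)
    u = mul(inv(sqrt_pd(mul(j, jd))), j)        # Eq. (4-4)
    tr_u = trace(u)
    if abs(tr_u) < tol:
        return None                              # Eq. (4-14) not usable
    du = det(u)
    h = [[(j[i][k] + du * jd[i][k]) / tr_u for k in range(2)]
         for i in range(2)]                      # Eq. (4-14)
    nonneg = trace(h).real >= -tol and det(h).real >= -tol
    return (h, u) if nonneg else None

result_ok = spd([[1, 1], [0, 1]])    # hypothetical matrix with an SPD
result_bad = spd([[1, 3], [0, 1]])   # hypothetical matrix without one
```

For a 2×2 Hermitian matrix, nonnegative trace and determinant are equivalent to nonnegative eigenvalues, which is the test used here.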


Example 1.

[The matrices of this example are not recoverable from the source. The surviving fragments show a decomposition that depends on a phase $\alpha$ with $0 \le \alpha < 2\pi$ and $\alpha \ne \pi$, and conclude that, since the resulting matrix H" is not a nonnegative definite matrix, the matrix under consideration has no SPD.]
§V On Several Order Independent Equalities

It is well understood that, for any matrices X and Y, if [X,Y] = 0, then

$\exp X \exp Y = \exp(X + Y).$

This is in general not true when [X,Y] ≠ 0, and two questions arise naturally. One is whether one can obtain an explicit formula for exp X exp Y; in other words, what is the exponent on the right-hand side of the equation

$\exp X \exp Y = \exp(?).$

The second one is what kind of operation makes the following equation hold:

$\exp X\,(?)\,\exp Y = \exp(X + Y).$

The first question is answered by the following theorem.

Theorem 1. ([1]) Let X and Y be two complex matrices. Then we have

$\exp X \exp Y = \exp\{X + Y + \frac{1}{2}[X,Y] + \frac{1}{12}[[X,Y],Y] - \frac{1}{12}[[X,Y],X] + \cdots\}$  (5-1)

The higher-order terms are quite complicated, and a recursion formula can be found in [1]. In the theory of Lie groups, Eq. (5-1) is called the Baker-Campbell-Hausdorff (BCH) formula.

Corollary 1. ([2], [3])

(i) $\exp(tX)\exp(tY) = \exp\{t(X + Y) + O(t^2)\}$

(ii) $\exp\{t(X + Y)\} = \exp(tX)\exp(tY)\exp(O(t^2))$

where t is a small real parameter and $O(t^2)$ denotes a term of order $t^2$.

Corollary 2.

(i) $\exp(tX + O'(t^2)) + \exp(tX + O''(t^2)) = \exp(tX + O(t^2))$

where t is a small real parameter and $O'(t^2)$, $O''(t^2)$ and $O(t^2)$ denote terms of order $t^2$.

The next theorem is a consequence of Corollary 1, and it offers an answer to the second question.

Theorem 2. ([2], [3]) For any two complex matrices X and Y, we have

$\lim_{n\to\infty}\{\exp(X/n)\exp(Y/n)\}^n = \exp(X + Y).$  (5-2)

Corollary 3.

(i) $\lim_{n\to\infty}\{\exp(Y/n)\exp(X/n)\}^n = \lim_{n\to\infty}\{\exp(X/n)\exp(Y/n)\}^n = \exp(X + Y)$

(ii) $\lim_{n\to\infty}\{\exp(X/n)\otimes\exp(Y/n)\}^n = \exp(X + Y)$  (5-3)

(iii) $\lim_{n\to\infty}\{\exp(X/n)\exp(Y/n)\exp(Z/n)\}^n = \exp(X + Y + Z)$  (5-4)

(iv) $\lim_{n\to\infty}\{\exp(X/n)\otimes\exp(Y/n)\otimes\exp(Z/n)\}^n = \exp(X + Y + Z)$  (5-5)

We let exp(X) = A and exp(Y) = B and so on; then $\exp(X/n) = A^{1/n}$ and $\exp(Y/n) = B^{1/n}$, etc. So Theorem 2 and Corollary 3 can be rewritten as

$\lim_{n\to\infty}(A^{1/n}B^{1/n})^n = \lim_{n\to\infty}(B^{1/n}A^{1/n})^n = \lim_{n\to\infty}(A^{1/n}\otimes B^{1/n})^n = \exp(\ln A + \ln B) \ne A\otimes B$

and

$\lim_{n\to\infty}(A^{1/n}B^{1/n}C^{1/n})^n = \lim_{n\to\infty}(A^{1/n}\otimes B^{1/n}\otimes C^{1/n})^n = \exp(\ln A + \ln B + \ln C) \ne A\otimes B\otimes C.$

The inequality $\exp(\ln A + \ln B) \ne (A\otimes B)$ is quite obvious, since

$(\sigma_1\otimes\sigma_2) = 0 \ne \exp(\ln\sigma_1 + \ln\sigma_2).$

Therefore, in my opinion, the equalities on pp. 73 and 74 of the lecture note are in general not true.

Since any nonsingular Jones matrix J has the exponential representation

$J = \exp(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3),$  (5-6)

it can be expressed as

$\lim_{n\to\infty}\{\exp[i(d_{0,i}\sigma_0 + d_{1,i}\sigma_1 + d_{2,i}\sigma_2 + d_{3,i}\sigma_3)/n]\,\exp[(d_{0,r}\sigma_0 + d_{1,r}\sigma_1 + d_{2,r}\sigma_2 + d_{3,r}\sigma_3)/n]\}^n = J$  (5-7)

where $d_{k,i}$ and $d_{k,r}$ denote the imaginary part and the real part of $d_k$. It is not difficult to see that $\exp[i(d_{0,i}\sigma_0 + d_{1,i}\sigma_1 + d_{2,i}\sigma_2 + d_{3,i}\sigma_3)/n]$ is a homogeneous retarder and $\exp[(d_{0,r}\sigma_0 + d_{1,r}\sigma_1 + d_{2,r}\sigma_2 + d_{3,r}\sigma_3)/n]$ is a homogeneous diattenuator. So Eq. (5-7) has the interpretation that any nonsingular J is an order-independent cascade of an infinite number of weak $J_{HR}$ and $J_{HD}$.

J can also be expressed as

$\lim_{n\to\infty}\{\exp(d_0\sigma_0/n)\exp(d_1\sigma_1/n)\exp(d_2\sigma_2/n)\exp(d_3\sigma_3/n)\}^n = J.$  (5-8)

Eq. (5-8) has an interpretation similar to that of Eq. (5-7).
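
The limit in Eq. (5-8) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the Pauli matrices follow the convention $\sigma_1 = \mathrm{diag}(1,-1)$, $\sigma_2 = [[0,1],[1,0]]$, $\sigma_3 = [[0,-i],[i,0]]$, the coefficients `d` are hypothetical values, and the matrix exponential is evaluated by a truncated Taylor series, which is adequate only for small-norm matrices such as these.

```python
def mul(a, b):
    """2x2 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

IDENTITY = [[1 + 0j, 0j], [0j, 1 + 0j]]

def expm(m, terms=30):
    """Matrix exponential of a small-norm 2x2 matrix by its Taylor series."""
    result = [row[:] for row in IDENTITY]
    term = [row[:] for row in IDENTITY]
    for n in range(1, terms):
        term = [[v / n for v in row] for row in mul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(2)]
                  for i in range(2)]
    return result

def mat_pow(m, n):
    """m**n by repeated squaring."""
    result = [row[:] for row in IDENTITY]
    while n:
        if n & 1:
            result = mul(result, m)
        m = mul(m, m)
        n >>= 1
    return result

SIGMA = [
    [[1, 0], [0, 1]],       # sigma_0
    [[1, 0], [0, -1]],      # sigma_1
    [[0, 1], [1, 0]],       # sigma_2
    [[0, -1j], [1j, 0]],    # sigma_3
]
d = [0.1 + 0.2j, 0.3 - 0.1j, 0.2 + 0.4j, -0.1 + 0.3j]  # hypothetical values

exponent = [[sum(d[k] * SIGMA[k][i][j] for k in range(4)) for j in range(2)]
            for i in range(2)]
J = expm(exponent)                                      # Eq. (5-6)

def cascade(n):
    """Left side of Eq. (5-8) for a finite n."""
    step = [row[:] for row in IDENTITY]
    for k in range(4):
        weak = [[d[k] * v / n for v in row] for row in SIGMA[k]]
        step = mul(step, expm(weak))
    return mat_pow(step, n)

def max_error(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

err_10 = max_error(cascade(10), J)
err_1000 = max_error(cascade(1000), J)
```

The error shrinks as n grows, which is the behavior expected if the cascade of weak elements converges to J.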

Since the exponential form of J has many order-independent properties, I think Eq. (5-6) is the "canonical" form for a Jones matrix. Specifically,

Element                               Canonical form         Properties
Total Phase and Amplitude             $\exp(d_0\sigma_0)$
Linear Polarization Element at 0°     $\exp(d_1\sigma_1)$    $\Delta\phi = 2d_{1,i}$, $D = \tanh 2d_{1,r}$
Linear Polarization Element at 45°    $\exp(d_2\sigma_2)$    $\Delta\phi = 2d_{2,i}$, $D = \tanh 2d_{2,r}$
Circular Polarization Element         $\exp(d_3\sigma_3)$    $\Delta\phi = 2d_{3,i}$, $D = \tanh 2d_{3,r}$

The results of Theorem 2 and Corollary 3, I believe, are very useful in modeling polarization elements and studying the intrinsic properties of J.
§VI On Several Properties of the Canonical Representations of Jones Matrices

In the following we consider an arbitrary nonsingular Jones matrix J of the form

$J = c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3 = \exp(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3)$

or, in vector notation,

$J = c_0\sigma_0 + c^T\sigma = \exp(d_0\sigma_0 + d^T\sigma)$

where $c = [c_1, c_2, c_3]^T$, $d = [d_1, d_2, d_3]^T$ and $\sigma = [\sigma_1, \sigma_2, \sigma_3]^T$.

Eigenvalues, eigenvectors, determinant and trace

$\xi_{q,r} = c_0 \pm \sqrt{c^Tc} = \exp(d_0 \pm \sqrt{d^Td})$

$\mathrm{Det}(J) = c_0^2 - c^Tc = \exp(2d_0)$

$\mathrm{Tr}(J) = 2c_0 = 2\exp(d_0)\cosh\sqrt{d^Td}$

The matrices $d_0\sigma_0 + d^T\sigma$ and $c_0\sigma_0 + c^T\sigma$ have the same eigenvector(s).

Rotated polarizer

$J(\theta) = R(\theta)JR(-\theta)$
$= c_0\sigma_0 + (c_1\cos 2\theta - c_2\sin 2\theta)\sigma_1 + (c_1\sin 2\theta + c_2\cos 2\theta)\sigma_2 + c_3\sigma_3$
$= \exp\{d_0\sigma_0 + (d_1\cos 2\theta - d_2\sin 2\theta)\sigma_1 + (d_1\sin 2\theta + d_2\cos 2\theta)\sigma_2 + d_3\sigma_3\}.$

Note that the c- and d-coefficients are transformed in the same way under rotation.


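The relation between c- and d-coefficients

Provided $c^Tc \ne 0$ or $d^Td \ne 0$, then

$c_0 = \exp(d_0)\cosh\sqrt{d^Td}, \quad c = \exp(d_0)\,\frac{\sinh\sqrt{d^Td}}{\sqrt{d^Td}}\,d$

and

$d_0 = \frac{1}{2}\ln(c_0^2 - c^Tc), \quad d = \frac{1}{\sqrt{c^Tc}}\,\sinh^{-1}\!\left(\sqrt{\frac{c^Tc}{c_0^2 - c^Tc}}\right)c.$

Note that c and d are proportional to each other. If $c^Tc = 0$ or $d^Td = 0$, then

$c_0 = \exp(d_0), \quad c = \exp(d_0)\,d$

and

$d_0 = \ln(c_0), \quad d = \frac{1}{c_0}\,c.$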
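
The mapping from d- to c-coefficients can be cross-checked against a direct evaluation of the matrix exponential. The sketch below is an illustration only: it assumes the relations $c_0 = e^{d_0}\cosh\delta$ and $c = e^{d_0}(\sinh\delta/\delta)\,d$ with $\delta = \sqrt{d^Td}$, the Pauli convention $\sigma_1 = \mathrm{diag}(1,-1)$, $\sigma_2 = [[0,1],[1,0]]$, $\sigma_3 = [[0,-i],[i,0]]$, and hypothetical coefficient values; the exponential is evaluated by a truncated Taylor series.

```python
import cmath

SIGMA = [
    [[1, 0], [0, 1]],       # sigma_0
    [[1, 0], [0, -1]],      # sigma_1
    [[0, 1], [1, 0]],       # sigma_2
    [[0, -1j], [1j, 0]],    # sigma_3
]

def combine(c0, c):
    """Return c0*sigma_0 + c[0]*sigma_1 + c[1]*sigma_2 + c[2]*sigma_3."""
    coeffs = [c0] + list(c)
    return [[sum(coeffs[k] * SIGMA[k][i][j] for k in range(4))
             for j in range(2)] for i in range(2)]

def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def expm(m, terms=30):
    """Taylor-series matrix exponential for a small-norm 2x2 matrix."""
    result = [[1 + 0j, 0j], [0j, 1 + 0j]]
    term = [[1 + 0j, 0j], [0j, 1 + 0j]]
    for n in range(1, terms):
        term = [[v / n for v in row] for row in mul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(2)]
                  for i in range(2)]
    return result

def d_to_c(d0, d):
    """Assumed closed-form map from d-coefficients to c-coefficients."""
    delta = cmath.sqrt(sum(x * x for x in d))
    ratio = cmath.sinh(delta) / delta if abs(delta) > 1e-12 else 1.0
    c0 = cmath.exp(d0) * cmath.cosh(delta)
    return c0, [cmath.exp(d0) * ratio * x for x in d]

d0, d = 0.1 + 0.2j, [0.3 - 0.1j, 0.2 + 0.4j, -0.1 + 0.3j]  # hypothetical
c0, c = d_to_c(d0, d)

J_closed = combine(c0, c)
J_series = expm(combine(d0, d))
max_diff = max(abs(J_closed[i][j] - J_series[i][j])
               for i in range(2) for j in range(2))
det_gap = abs(c0 * c0 - sum(x * x for x in c) - cmath.exp(2 * d0))
```

Only the forward map is sketched; the inverse map involves a choice of branch for the logarithm and the inverse hyperbolic sine, which this example does not address.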

If $d_0\sigma_0 + d^T\sigma$ is normal, then J is homogeneous. If $d_0\sigma_0 + d^T\sigma$ is Hermitian, then J is an HD. The converses of the above statements are also true provided J is nonsingular. $d_0\sigma_0 + d^T\sigma$ is anti-Hermitian if and only if J is an HR.

For singular J, some of the d-coefficients are infinite. I would like to call them the "singular polarization elements", since they have infinite absorption for some polarization states. The question is how to incorporate the "singular polarization elements" into this canonical representation scheme.
§VII On the States of Extreme Transmittances

1) An Example

We consider an inhomogeneous polarization element with Jones matrix

$J = \begin{bmatrix} 1 & 1 \\ 0 & 1+i \end{bmatrix}.$

Its eigenpolarizations are

$E_q = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad E_r = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ i \end{bmatrix}$

and the corresponding eigenvalues are

$\xi_q = 1, \quad \xi_r = 1+i.$

In  addition  we  have 


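$T_{max}$ and $T_{min}$ are the eigenvalues of $J^\dagger J$, and $E_{max}$ and $E_{min}$ are the corresponding eigenvectors. Since we have

$J^\dagger J = \begin{bmatrix} 1 & 1 \\ 1 & 3 \end{bmatrix},$

it can be found that

$T_{max} = 2 + \sqrt{2}, \quad T_{min} = 2 - \sqrt{2}$

and

$E_{max} = (4 + 2\sqrt{2})^{-1/2}\begin{bmatrix} 1 \\ 1 + \sqrt{2} \end{bmatrix}, \quad E_{min} = (4 - 2\sqrt{2})^{-1/2}\begin{bmatrix} 1 \\ 1 - \sqrt{2} \end{bmatrix}.$

Clearly $E_q$, $E_r$, $E_{max}$ and $E_{min}$ are not on a great circle.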
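
The numbers in this example can be reproduced with a short calculation. The sketch below is an illustration, not the author's method: it takes $J = [[1, 1], [0, 1+i]]$, obtains the extreme transmittances as eigenvalues of $J^\dagger J$, and tests the great-circle claim with the triple product of the Stokes vectors of $E_q$, $E_r$ and $E_{max}$, which vanishes exactly when the three points lie on a common great circle. The Stokes convention and all names are assumptions made for the example.

```python
import math

J = [[1 + 0j, 1 + 0j], [0j, 1 + 1j]]

# G = J^dagger J
G = [[sum(J[k][i].conjugate() * J[k][j] for k in range(2)) for j in range(2)]
     for i in range(2)]

# Eigenvalues of the Hermitian 2x2 matrix G from its trace and determinant.
half_tr = (G[0][0] + G[1][1]).real / 2
det_g = (G[0][0] * G[1][1] - G[0][1] * G[1][0]).real
root = math.sqrt(half_tr ** 2 - det_g)
t_max, t_min = half_tr + root, half_tr - root

def unit(v):
    n = math.sqrt(sum(abs(x) ** 2 for x in v))
    return [x / n for x in v]

# Eigenvector of G for eigenvalue lam: (G01, lam - G00).
E_max = unit([G[0][1], t_max - G[0][0]])
E_q = [1 + 0j, 0j]
E_r = unit([1 + 0j, 1j])

def stokes(e):
    """Reduced Stokes vector (s1, s2, s3) of a unit Jones vector."""
    cross = e[0].conjugate() * e[1]
    return (abs(e[0]) ** 2 - abs(e[1]) ** 2, 2 * cross.real, 2 * cross.imag)

def triple(a, b, c):
    """Scalar triple product a . (b x c)."""
    bx = (b[1] * c[2] - b[2] * c[1],
          b[2] * c[0] - b[0] * c[2],
          b[0] * c[1] - b[1] * c[0])
    return a[0] * bx[0] + a[1] * bx[1] + a[2] * bx[2]

volume = triple(stokes(E_q), stokes(E_r), stokes(E_max))
```

A nonzero `volume` means the three Stokes vectors are not coplanar with the center of the sphere.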

2) Algebraic Picture

The states of extreme transmittances of a PE are important. First, they have interesting properties:

$E_{max}^\dagger E_{min} = 0$ and $(JE_{max})^\dagger(JE_{min}) = 0.$  (1)

In addition, they play a crucial role in discussing the diattenuation. For a homogeneous PE the eigenpolarizations are the states of extreme transmittances. It can be shown that, for an inhomogeneous PE,

$E_{max} = aE_q + bE_r$  (2)

where

a/b = [expression lost in the source].  (3)

A similar formula can be found for $E_{min}$. Since $E_{min}$ is orthogonal to $E_{max}$, we only have to discuss $E_{max}$.

Based on the result in Ref. 1, the following theorem can be formulated.

Theorem: Let $E_3 = aE_1 + bE_2$. $s_1$, $s_2$ and $s_3$ are on a great circle of the Poincaré sphere if and only if $a^* b\,E_1^\dagger E_2$ is real.

Therefore, for an inhomogeneous PE, the eigenpolarizations and the states of extreme transmittances are on the same great circle of the Poincaré sphere if and only if $\xi_q^*\xi_r$ is real. We reconsider the example discussed before:

$J = \begin{bmatrix} 1 & 1 \\ 0 & 1+i \end{bmatrix}.$  (4)

Since we have $\xi_q = 1$ and $\xi_r = 1 + i$, $\xi_q^*\xi_r$ is obviously not real. We thus reach the same conclusion that its eigenpolarizations and states of extreme transmittances are not on the same great circle of the Poincaré sphere.

3) Geometrical Picture

The geometrical picture is more complicated. $E_{max}$ is given by Eqs. (2) and (3), so we obtain

$\cos(\theta_{qm}/2) = |E_q^\dagger E_{max}|$  (5)

and

$\cos(\theta_{rm}/2) = |E_r^\dagger E_{max}|$  (6)

where

$\eta = E_q^\dagger E_r.$  (7)

[The closed-form right-hand sides of Eqs. (5) and (6), which involve $T_{max}$, the eigenvalues and $\eta$, are not recoverable from the source.]

$\theta_{qm}$ is the angle between $s_q$ and $s_{max}$ on the Poincaré sphere; $\theta_{rm}$ is defined similarly.

The Stokes vectors which make a fixed angle with $s_q$ (or $s_r$) constitute a circle on the Poincaré sphere. From Eqs. (5) and (6) we see that $s_{max}$ is located at the intersection of two circles (Fig. 1). Note that two circles can have two intersection points. If [condition illegible in the source] > 0, then $s_{max}$ is chosen such that the sequence of points $s_q s_r s_{max}$ is clockwise. Otherwise $s_{max}$ is chosen such that the sequence of points $s_q s_r s_{max}$ is counter-clockwise (see following figure).


STUDIES  ON  MUELLER  CALCULUS 


Shih-Yau  Lu 
Russell  A.  Chipman 
Department  of  Physics 
University  of  Alabama  in  Huntsville 




On  relations  between  elements  of  the  Mueller  matrix 
-  a  summary  - 

1.  INTRODUCTION 

Any 2×2 complex matrix is a valid Jones matrix. A similar statement is not true for the Mueller matrix; hence there are constraints between the elements of a Mueller matrix. For a non-depolarizing Mueller matrix, certain equalities (in fact, nine) must be satisfied by its elements. Since several papers have been written on this subject, in the following we mainly review the published results. The physical realizability problem is also discussed.

2.  NON-DEPOLARIZING  MUELLER  MATRICES 

In this section we consider only physically realizable Mueller matrices. It is known that a non-depolarizing Mueller matrix, say M, can be derived from a Jones matrix, say J, by

$M = T(J \otimes J^*)T^{-1}$  (1)

or

$J \otimes J^* = T^{-1}MT$  (2)

where

$T = \begin{bmatrix} 1 & 0 & 0 & 1 \\ 1 & 0 & 0 & -1 \\ 0 & 1 & 1 & 0 \\ 0 & i & -i & 0 \end{bmatrix}.$

From Eqs. (3) and (4) we see that the $J_{kl}J_{mn}^*$ (k, l, m, n = 1, 2) are linear functions of the $M_{ij}$ (i, j = 1, 2, 3, 4) (Table 1), and vice versa (Table 2). A Jones matrix has eight degrees of freedom and a Mueller matrix has sixteen. Since a Mueller matrix does not provide information about the total phase, nine equations must be satisfied by its elements.


In  general,  any  relation  of  the  form 

(J  kl'J  (4) 

will yield an equation for elements of the Mueller matrix. Only nine equations are independent, and they are listed in Table 3.

The nine equations listed in Table 3 are not unique. Our choice coincides with Refs. 2 and 3. By summing the first six equations in Table 3, it is obtained that (Refs. 3, 4, 5)


Several authors (Refs. 4-7) argued that Eq. (14) is the necessary and sufficient condition for a Mueller matrix to be non-depolarizing.


The following inequalities (Ref. 3) result directly from the equations in Table 3:

$M_{11} + M_{12} \ge \pm(M_{31} + M_{32}),\ \pm(M_{41} + M_{42})$  (6)

$M_{11} - M_{12} \ge \pm(M_{31} - M_{32}),\ \pm(M_{41} - M_{42})$  (7)

[The remaining inequalities of this set are truncated in the source.]

Eq. (21) indicates that $M_{11}$ is the maximum element of a Mueller matrix.


Barakat's results

The input and output coherency matrices are related by

$C' = JCJ^\dagger.$

The coherency matrix C can be written as

$C = \begin{bmatrix} S_1 + S_2 & S_3 - iS_4 \\ S_3 + iS_4 & S_1 - S_2 \end{bmatrix}.$

By taking determinants of both sides of Eq. (13), we have

$S_1'^2 - S_2'^2 - S_3'^2 - S_4'^2 = |\mathrm{Det}\,J|^2(S_1^2 - S_2^2 - S_3^2 - S_4^2).$

Let G = Diag(1, -1, -1, -1). Then Eq. (15) can be rewritten as

$S'^TGS' = |\mathrm{Det}\,J|^2\,S^TGS$

where $S = (S_1, S_2, S_3, S_4)^T$ and $S' = (S_1', S_2', S_3', S_4')^T$. Since we have

$S' = MS,$

by Eq. (16) it is obtained that

$M^TGM = |\mathrm{Det}\,J|^2\,G.$

Note that

$|\mathrm{Det}\,J|^2 = M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2.$

From Eqs. (18) and (19), nine equations are found:
From  Eqs.  (18)  and  (19),  nine  equations  are  found: 


$M_{11}M_{12} - M_{21}M_{22} - M_{31}M_{32} - M_{41}M_{42} = 0$

$M_{11}M_{13} - M_{21}M_{23} - M_{31}M_{33} - M_{41}M_{43} = 0$

$M_{11}M_{14} - M_{21}M_{24} - M_{31}M_{34} - M_{41}M_{44} = 0$

$M_{12}M_{13} - M_{22}M_{23} - M_{32}M_{33} - M_{42}M_{43} = 0$

$M_{12}M_{14} - M_{22}M_{24} - M_{32}M_{34} - M_{42}M_{44} = 0$

$M_{13}M_{14} - M_{23}M_{24} - M_{33}M_{34} - M_{43}M_{44} = 0$

$M_{12}^2 - M_{22}^2 - M_{32}^2 - M_{42}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$

$M_{13}^2 - M_{23}^2 - M_{33}^2 - M_{43}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$

$M_{14}^2 - M_{24}^2 - M_{34}^2 - M_{44}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$
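
The nine equations are the independent entries of the matrix identity $M^TGM = |\mathrm{Det}\,J|^2G$, and this can be checked numerically for a given Jones matrix. The sketch below is an illustration under stated assumptions: it builds M from $M = T(J\otimes J^*)T^{-1}$ with the matrix T given earlier, uses $T^{-1} = T^\dagger/2$ (which holds for that T), and picks a hypothetical Jones matrix; the function names are not from the source.

```python
def matmul(a, b):
    """General matrix product for lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def kron2(a, b):
    """Kronecker product of two 2x2 matrices (a 4x4 matrix)."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

T = [[1, 0, 0, 1],
     [1, 0, 0, -1],
     [0, 1, 1, 0],
     [0, 1j, -1j, 0]]
# For this T, T T^dagger = 2 I, so the inverse is T^dagger / 2.
T_INV = [[complex(T[j][i]).conjugate() / 2 for j in range(4)]
         for i in range(4)]

def mueller(jones):
    """Mueller matrix of a Jones matrix via M = T (J kron J*) T^-1."""
    conj = [[complex(v).conjugate() for v in row] for row in jones]
    return matmul(matmul(T, kron2(jones, conj)), T_INV)

J = [[1, 1], [0, 1 + 1j]]          # hypothetical Jones matrix
M_complex = mueller(J)
max_imag = max(abs(v.imag) for row in M_complex for v in row)
M = [[v.real for v in row] for row in M_complex]

G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
M_T = [[M[j][i] for j in range(4)] for i in range(4)]
lhs = matmul(matmul(M_T, G), M)
det_sq = abs(J[0][0] * J[1][1] - J[0][1] * J[1][0]) ** 2
residual = max(abs(lhs[i][j] - det_sq * G[i][j])
               for i in range(4) for j in range(4))
```

The six off-diagonal entries of the residual correspond to the first six equations, and the three lower diagonal entries, combined with the (1,1) entry, to the last three.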

3.  DEPOLARIZING  MUELLER  MATRICES 


A depolarizing Mueller matrix results from the average of different non-depolarizing Mueller matrices, as in a scattering process. Another example is light propagating in a random medium. Thus Eqs. (1) and (2) can be rewritten as

$M = T\langle J \otimes J^* \rangle T^{-1}$  (29)

and

$\langle J \otimes J^* \rangle = T^{-1}MT.$  (30)

From the Schwarz inequality, some of the equalities in Table 3 become inequalities. These inequalities are listed in Table 4. It should be noted that these inequalities are conditions satisfied by physically realizable Mueller matrices.

Let 

\  =  >:  \i  ,o,  s  o , .  (3i) 

Cloude (reference number illegible in the source) pointed out that the necessary and sufficient condition for a Mueller matrix to be physically realizable is that the eigenvalues of N are all nonnegative, i.e., N is positive semi-definite.
TABLE 1

$J_{11}J_{11}^* = \frac{1}{2}(M_{11} + M_{12} + M_{21} + M_{22})$  (J1)

$J_{12}J_{12}^* = \frac{1}{2}(M_{11} - M_{12} + M_{21} - M_{22})$  (J2)

$J_{21}J_{21}^* = \frac{1}{2}(M_{11} + M_{12} - M_{21} - M_{22})$  (J3)

$J_{22}J_{22}^* = \frac{1}{2}(M_{11} - M_{12} - M_{21} + M_{22})$  (J4)

$J_{11}J_{12}^* = \frac{1}{2}(M_{13} + M_{23} + iM_{14} + iM_{24})$  (J5)

$J_{11}J_{21}^* = \frac{1}{2}(M_{31} + M_{32} - iM_{41} - iM_{42})$  (J6)

$J_{11}J_{22}^* = \frac{1}{2}(M_{33} + M_{44} + iM_{34} - iM_{43})$  (J7)

$J_{12}J_{21}^* = \frac{1}{2}(M_{33} - M_{44} - iM_{34} - iM_{43})$  (J8)

$J_{12}J_{22}^* = \frac{1}{2}(M_{31} - M_{32} - iM_{41} + iM_{42})$  (J9)

$J_{21}J_{22}^* = \frac{1}{2}(M_{13} - M_{23} + iM_{14} - iM_{24})$  (J10)

$J_{22}J_{21}^* = (J_{21}J_{22}^*)^*$  (J11)

$J_{22}J_{12}^* = (J_{12}J_{22}^*)^*$  (J12)

$J_{21}J_{12}^* = (J_{12}J_{21}^*)^*$  (J13)

$J_{22}J_{11}^* = (J_{11}J_{22}^*)^*$  (J14)

$J_{21}J_{11}^* = (J_{11}J_{21}^*)^*$  (J15)

$J_{12}J_{11}^* = (J_{11}J_{12}^*)^*$  (J16)
TABLE 2

$M_{11} = \frac{1}{2}(J_{11}J_{11}^* + J_{12}J_{12}^* + J_{21}J_{21}^* + J_{22}J_{22}^*)$  (M1)

$M_{12} = \frac{1}{2}(J_{11}J_{11}^* - J_{12}J_{12}^* + J_{21}J_{21}^* - J_{22}J_{22}^*)$  (M2)

$M_{13} = \frac{1}{2}(J_{11}J_{12}^* + J_{12}J_{11}^* + J_{21}J_{22}^* + J_{22}J_{21}^*)$  (M3)

$M_{14} = \frac{i}{2}(-J_{11}J_{12}^* + J_{12}J_{11}^* - J_{21}J_{22}^* + J_{22}J_{21}^*)$  (M4)

$M_{21} = \frac{1}{2}(J_{11}J_{11}^* + J_{12}J_{12}^* - J_{21}J_{21}^* - J_{22}J_{22}^*)$  (M5)

$M_{22} = \frac{1}{2}(J_{11}J_{11}^* - J_{12}J_{12}^* - J_{21}J_{21}^* + J_{22}J_{22}^*)$  (M6)

$M_{23} = \frac{1}{2}(J_{11}J_{12}^* + J_{12}J_{11}^* - J_{21}J_{22}^* - J_{22}J_{21}^*)$  (M7)

$M_{24} = \frac{i}{2}(-J_{11}J_{12}^* + J_{12}J_{11}^* + J_{21}J_{22}^* - J_{22}J_{21}^*)$  (M8)

$M_{31} = \frac{1}{2}(J_{11}J_{21}^* + J_{21}J_{11}^* + J_{12}J_{22}^* + J_{22}J_{12}^*)$  (M9)

$M_{32} = \frac{1}{2}(J_{11}J_{21}^* + J_{21}J_{11}^* - J_{12}J_{22}^* - J_{22}J_{12}^*)$  (M10)

$M_{33} = \frac{1}{2}(J_{11}J_{22}^* + J_{22}J_{11}^* + J_{12}J_{21}^* + J_{21}J_{12}^*)$  (M11)

$M_{34} = \frac{i}{2}(-J_{11}J_{22}^* + J_{22}J_{11}^* + J_{12}J_{21}^* - J_{21}J_{12}^*)$  (M12)

$M_{41} = \frac{i}{2}(J_{11}J_{21}^* - J_{21}J_{11}^* + J_{12}J_{22}^* - J_{22}J_{12}^*)$  (M13)

$M_{42} = \frac{i}{2}(J_{11}J_{21}^* - J_{21}J_{11}^* - J_{12}J_{22}^* + J_{22}J_{12}^*)$  (M14)

$M_{43} = \frac{i}{2}(J_{11}J_{22}^* - J_{22}J_{11}^* + J_{12}J_{21}^* - J_{21}J_{12}^*)$  (M15)

$M_{44} = \frac{1}{2}(J_{11}J_{22}^* + J_{22}J_{11}^* - J_{12}J_{21}^* - J_{21}J_{12}^*)$  (M16)
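TABLE 3

$4(J_{11}J_{11}^*)(J_{21}J_{21}^*) = (M_{11} + M_{12})^2 - (M_{21} + M_{22})^2$
$= (M_{31} + M_{32})^2 + (M_{41} + M_{42})^2 = 4(J_{11}J_{21}^*)(J_{21}J_{11}^*)$

$4(J_{12}J_{12}^*)(J_{22}J_{22}^*) = (M_{11} - M_{12})^2 - (M_{21} - M_{22})^2$
$= (M_{31} - M_{32})^2 + (M_{41} - M_{42})^2 = 4(J_{12}J_{22}^*)(J_{22}J_{12}^*)$

$4(J_{11}J_{11}^*)(J_{12}J_{12}^*) = (M_{11} + M_{21})^2 - (M_{12} + M_{22})^2$
$= (M_{13} + M_{23})^2 + (M_{14} + M_{24})^2 = 4(J_{11}J_{12}^*)(J_{12}J_{11}^*)$

$4(J_{21}J_{21}^*)(J_{22}J_{22}^*) = (M_{11} - M_{21})^2 - (M_{12} - M_{22})^2$
$= (M_{13} - M_{23})^2 + (M_{14} - M_{24})^2 = 4(J_{21}J_{22}^*)(J_{22}J_{21}^*)$

$4(J_{11}J_{11}^*)(J_{22}J_{22}^*) = (M_{11} + M_{22})^2 - (M_{12} + M_{21})^2$
$= (M_{33} + M_{44})^2 + (M_{34} - M_{43})^2 = 4(J_{11}J_{22}^*)(J_{22}J_{11}^*)$

$4(J_{12}J_{12}^*)(J_{21}J_{21}^*) = (M_{11} - M_{22})^2 - (M_{12} - M_{21})^2$
$= (M_{33} - M_{44})^2 + (M_{34} + M_{43})^2 = 4(J_{12}J_{21}^*)(J_{21}J_{12}^*)$

$4\,\mathrm{Re}(J_{11}J_{22}^*J_{12}^*J_{21}) = M_{33}^2 - M_{34}^2 + M_{43}^2 - M_{44}^2$
$= M_{13}^2 - M_{14}^2 - M_{23}^2 + M_{24}^2 = 4\,\mathrm{Re}(J_{11}J_{12}^*J_{21}J_{22}^*)$

$4\,\mathrm{Re}(J_{11}J_{22}^*J_{12}J_{21}^*) = M_{33}^2 + M_{34}^2 - M_{43}^2 - M_{44}^2$
$= M_{31}^2 - M_{32}^2 - M_{41}^2 + M_{42}^2 = 4\,\mathrm{Re}(J_{11}J_{21}^*J_{12}J_{22}^*)$

$4\,\mathrm{Re}(J_{11}J_{21}^*J_{12}^*J_{22}) = M_{31}^2 - M_{32}^2 + M_{41}^2 - M_{42}^2$
$= M_{13}^2 - M_{23}^2 + M_{14}^2 - M_{24}^2 = 4\,\mathrm{Re}(J_{11}J_{12}^*J_{21}^*J_{22})$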
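TABLE 4

$$4\langle J_{11}J_{11}^{*}\rangle\langle J_{21}J_{21}^{*}\rangle = (M_{11}+M_{12})^2 - (M_{21}+M_{22})^2 \ge (M_{31}+M_{32})^2 + (M_{41}+M_{42})^2 = 4\left|\langle J_{11}J_{21}^{*}\rangle\right|^2$$

$$4\langle J_{12}J_{12}^{*}\rangle\langle J_{22}J_{22}^{*}\rangle = (M_{11}-M_{12})^2 - (M_{21}-M_{22})^2 \ge (M_{31}-M_{32})^2 + (M_{41}-M_{42})^2 = 4\left|\langle J_{12}J_{22}^{*}\rangle\right|^2$$

$$4\langle J_{11}J_{11}^{*}\rangle\langle J_{12}J_{12}^{*}\rangle = (M_{11}+M_{21})^2 - (M_{12}+M_{22})^2 \ge (M_{13}+M_{23})^2 + (M_{14}+M_{24})^2 = 4\left|\langle J_{11}J_{12}^{*}\rangle\right|^2$$

$$4\langle J_{21}J_{21}^{*}\rangle\langle J_{22}J_{22}^{*}\rangle = (M_{11}-M_{21})^2 - (M_{12}-M_{22})^2 \ge (M_{13}-M_{23})^2 + (M_{14}-M_{24})^2 = 4\left|\langle J_{21}J_{22}^{*}\rangle\right|^2$$

$$4\langle J_{11}J_{11}^{*}\rangle\langle J_{22}J_{22}^{*}\rangle = (M_{11}+M_{22})^2 - (M_{12}+M_{21})^2 \ge (M_{33}+M_{44})^2 + (M_{34}-M_{43})^2 = 4\left|\langle J_{11}J_{22}^{*}\rangle\right|^2$$

$$4\langle J_{12}J_{12}^{*}\rangle\langle J_{21}J_{21}^{*}\rangle = (M_{11}-M_{22})^2 - (M_{12}-M_{21})^2 \ge (M_{33}-M_{44})^2 + (M_{34}+M_{43})^2 = 4\left|\langle J_{12}J_{21}^{*}\rangle\right|^2$$

$$2\langle J_{11}J_{11}^{*}\rangle = M_{11}+M_{12}+M_{21}+M_{22} \ge 0$$
$$2\langle J_{12}J_{12}^{*}\rangle = M_{11}-M_{12}+M_{21}-M_{22} \ge 0$$
$$2\langle J_{21}J_{21}^{*}\rangle = M_{11}+M_{12}-M_{21}-M_{22} \ge 0$$
$$2\langle J_{22}J_{22}^{*}\rangle = M_{11}-M_{12}-M_{21}+M_{22} \ge 0$$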
A simple derivation of the relations
between  elements  of  non-depolarizing  Mueller  matrix 


4 


This note presents a simpler derivation of Barakat's results on the relations between
the elements of a Mueller matrix. It is a simple physical fact that if the light incident on a
non-depolarizing system is completely polarized, then so is the output light. In other
words, $S_1^2 - S_2^2 - S_3^2 - S_4^2 = 0$ implies $S_1'^2 - S_2'^2 - S_3'^2 - S_4'^2 = 0$, where
$S = (S_1, S_2, S_3, S_4)^T$ and $S' = (S_1', S_2', S_3', S_4')^T$ are the incident and output
Stokes vectors, respectively. Thus the following equation is true:

$$(S_1')^2 - (S_2')^2 - (S_3')^2 - (S_4')^2 = \alpha\,(S_1^2 - S_2^2 - S_3^2 - S_4^2) \tag{1}$$

where $\alpha$ is a proportionality constant depending only on the elements of the Mueller matrix.
Let

$$M = \begin{bmatrix} M_{11} & M_{12} & M_{13} & M_{14} \\ M_{21} & M_{22} & M_{23} & M_{24} \\ M_{31} & M_{32} & M_{33} & M_{34} \\ M_{41} & M_{42} & M_{43} & M_{44} \end{bmatrix}$$

be the Mueller matrix of the system under consideration. For unpolarized incident light,
the output light is $S' = (M_{11}, M_{21}, M_{31}, M_{41})^T$, and then the $\alpha$ of Eq. (1) can be written
as

$$\alpha = M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2. \tag{2}$$

Then we can rewrite Eq. (1) as

$$(S_1')^2 - (S_2')^2 - (S_3')^2 - (S_4')^2 = (M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2)(S_1^2 - S_2^2 - S_3^2 - S_4^2). \tag{3}$$

Barakat derived Eq. (3) by the coherency matrix formalism. Our derivation, which is
based only upon a physical observation, is more straightforward.


Since $S'$ and $S$ are related by

$$S' = MS, \tag{4}$$

from Eq. (3) we have

$$\Bigl(\sum_k M_{1k}S_k\Bigr)^2 - \Bigl(\sum_k M_{2k}S_k\Bigr)^2 - \Bigl(\sum_k M_{3k}S_k\Bigr)^2 - \Bigl(\sum_k M_{4k}S_k\Bigr)^2 = (M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2)(S_1^2 - S_2^2 - S_3^2 - S_4^2). \tag{5}$$

Comparing the coefficients on both sides of Eq. (5) yields the nine relations first obtained by
Barakat:

$$M_{11}M_{12} - M_{21}M_{22} - M_{31}M_{32} - M_{41}M_{42} = 0$$
$$M_{11}M_{13} - M_{21}M_{23} - M_{31}M_{33} - M_{41}M_{43} = 0$$
$$M_{11}M_{14} - M_{21}M_{24} - M_{31}M_{34} - M_{41}M_{44} = 0$$
$$M_{12}M_{13} - M_{22}M_{23} - M_{32}M_{33} - M_{42}M_{43} = 0$$
$$M_{12}M_{14} - M_{22}M_{24} - M_{32}M_{34} - M_{42}M_{44} = 0$$
$$M_{13}M_{14} - M_{23}M_{24} - M_{33}M_{34} - M_{43}M_{44} = 0$$
$$M_{12}^2 - M_{22}^2 - M_{32}^2 - M_{42}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$$
$$M_{13}^2 - M_{23}^2 - M_{33}^2 - M_{43}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$$
$$M_{14}^2 - M_{24}^2 - M_{34}^2 - M_{44}^2 + M_{11}^2 - M_{21}^2 - M_{31}^2 - M_{41}^2 = 0$$
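The nine relations are the entries of the single matrix identity $M^T G M = \alpha G$ with $G = \mathrm{diag}(1,-1,-1,-1)$, which makes them easy to check numerically. The sketch below is an illustration only: the helper names are hypothetical, and the test matrix is assumed to be a product of a horizontal linear diattenuator and two rotations of the Poincaré sphere (retarder-like elements), which is one convenient way to build a non-depolarizing Mueller matrix.

```python
import math

G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def matmul(A, B):
    """Product of two matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(col) for col in zip(*A)]

def diattenuator(p, q):
    """Horizontal linear diattenuator with intensity transmittances p and q."""
    s = math.sqrt(p * q)
    return [[(p + q) / 2, (p - q) / 2, 0, 0],
            [(p - q) / 2, (p + q) / 2, 0, 0],
            [0, 0, s, 0],
            [0, 0, 0, s]]

def sphere_rotation(i, j, theta):
    """Rotation by theta in the (S_i, S_j) plane; i and j are 1-based, in 2..4."""
    R = [[1.0 if r == c else 0.0 for c in range(4)] for r in range(4)]
    a, b = i - 1, j - 1
    R[a][a] = R[b][b] = math.cos(theta)
    R[a][b] = math.sin(theta)
    R[b][a] = -math.sin(theta)
    return R

def max_residual(M):
    """Largest |entry| of M^T G M - alpha*G; zero when all nine relations hold."""
    alpha = M[0][0] ** 2 - M[1][0] ** 2 - M[2][0] ** 2 - M[3][0] ** 2
    MtGM = matmul(transpose(M), matmul(G, M))
    return max(abs(MtGM[r][c] - alpha * G[r][c])
               for r in range(4) for c in range(4))
```

Under these assumptions, `max_residual` is at rounding level for the product of the three elements above, while a partial depolarizer such as diag(1, 0.5, 0.5, 0.5) leaves a residual of 0.75 and so violates the relations.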


REFERENCE 
A study of the Mueller matrix in exponential form


1.  A  PARTITION  OF  THE  MUELLER  MATRIX 


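Let the Mueller matrix under consideration be denoted by

$$M = \begin{bmatrix} M_{11} & M_{12} & M_{13} & M_{14} \\ M_{21} & M_{22} & M_{23} & M_{24} \\ M_{31} & M_{32} & M_{33} & M_{34} \\ M_{41} & M_{42} & M_{43} & M_{44} \end{bmatrix} = \begin{bmatrix} M_{11} & A^T \\ B & C \end{bmatrix} \tag{1}$$

where

$$A = \begin{bmatrix} M_{12} \\ M_{13} \\ M_{14} \end{bmatrix}, \quad B = \begin{bmatrix} M_{21} \\ M_{31} \\ M_{41} \end{bmatrix}, \quad \text{and} \quad C = \begin{bmatrix} M_{22} & M_{23} & M_{24} \\ M_{32} & M_{33} & M_{34} \\ M_{42} & M_{43} & M_{44} \end{bmatrix}.$$

The partition of a Mueller matrix in Eq. (1) will prove useful. For example,
under a rotation on the Poincaré sphere, $M$ becomes

$$\begin{bmatrix} 1 & 0 \\ 0 & R \end{bmatrix} \begin{bmatrix} M_{11} & A^T \\ B & C \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & R^T \end{bmatrix} = \begin{bmatrix} M_{11} & (RA)^T \\ RB & RCR^T \end{bmatrix} \tag{2}$$

where $R$ is a 3-dimensional rotation matrix, and
$R^T = R^{-1}$.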


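Besides, the equation

$$S' = MS \tag{3}$$

can be written as

$$S_1' = M_{11}S_1 + A^T\mathbf{s} \tag{4}$$

and

$$\mathbf{s}' = BS_1 + C\mathbf{s} \tag{5}$$

in which

$$S = \begin{bmatrix} S_1 \\ \mathbf{s} \end{bmatrix} \quad \text{and} \quad S' = \begin{bmatrix} S_1' \\ \mathbf{s}' \end{bmatrix}. \tag{6}$$

In the next section, we will use the above partition to discuss the exponential form of the Mueller
matrix.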


2.  THE  EXPONENTIAL  FORM 


It was pointed out by Azzam that the Stokes vector of light propagating in a continuous
medium, homogeneous or not, can be described by a differential equation:

$$\frac{dS}{dz} = mS. \tag{7}$$

In case $m$ is independent of $z$, the general solution of Eq. (7) is

$$S(z) = e^{mz}S(0), \tag{8}$$

and we have

$$M(z) = e^{mz} \tag{9}$$

such that

$$S(z) = M(z)S(0). \tag{10}$$

Thus $m$ can be regarded as an exponential representation of the Mueller matrix.


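The Stokes vector satisfies Eq. (7), but we want to know the equation satisfied by the
degree of polarization. First, we partition $m$ into

$$m = \begin{bmatrix} m_{11} & a^T \\ b & c \end{bmatrix} \tag{11}$$

in the same way as in Eq. (1). Then Eq. (7) can be rewritten as

$$\frac{dS_1}{dz} = m_{11}S_1 + a^T\mathbf{s} \tag{12}$$

and

$$\frac{d\mathbf{s}}{dz} = bS_1 + c\,\mathbf{s}. \tag{13}$$

And we have

$$P^2 = \frac{\mathbf{s}^T\mathbf{s}}{S_1^2}$$

and

$$\frac{d}{dz}P^2 = \frac{2}{S_1^2}\left[\mathbf{s}^T(c - m_{11}I)\,\mathbf{s} + S_1 b^T\mathbf{s} - \frac{\mathbf{s}^T\mathbf{s}}{S_1}\,a^T\mathbf{s}\right] \tag{14}$$

in which $P$ denotes the degree of polarization (DOP). Eq. (14) is the equation governing
the square of the DOP.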


3.  NON-DEPOLARIZING  CASES 


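For a non-depolarizing medium, completely polarized light can only produce completely
polarized light, i.e., $dP^2/dz = 0$ if $\mathbf{s}^T\mathbf{s} = S_1^2$. In this case Eq. (14) can be written
as

$$\frac{d}{dz}P^2 = \frac{2}{S_1^2}\left[\mathbf{s}^T(c - m_{11}I)\,\mathbf{s} + S_1(b^T - a^T)\,\mathbf{s}\right] \tag{15}$$

or

$$\frac{d}{dz}P^2 = 2\left\{\hat{\mathbf{s}}^T(c - m_{11}I)\,\hat{\mathbf{s}} + (b - a)^T\hat{\mathbf{s}}\right\} \tag{16}$$

in which $\hat{\mathbf{s}}$ denotes the unit vector

$$\hat{\mathbf{s}} = \begin{bmatrix} S_2/S_1 \\ S_3/S_1 \\ S_4/S_1 \end{bmatrix}.$$

Eq. (15) or (16) equals zero for every $\hat{\mathbf{s}}$ only if $c - m_{11}I$ is anti-symmetric
and $a = b$. Therefore, for a non-depolarizing medium, the general form of $m$ is

$$m = \begin{bmatrix} \alpha & \beta & \gamma & \delta \\ \beta & \alpha & \mu & \nu \\ \gamma & -\mu & \alpha & \eta \\ \delta & -\nu & -\eta & \alpha \end{bmatrix}. \tag{17}$$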

This is exactly the result obtained by Azzam in a paper in 1978. It is quite clear, as
pointed out by Azzam, that the meaning of the parameters in Eq. (17) can be explained as
follows:

$\alpha$    total absorption
$\beta$    linear diattenuation
$\gamma$    linear diattenuation at 45°
$\delta$    circular diattenuation
$\eta$    linear retardance
$\nu$    linear retardance at 45°
$\mu$    circular retardance
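A small numerical sketch can illustrate Eqs. (9) and (17): building $m$ in the non-depolarizing form, exponentiating it, and confirming that completely polarized input stays completely polarized. Everything here is an assumption for illustration, including the helper names, the parameter values, and the scaling-and-squaring Taylor series used in place of a library matrix exponential.

```python
import math

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(A, squarings=10, terms=20):
    """exp(A) by Taylor series on A / 2**squarings, followed by repeated squaring."""
    n = len(A)
    X = [[a / 2 ** squarings for a in row] for row in A]
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, X)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result

def nondepolarizing_m(alpha, beta, gamma, delta, mu, nu, eta):
    """Differential matrix in the form of Eq. (17)."""
    return [[alpha, beta, gamma, delta],
            [beta, alpha, mu, nu],
            [gamma, -mu, alpha, eta],
            [delta, -nu, -eta, alpha]]

def propagate(m, z, S0):
    """S(z) = exp(m z) S(0), Eqs. (8)-(10), for z-independent m."""
    M = mat_exp([[z * v for v in row] for row in m])
    return [sum(M[i][k] * S0[k] for k in range(4)) for i in range(4)]

def dop(S):
    """Degree of polarization of a Stokes vector."""
    return math.sqrt(S[1] ** 2 + S[2] ** 2 + S[3] ** 2) / S[0]
```

With illustrative values such as `nondepolarizing_m(-0.2, 0.1, 0.05, -0.03, 0.4, -0.3, 0.25)` and the fully polarized input `[1.0, 0.6, 0.0, 0.8]`, `dop(propagate(m, 2.0, S0))` stays at 1 to within rounding, as the argument leading to Eq. (17) requires.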
On the relations between the elements
of  a  Mueller  matrix 


Consider an arbitrary non-depolarizing system. Its Mueller matrix and corresponding
Jones matrix are denoted by $M = [M_{mn}]$ and $J = [J_{mn}]$. It is known that

$$|J_{11}|^2 = \tfrac{1}{2}(M_{11} + M_{12} + M_{21} + M_{22}) \ge 0 \tag{1}$$

$$|J_{12}|^2 = \tfrac{1}{2}(M_{11} - M_{12} + M_{21} - M_{22}) \ge 0 \tag{2}$$

$$|J_{21}|^2 = \tfrac{1}{2}(M_{11} + M_{12} - M_{21} - M_{22}) \ge 0 \tag{3}$$

$$|J_{22}|^2 = \tfrac{1}{2}(M_{11} - M_{12} - M_{21} + M_{22}) \ge 0. \tag{4}$$

The product of physically realizable Mueller matrices is a physically realizable Mueller
matrix. Moreover, the product of non-depolarizing Mueller matrices is a non-depolarizing
Mueller matrix. The Mueller and Jones matrices of a left-handed QWCR are,
respectively,

$$M_c = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \qquad J_c = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ -1 & 1 \end{bmatrix}.$$
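So the following Mueller matrix $M'$ is non-depolarizing, and its Jones matrix is denoted
by $J'$:

$$M' = M_c M M_c^T = \begin{bmatrix} M_{11} & M_{13} & -M_{12} & M_{14} \\ M_{31} & M_{33} & -M_{32} & M_{34} \\ -M_{21} & -M_{23} & M_{22} & -M_{24} \\ M_{41} & M_{43} & -M_{42} & M_{44} \end{bmatrix}$$

$$J' = J_c J J_c^{\dagger} = \frac{1}{2}\begin{bmatrix} J_{11} + J_{12} + J_{21} + J_{22} & -J_{11} + J_{12} - J_{21} + J_{22} \\ -J_{11} - J_{12} + J_{21} + J_{22} & J_{11} - J_{12} - J_{21} + J_{22} \end{bmatrix}$$

And then we have

$$\tfrac{1}{4}|J_{11} + J_{12} + J_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} + M_{13} + M_{31} + M_{33}) \ge 0 \tag{5}$$

$$\tfrac{1}{4}|-J_{11} + J_{12} - J_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} - M_{13} + M_{31} - M_{33}) \ge 0 \tag{6}$$

$$\tfrac{1}{4}|-J_{11} - J_{12} + J_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} + M_{13} - M_{31} - M_{33}) \ge 0 \tag{7}$$

$$\tfrac{1}{4}|J_{11} - J_{12} - J_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} - M_{13} - M_{31} + M_{33}) \ge 0. \tag{8}$$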

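The Mueller and Jones matrices of a QWLR at 135° are

$$M_l = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & -1 & 0 & 0 \end{bmatrix}, \qquad J_l = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 & -i \\ -i & 1 \end{bmatrix}.$$

The following Mueller matrix $M''$ is non-depolarizing, and its Jones matrix is denoted by
$J''$:

$$M'' = M_l M M_l^T = \begin{bmatrix} M_{11} & M_{14} & M_{13} & -M_{12} \\ M_{41} & M_{44} & M_{43} & -M_{42} \\ M_{31} & M_{34} & M_{33} & -M_{32} \\ -M_{21} & -M_{24} & -M_{23} & M_{22} \end{bmatrix}$$

$$J'' = J_l J J_l^{\dagger} = \frac{1}{2}\begin{bmatrix} J_{11} + iJ_{12} - iJ_{21} + J_{22} & iJ_{11} + J_{12} + J_{21} - iJ_{22} \\ -iJ_{11} + J_{12} + J_{21} + iJ_{22} & J_{11} - iJ_{12} + iJ_{21} + J_{22} \end{bmatrix}$$

And then we have

$$\tfrac{1}{4}|J_{11} + iJ_{12} - iJ_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} + M_{14} + M_{41} + M_{44}) \ge 0 \tag{9}$$

$$\tfrac{1}{4}|iJ_{11} + J_{12} + J_{21} - iJ_{22}|^2 = \tfrac{1}{2}(M_{11} - M_{14} + M_{41} - M_{44}) \ge 0 \tag{10}$$

$$\tfrac{1}{4}|-iJ_{11} + J_{12} + J_{21} + iJ_{22}|^2 = \tfrac{1}{2}(M_{11} + M_{14} - M_{41} - M_{44}) \ge 0 \tag{11}$$

$$\tfrac{1}{4}|J_{11} - iJ_{12} + iJ_{21} + J_{22}|^2 = \tfrac{1}{2}(M_{11} - M_{14} - M_{41} + M_{44}) \ge 0. \tag{12}$$

Eqs. (5-8) or (9-12) can be regarded as a generalization of Eqs. (1-4).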
On the non-depolarizing Mueller matrices


In this note, a statement in the book by Azzam and Bashara is shown to be
incorrect, which clarifies the concept of a non-depolarizing Mueller matrix.

The following words are copied from the book by Azzam and Bashara:

Therefore, for a non-depolarizing optical system, the degree of
polarization [§1.8] of the output light from the system $P_o$ is either greater
than or equal to the degree of polarization of the input light $P_i$, or,

$$P_o \ge P_i \tag{2.222a}$$

for all incident states of (total or partial) polarization. The case of
depolarizing optical systems that include incoherent scattering processes
and for which

$$P_o < P_i \tag{2.222b}$$

at least for one incident state will be studied in §2.12.


But some non-depolarizing systems can indeed have a 'depolarization effect' for some
partially polarized states. For example, consider a non-depolarizing system, in fact a linear
diattenuator, with a Jones matrix

$$J = \begin{bmatrix} 1 & 0 \\ 0 & 1/\sqrt{2} \end{bmatrix}.$$

When the incident light is partially polarized light with a coherency matrix

$$C_i = \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix},$$

the coherency matrix of the output light is then

$$C_o = J C_i J^{\dagger} = \begin{bmatrix} 1 & 0 \\ 0 & 1/\sqrt{2} \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & 1/\sqrt{2} \end{bmatrix} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}.$$

In this example, the output light can be unpolarized, even though the incident light is not.
Therefore, $P_o \ge P_i$ is not always true for a non-depolarizing system. This example
indicates that a non-depolarizing system may decrease the DOP for partially polarized
incident light. The statement quoted above is not true.
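The counterexample can be reproduced in a few lines. This sketch assumes the standard expression $P = \sqrt{1 - 4\det C/(\mathrm{tr}\,C)^2}$ for the degree of polarization of a 2x2 coherency matrix; the function names are hypothetical.

```python
import math

def output_coherency(J, C):
    """Return J C J^dagger for 2x2 matrices stored as nested lists."""
    def mul(A, B):
        return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
                for i in range(2)]
    J_dagger = [[complex(J[c][r]).conjugate() for c in range(2)] for r in range(2)]
    return mul(mul(J, C), J_dagger)

def dop(C):
    """Degree of polarization from a coherency matrix (assumed standard formula)."""
    trace = (C[0][0] + C[1][1]).real
    det = (C[0][0] * C[1][1] - C[0][1] * C[1][0]).real
    return math.sqrt(max(0.0, 1.0 - 4.0 * det / trace ** 2))

J = [[1, 0], [0, 1 / math.sqrt(2)]]      # linear diattenuator from the text
C_in = [[1, 0], [0, 2]]                  # partially polarized input
C_out = output_coherency(J, C_in)
```

Here `dop(C_in)` evaluates to 1/3 while `dop(C_out)` is zero to within rounding, so this non-depolarizing element lowers the DOP of a partially polarized input. A completely polarized input such as `[[1, 1], [1, 1]]` keeps a DOP of 1 after the same element.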

The definition of a non-depolarizing Mueller matrix is that

(a) this Mueller matrix can be derived from a Jones matrix, or

(b) completely polarized light produces only completely polarized light.

Definition (b) is a simple but powerful statement. From it all the properties of the
non-depolarizing Mueller matrix can be derived. On the other hand, the definition of a
depolarizing Mueller matrix is that completely polarized light may produce partially polarized
light. It should be noted that a non-depolarizing Mueller matrix does not decrease the
DOP for completely polarized light, but a depolarizing Mueller matrix does not increase
the DOP for completely polarized light. In other words, Azzam and Bashara's Eqs.
(2.222a) and (2.222b) are true only for completely polarized incident light, not for all
incident states of (total or partial) polarization.
Statement of Work


Mueller matrices are now being routinely measured.

Their interpretation in terms of the associated polarization
phenomena is not yet clear.

Also, how do we interpret inhomogeneous polarization
elements?
Outline of Research Program


1 .  Analyze  inhomogeneous  polarization  elements 

within  the  Jones  calculus 

2.  Analyze  the  exponential  form  of  Jones  matrices 

3.  Establish  and  publish  data  reduction  algorithms  for 

Jones  matrices 

4.  Map  these  results  into  the  non-depolarizing 

subspace  of  the  Mueller  matrices 

5.  Establish  algorithms  for  data  reduction  of 

non-depolarizing  Mueller  matrices 

6.  Establish  and  publish  data  reduction  algorithms  for 

Mueller  calculus 
Reasons for Interest in Inhomogeneous
Polarization Elements


1.  Combinations of polarization elements are usually
    inhomogeneous.

2.  Skew rays through optical systems are slightly
    inhomogeneous.

3.  It is necessary to separate the inhomogeneous part
    of the Mueller matrix from the depolarizing part in
    data reduction.
Properties of Inhomogeneous Polarization Elements


1.  Established a measure of inhomogeneity

    $\eta = |E_q^{\dagger} E_r|$, $0 \le \eta \le 1$.

    $E_q$ and $E_r$ are the eigenpolarizations.

2.  $\eta$ relates to the nonorthogonality of the
    eigenpolarizations.

3.  $T_{max}$ and $T_{min}$ are associated with orthogonal
    polarization states.

4.  Polarization behavior is well characterized by the
    eigenvalues and $\eta$.

5.  Definitions of diattenuation and retardance for
    inhomogeneous polarization elements.
Homogeneous and Inhomogeneous
Polarization Elements


Eigenpolarizations

$$J E_q = \xi_q E_q, \qquad J E_r = \xi_r E_r$$

Homogeneous polarization elements
Orthogonal eigenpolarizations

$$E_q^{\dagger} E_r = 0$$

Diattenuation: $\mathcal{D} = \dfrac{\bigl|\,|\xi_q|^2 - |\xi_r|^2\,\bigr|}{|\xi_q|^2 + |\xi_r|^2}$, $0 \le \mathcal{D} \le 1$

Retardance: $\mathcal{R} = |\delta_q - \delta_r|$, $0 \le \mathcal{R} \le \pi$

Note: $|\xi_q|^2$, $|\xi_r|^2$ are $T_{max}$, $T_{min}$;
$E_q$, $E_r$ are $E_{max}$, $E_{min}$

Inhomogeneous polarization elements
Nonorthogonal eigenpolarizations

$$E_q^{\dagger} E_r \ne 0$$

What are diattenuation and retardance?
$0 \le \eta \le 1$

$\eta = 0$ for homogeneous polarization elements
$\eta$ = inhomogeneity

$\eta = \cos(\chi/2)$ or $\chi = 2\cos^{-1}\eta$; $\chi$ is the angle between
eigenpolarizations on the Poincaré sphere


[Figure: Poincaré sphere]


Polar Decomposition of Jones Matrices

Any polarization element is equivalent to a cascade of a
homogeneous diattenuator and a homogeneous
retarder.


$J_D$ and $J_D'$ are uniquely defined

In general, $J_D \ne J_D'$, but they have the same eigenvalues

$J_R$ is unique only when $\det J \ne 0$
Eigenvalues of $J_D$ (or $J_D'$) are $\sqrt{T_{max}}$, $\sqrt{T_{min}}$

$$E_{max}^{\dagger} E_{min} = 0$$

$$(J E_{max})^{\dagger} (J E_{min}) = 0$$

$$\mathcal{D} = \frac{T_{max} - T_{min}}{T_{max} + T_{min}}$$

$\mathcal{D}$ is equal to the DOP of the output light when the incident
light is unpolarized, i.e.,

$$\mathcal{D} = \mathrm{DOP}(JU)$$

$$U = E_{max} \oplus E_{min}$$
$$JU = JE_{max} \oplus JE_{min}$$

If $\det J = 0$,

$\mathcal{D} = 1$, $\mathcal{R} = 0$

polarizer: output light is always at a fixed polarization
state
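[Figure: Poincaré Sphere: Homogeneous]


[Figure: Poincaré Sphere: Inhomogeneous]


Inhomogeneous Polarization Elements


$T_{max}$ and $T_{min}$ are extreme values of $T$

$$T_{max}T_{min} = |\xi_q|^2|\xi_r|^2 = |\det J|^2$$

$$T_{max} + T_{min} = \frac{|\xi_q|^2 + |\xi_r|^2 - \eta^2(\xi_q\xi_r^{*} + \xi_q^{*}\xi_r)}{1 - \eta^2}$$

Thus $\mathcal{D}$ and $\mathcal{R}$ are given by

$$\mathcal{D} = \left\{1 - \frac{4(1-\eta^2)^2|\xi_q|^2|\xi_r|^2}{\left[|\xi_q|^2 + |\xi_r|^2 - \eta^2(\xi_q\xi_r^{*} + \xi_q^{*}\xi_r)\right]^2}\right\}^{1/2}$$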

%  =  2  cos  1 


(iV)(|*,H*r>|2 


COS 
Degenerate polarization elements


Only  one  eigenpolarization  E  q 

Example:lP(0°)//l/Li?(45°)  =  g  q  *i  o 


o  l 
o  o 


Redefine  |(J-i,ql)F|=Ti  where  F  is  normal  to  the 
eigenpolarization  Er 


v  = 


2  2*n2 


2  +  r\2 


^  =  2cos 


There  exists  a  unitary  matrix  Li  such  that 

. —  r?,  nl 


uju  = 


o  t 
A Unitary Equivalence Theorem and
Classification of Polarization Elements


Any two Jones matrices with the same eigenvalues and
$\eta$ are unitarily equivalent, i.e., there exists a unitary
matrix $U$ such that

$$U J_1 U^{\dagger} = J_2.$$

$(\xi_q, \xi_r; \eta)$ can be used to denote the set of unitarily
equivalent polarization elements with the same
eigenvalues and $\eta$.
Exponential Representation of Jones Matrices


Examples of decomposition

(1)  $J = (a_0\sigma_0 + a_1\sigma_1 + a_2\sigma_2 + a_3\sigma_3)\exp i(b_0\sigma_0 + b_1\sigma_1 + b_2\sigma_2 + b_3\sigma_3)$

(2)  $J = c_0\sigma_0 + c_1\sigma_1 + c_2\sigma_2 + c_3\sigma_3$

(3)  $J = \exp i(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3)$


What is the best form for interpreting the properties
(i.e., diattenuation, retardance, inhomogeneity) of
polarization elements?

(1)  has its use

(2)  is better for routine manipulation

(3)  is better to address deeper questions of the
     fundamental properties and the structure of the
     calculus.
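An identity

$$\lim_{N\to\infty}\left(\exp\frac{J_1}{N}\,\exp\frac{J_2}{N}\right)^N = \lim_{N\to\infty}\left(\exp\frac{J_2}{N}\,\exp\frac{J_1}{N}\right)^N = \exp(J_1 + J_2)$$


Generalization

$$\lim_{N\to\infty}\left(\exp\frac{d_0\sigma_0}{N}\,\exp\frac{d_1\sigma_1}{N}\,\exp\frac{d_2\sigma_2}{N}\,\exp\frac{d_3\sigma_3}{N}\right)^N = \exp(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3)$$


Similarly

$$\lim_{N\to\infty}\left(\exp\frac{\Re d_0\,\sigma_0 + \Re d_1\,\sigma_1 + \Re d_2\,\sigma_2 + \Re d_3\,\sigma_3}{N}\;\exp\frac{i(\Im d_0\,\sigma_0 + \Im d_1\,\sigma_1 + \Im d_2\,\sigma_2 + \Im d_3\,\sigma_3)}{N}\right)^N = \exp(d_0\sigma_0 + d_1\sigma_1 + d_2\sigma_2 + d_3\sigma_3)$$

$\Re d_0$ = real part of $d_0$

$\Im d_0$ = imaginary part of $d_0$

etc.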
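The generalized limit can be checked numerically. The sketch below is illustrative only: it assumes the Pauli ordering implied by the canonical forms ($\sigma_1$ for linear at 0°, $\sigma_2$ for linear at 45°, $\sigma_3$ for circular), uses the closed form $\exp(\mathbf{d}\cdot\boldsymbol{\sigma}) = \cosh r\,I + (\sinh r / r)\,\mathbf{d}\cdot\boldsymbol{\sigma}$ with $r^2 = d_1^2 + d_2^2 + d_3^2$, and all function names are hypothetical.

```python
import cmath

SIGMA = [
    [[1, 0], [0, 1]],       # sigma_0 (identity)
    [[1, 0], [0, -1]],      # sigma_1, assumed: linear, 0 degrees
    [[0, 1], [1, 0]],       # sigma_2, assumed: linear, 45 degrees
    [[0, -1j], [1j, 0]],    # sigma_3, assumed: circular
]

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def exp_pauli(d):
    """exp(d0*s0 + d1*s1 + d2*s2 + d3*s3) in closed form."""
    d0, d1, d2, d3 = d
    r = cmath.sqrt(d1 * d1 + d2 * d2 + d3 * d3)
    ch = cmath.cosh(r)
    sh_over_r = cmath.sinh(r) / r if r != 0 else 1.0
    A = [[d1 * SIGMA[1][i][j] + d2 * SIGMA[2][i][j] + d3 * SIGMA[3][i][j]
          for j in range(2)] for i in range(2)]
    scale = cmath.exp(d0)
    return [[scale * (ch * SIGMA[0][i][j] + sh_over_r * A[i][j])
             for j in range(2)] for i in range(2)]

def split_product(d, N):
    """(exp(d0 s0/N) exp(d1 s1/N) exp(d2 s2/N) exp(d3 s3/N))**N."""
    step = [[1, 0], [0, 1]]
    for k in range(4):
        single = [0, 0, 0, 0]
        single[k] = d[k] / N
        step = mat_mul(step, exp_pauli(single))
    result = [[1, 0], [0, 1]]
    for _ in range(N):
        result = mat_mul(result, step)
    return result

def split_error(d, N):
    """Largest entrywise deviation of the split product from exp(d . sigma)."""
    exact, approx = exp_pauli(d), split_product(d, N)
    return max(abs(exact[i][j] - approx[i][j]) for i in range(2) for j in range(2))
```

For a test vector such as `d = (0.1 + 0.2j, 0.3, 0.5j, -0.2 + 0.1j)`, `split_error(d, N)` shrinks roughly in proportion to 1/N, consistent with the limit stated above.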
Canonical Forms


Total amplitude and phase

$\exp(d_0\sigma_0)$

Linear polarization element at 0°

$\exp(d_1\sigma_1)$

$\Delta\phi = 2\,\Im d_1$, $\mathcal{D} = \tanh 2\,\Re d_1$

Linear polarization element at 45°

$\exp(d_2\sigma_2)$

$\Delta\phi = 2\,\Im d_2$, $\mathcal{D} = \tanh 2\,\Re d_2$

Circular polarization element

$\exp(d_3\sigma_3)$

$\Delta\phi = 2\,\Im d_3$, $\mathcal{D} = \tanh 2\,\Re d_3$
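The first canonical form is easy to verify directly. The sketch assumes $\sigma_1 = \mathrm{diag}(1, -1)$, so that $\exp(d_1\sigma_1) = \mathrm{diag}(e^{d_1}, e^{-d_1})$; the function names are hypothetical, and the diattenuation is kept signed so that it can be compared with $\tanh 2\,\Re d_1$.

```python
import cmath
import math

def linear_element_0deg(d1):
    """Jones matrix exp(d1*sigma_1) = diag(e^{d1}, e^{-d1}), assuming sigma_1 = diag(1, -1)."""
    return [[cmath.exp(d1), 0], [0, cmath.exp(-d1)]]

def diattenuation_and_retardance(J):
    """For a diagonal Jones matrix: signed diattenuation and eigenvalue phase difference."""
    t1, t2 = abs(J[0][0]) ** 2, abs(J[1][1]) ** 2
    diattenuation = (t1 - t2) / (t1 + t2)
    retardance = cmath.phase(J[0][0]) - cmath.phase(J[1][1])
    return diattenuation, retardance

d1 = 0.3 + 0.4j
D, dphi = diattenuation_and_retardance(linear_element_0deg(d1))
```

With these values `D` agrees with `math.tanh(2 * d1.real)` and `dphi` with `2 * d1.imag`, provided the imaginary part stays below $\pi$ in magnitude so the phases are not wrapped.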


COMPARISON OF THE JONES AND MUELLER CALCULUS


                                      Jones                  Mueller

Type                                  Amplitude              Intensity

Polarization State                    Jones Vector           Stokes Vector
                                      2 complex elements     4 real elements
                                      4 parameters           4 parameters

Describes Absolute Phase              Yes                    No

Describes Partially Polarized Light   No                     Yes

Polarization Elements                 Jones Matrix           Mueller Matrix
                                      2x2 complex elements   4x4 real elements
                                      8 parameters           16 parameters

Describes Phase Change                Yes                    No

Describes Depolarization              No                     Yes

Ease of Use                           Simpler                Involved

Strengths                             Theory                 Experimental

Applications
  Experimental                        Interferometry         Polarimetry
                                                             Scattering

  Theoretical                         Interferometry         Polarimetry
                                      Diffraction Theory     Scattering Theory
                                      Aberration Theory      Radiative Transfer
                                      Polarimetry
Depolarization


The coupling of polarized light into unpolarized light.

Diattenuation and retardance which vary rapidly in space and/or time.
Intrinsically associated with scattering.

Milk                Ground Glass
Clouds              Optical Fibers
Rough Surfaces

Ideal Depolarizer

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} \begin{pmatrix} s_0 \\ s_1 \\ s_2 \\ s_3 \end{pmatrix} = \begin{pmatrix} s_0 \\ 0 \\ 0 \\ 0 \end{pmatrix}$$


Partial Depolarizer


Incident polarized light is reduced to a degree of polarization, $d$

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & d & 0 & 0 \\ 0 & 0 & d & 0 \\ 0 & 0 & 0 & d \end{pmatrix} \begin{pmatrix} s_0 \\ s_1 \\ s_2 \\ s_3 \end{pmatrix} = \begin{pmatrix} s_0 \\ d\,s_1 \\ d\,s_2 \\ d\,s_3 \end{pmatrix}$$
Reasons for interest in depolarization


1.  Loss of coherence is an irrecoverable loss of
    information in optical processors and optical
    computers.

2.  Depolarization conveys information on the statistics
    of optical media and interfaces.

3.  Depolarization may be a useful tool for investigating
    thin-film microstructure.

4.  Depolarization may limit the performance of polarizers
    and organic thin films.

5.  Depolarization of laser radar targets is an area in
    need of development.
How many types of depolarization are there?


7 degrees of freedom in a non-depolarizing Mueller
matrix (8 elements in the Jones matrix minus absolute
phase)

16 degrees of freedom in a Mueller matrix

9 potential degrees of freedom for depolarization

Do the inequalities between elements of the Mueller
matrix reduce the number of degrees of freedom?
Probably not.
A Brief Analysis of Depolarizing Mueller Matrices


For maximum transmittance:


Define

$$\text{Depolarizing index} = \frac{1}{4\pi}\int (1 - \mathrm{DOP})\,d\Omega$$

nondepolarizing elements: depolarizing index = 0
ideal depolarizer: depolarizing index = 1
Methods of decomposing a
depolarizing Mueller matrix


N    non-depolarizing Mueller matrix
D    "pure depolarizing Mueller matrix"

a.  $M = ND = DN$

b.  $M = N + D$

c.  $M = \alpha_1 N_1 + \alpha_2 N_2 + \alpha_3 N_3 + \alpha_4 N_4$
How accurate will a polarimeter need to be to measure
a specified amount of depolarization?

Which types of samples will be most suitable for initial
experimental studies of depolarization?

How is depolarization related to scattering and BRDF?

How will the depolarization vary with the solid angle of
light collected about the specular beam?
Abstract

Successful deployment of a space-based, exoatmospheric interceptor will require an accurate
and reliable guidance, navigation, and control (GNC) system for the interceptor. During
the summer of 1990 the proposer began to investigate the application of artificial neural
networks to the interceptor control problem under the UES/AFOSR Summer Faculty Research
Program. Following the summer research period, AFOSR provided funding to continue this
work under the Research Initiation Minigrant Program. The results of the research funded
through the Minigrant are described. First, the context of the research grant is given,
including a statement of the research objectives and an overview of the results of the research.
Next, research results in five specific areas are discussed: (1) an approach to model reference
adaptive control for nonlinear systems with characteristics similar to those of the interceptor;
(2) development of a control scheme based on reinforcement learning; (3) a method for
solving optimal control problems using neural nets; (4) an iterative learning control scheme
using artificial neural networks; and (5) an approach to the analysis and design of learning in
recursive neural nets using the theory of singular perturbations and time scales. The report
ends with a description of future research topics.
I.  Introduction


Successful  deployment  of  a  space-based,  exoatmospheric  guided  interceptor  is  an  essential 
component  in  the  development  of  strategic  space-based  weapon  systems.  Such  an  interceptor 
will  require  an  accurate  and  reliable  guidance,  navigation,  and  control  (GNC)  system.  The 
Guided  Interceptor  Technology  Branch  (SAI)  of  the  Air  Force  Armament  Lab  (AFATL) 
at  Eglin  AFB  is  concerned  with  the  development  of  a  space-based  exoatmospheric  guided 
interceptor.  Within  SAI  the  GNC  Section  is  responsible  for  evaluating  advanced  GNC 
technologies  for  the  interceptor.  A  variety  of  approaches  to  interceptor  control  are  currently 
being  studied,  including  both  modern  and  classical  methods.  A  new  technology  that  has 
been  applied  to  control  problems  in  other  fields  is  artificial  neural  networks  (ANNs).  The 
GNC  Section  is  interested  in  determining  the  potential  use  of  this  new  technology  for  the 
interceptor  problem  and  in  comparing  its  effectiveness  to  that  of  the  other  control  schemes 
under  consideration.  In  this  report  we  consider  both  basic  and  applied  research  related  to 
the  control  of  dynamical  systems  using  ANNs,  motivated  by  the  interceptor  control  problem. 

The research reported here is an outgrowth of work completed by the author as a
UES/AFOSR Summer Faculty Fellow at AFATL/SAI during the summer of 1990. During
the summer research period we began to investigate the application of neural nets to
the interceptor control problem. The main results of the summer research period were
development of a description of the interceptor control problem and a preliminary analysis of
the feasibility of using a neural net to control the interceptor during the terminal phase of
flight. This included development of the system configuration, derivation of a math model
of the interceptor, and an analysis of current approaches to the GNC problem. Various ways
to introduce neural nets into the problem were suggested and it was shown that a feedforward
neural network can be used to identify the forward dynamics of the interceptor. A
control scheme based on a neural net learning paradigm called reinforcement learning was
also proposed. A more complete description of these results can be found in the final report
[1].

A.  Research  Objectives 


Based on the results of the summer research period, the following objectives were
established to continue the research on ANNs for the interceptor problem. These objectives were
taken directly from the Minigrant proposal, hence the future tense in their wording.

1.  Model  reference  adaptive  control  of  nonlinear,  multi-input,  multi-output  systems  using 
neural  networks  for  both  identification  and  control  will  be  studied.  This  will  include 
additional  development  of  identification  of  the  system’s  forward  dynamics. 
2.  Our proposed multilayered, real-valued output reinforcement learning scheme will be
developed  and  refined,  including  theoretical  analysis  and  practical  application  to  single- 
input,  single-output  nonlinear  systems.  This  will  include  work  on  developing  the  ability 
to  learn  in  response  to  time-varying  reinforcement  signals. 

3.  The  controllers  developed  in  (1)  and  (2)  will  be  applied  to  the  interceptor  using  the 
GESIM  and  the  resulting  performance  and  robustness  properties  will  be  evaluated  and 
compared  to  that  of  alternate  control  schemes. 

4.  Basic  development  of  neural  network  implementations  of  both  optimal  control  and 
Kalman  filtering  algorithms  will  be  conducted. 


B.  Research  Results 

In  the  sequel  we  describe  the  progress  made  toward  achieving  the  objectives  stated  above. 

The  description  is  divided  into  the  following  five  parts: 

1.  Adaptive  Control  Strategies  for  GNC:  An  approach  to  adaptive  control  for  nonlinear 
systems  with  models  similar  to  those  of  the  interceptor  is  presented.  The  method  is  a 
derivative  of  the  computed-torque  method  found  in  robotic  manipulator  control. 

2.  Reinforcement Learning with Neural Nets: The reinforcement learning scheme
proposed during the summer of 1990 has been developed. It is demonstrated for a
single-input, single-output example.

3.  Optimal  Control  Using  Neural  Nets:  A  neural  network  approach  to  solving  linear  quadratic 
regulator  problems  is  given. 

4.  Iterative  Learning  Control  Using  Neural  Nets:  A  method  for  using  neural  networks  to 
devise  a  dynamical  iterative  learning  controller  is  presented. 

5.  SPATS for Learning in Neural Nets: We discuss some preliminary results which
interpret recursive Hopfield neural nets from the perspective of singular perturbations and
time scales.

Items  (1)  through  (3)  correspond  to  items  (1),(2),  and  (4)  of  the  objectives,  respectively. 

For  these  items  we  have  been  generally  successful  in  meeting  the  stated  objectives,  although 
simulations  of  the  techniques  we  have  developed  are  not  complete  for  items  (1)  and  (3). 
However,  we  were  not  able  to  complete  item  (3)  of  the  objectives:  the  application  of  our 
control  schemes  to  the  interceptor  using  GESIM  (the  software  at  Eglin  AFB  that  is  used 
to  simulate  the  interceptor).  The  methods  were  not  sufficiently  developed  soon  enough  to 
incorporate them into the software. This will require additional research and development
effort. 

Also  note  that  items  (4)  and  (5)  of  the  results  were  not  specifically  tasked  in  the  stated 
objectives.  However,  research  as  a  creative  activity  is  a  fluid  affair,  acting  to  fill  the  space 
around  it  and  often  following  many  paths.  This  is  distinct  from  development,  which  often 
has  product-oriented  goals.  In  a  broad  sense  we  view  our  Minigrant  as  supporting  the  general 
problem  of  control  of  nonlinear  systems  using  neural  nets  (which  encompasses  the  interceptor 
control  problem).  Items  (4)  and  (5)  deal  with  aspects  of  this  general  problem  and  were 
motivated  by  questions  raised  by  our  Minigrant  activity.  Because  they  were  completed  during 
the  time  of  the  Minigrant  funding,  we  see  these  items  as  indirectly  part  of  the  Minigrant 
and  feel  they  should  be  included.  Although  it  is  unlikely  that  item  (4)  will  ever  be  applied 
to  the  interceptor  problem,  it  will  be  useful  in  some  types  of  robotics  problems,  and  as  such, 
it  may  be  of  interest  in  some  AFOSR  programs.  However,  in  the  long  term,  item  (5)  may  in 
fact  prove  useful  to  the  interceptor  problem. 

In  the  remainder  of  the  report  we  separately  describe  each  of  the  results  stated  above. 
For  the  most  part,  each  section  is  self-contained  and  generic.  That  is,  if  a  given  approach 
applies  to  nonlinear  control  in  general,  then  it  is  presented  in  that  way.  Much  of  this  work 
has  been  reported  as  it  was  developed.  The  work  on  adaptive  strategies  for  GNC  has  not 
yet been published. The work on reinforcement learning was presented in [2]. The work
on optimal control has been reported at a number of meetings, notably [3] and [4]. The
iterative  learning  control  technique  has  been  submitted  to  a  conference  [5].  The  work  on 
the  application  of  singular  perturbations  and  time  scales  is  presented  in  [6].  Finally,  a  result 
on  system  identification  that  we  refer  to  in  the  final  section  is  given  in  [7].  This  work  was 
initiated  by  a  student  prior  to  the  Minigrant  funding  and  hence  it  is  not  included  in  this 
final  report. 


II.  Adaptive  Control  Strategies  for  GNC 

In this section we present some preliminary ideas about using artificial neural nets (ANNs)
as elements in adaptive control systems for guidance, navigation and control (GNC)
applications. We first describe how the nonlinear mapping property of an ANN can be used
to produce dynamic behavior. We then consider model reference adaptive control in
general. Next we describe the computed-torque method used for adaptive control of robotic
manipulators. Finally, we show how a similar scheme can be developed for typical GNC
models.

A.  Nonlinear  Mappings  from  Neural  Nets

As noted in [1], one of the most appropriate uses of a neural net for control applications is
as  a  nonlinear  mapping.  This  is  motivated  by  Kolmogorov’s  result  from  the  late  1950’s  which 
states  that  neural  net  structures  can  be  used  to  implement  arbitrary  functions  (although 
there  is  no  indication  how  these  functions  can  be  learned,  i.e.,  it  is  an  existence  result,  and 
of  course,  certain  restrictions  apply).  The  reason  this  is  a  good  approach  to  neural  nets 
for  control  is  that  most  control  systems  are  designed  for  dynamical  systems  which  can  be 
described  by  equations  such  as 

$$y_{k+1} = f(y_k, y_{k-1}, y_{k-2}, u_k, u_{k-1})$$

In  this  discrete-time  system  we  see  that  the  next  output  is  a  function  of  the  current  input, 
the  current  output,  the  most  recent  past  input,  and  the  two  most  recent  past  outputs.  Thus 
we could imagine the configuration shown in Figure 1, where the operator $z^{-1}$ denotes a unit
time delay and the neural net implements the nonlinear mapping $f$ [8]. This leads naturally
to  several  approaches  to  control.  Below  we  describe  one  of  those  approaches:  model  reference 
adaptive  control. 
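
The tapped-delay-line arrangement described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `simulate` and `toy_map` are hypothetical names, and `toy_map` is an arbitrary smooth function standing in for a trained net, not a mapping taken from this report.

```python
import math
from collections import deque

def simulate(f, inputs, y_init=(0.0, 0.0, 0.0), u_init=0.0):
    """Iterate y[k+1] = f(y[k], y[k-1], y[k-2], u[k], u[k-1])."""
    # The deque plays the role of the unit delays: newest output first.
    y_hist = deque(y_init, maxlen=3)
    u_prev = u_init
    outputs = []
    for u in inputs:
        y_next = f(y_hist[0], y_hist[1], y_hist[2], u, u_prev)
        y_hist.appendleft(y_next)  # oldest output falls off the right end
        u_prev = u
        outputs.append(y_next)
    return outputs

def toy_map(y0, y1, y2, u0, u1):
    """Hypothetical stand-in for the nonlinear mapping a net would implement."""
    return math.tanh(0.5 * y0 - 0.2 * y1 + 0.1 * y2 + u0 + 0.5 * u1)

if __name__ == "__main__":
    print(simulate(toy_map, [1.0, 0.0, 0.0, 0.0]))
```

The mapping itself is static; all of the dynamic behavior comes from feeding delayed outputs and inputs back into it.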


Figure  1:  Neural  net  implementation  of  discrete-time  system. 


B.  Model  Reference  Adaptive  Control 

Recently  Narendra  and  Parthasarathy  have  presented  an  approach  for  the  use  of  neural 
networks  within  the  framework  of  conventional  model  reference  adaptive  control  (MRAC) 
[8].  Consider,  for  instance,  a  nonlinear  system  (example  taken  from  Narendra’s  paper)  of 
the  form 

$$y_{k+1} = f(y_k, y_{k-1}) + u_k$$

where

$$f(x, y) = \frac{xy(x + 2.5)}{1 + x^2 + y^2}.$$


The  goal  of  the  control  is  to  match  the  dynamics  of  a  reference  model  given  by 

$$y_{k+1} = 0.6\,y_k + 0.2\,y_{k-1} + r_k$$

where $r_k$ is the reference signal. Suppose the control law is

$$u_k = -N(y_k, y_{k-1}) + 0.6\,y_k + 0.2\,y_{k-1} + r_k$$

where  N(x,y)  is  a  feedforward  neural  network  which  is  identified  using  backpropagation 
training.  It  can  be  seen  that  if  the  net  successfully  learns  the  system  dynamics  then  the 
output  behavior  will  approach  the  desired  behavior  because  the  closed-loop  system  is  now 
given  by 

$$y_{k+1} = f(y_k, y_{k-1}) - N(y_k, y_{k-1}) + 0.6\,y_k + 0.2\,y_{k-1} + r_k.$$

Narendra  has  considered  various  examples  of  this  type,  showing  how  neural  nets  can 
be  used  to  identify  nonlinear  dynamics,  with  the  results  of  the  identification  then  used  as 
part of a control law. Although most of this work is demonstrated for single-input,
single-output systems, it gives a very good foundation for the use of neural nets in control. In
particular,  notice  that  this  approach  allows  us  to  account  for  the  temporal  characteristics  of 
the  system  using  a  feedforward  neural  network,  which  is  typically  only  used  for  identifying 
static  patterns. 
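
As a rough numerical check of the argument above, the following Python sketch simulates the example plant under the stated control law and compares it with the reference model. It rests on assumptions that are not part of the report: the "network" is replaced by the exact function `f` (a perfectly trained net) and by a deliberately imperfect `0.9 * f`, and the sinusoidal reference signal is chosen arbitrarily.

```python
import math

def f(x, y):
    """Plant nonlinearity from the example above."""
    return x * y * (x + 2.5) / (1.0 + x * x + y * y)

def simulate_reference(r):
    """Reference model y[k+1] = 0.6 y[k] + 0.2 y[k-1] + r[k], zero initial state."""
    y_k, y_km1, out = 0.0, 0.0, []
    for r_k in r:
        y_next = 0.6 * y_k + 0.2 * y_km1 + r_k
        y_km1, y_k = y_k, y_next
        out.append(y_next)
    return out

def simulate_closed_loop(N, r):
    """Plant y[k+1] = f(y[k], y[k-1]) + u[k] under the control law using N."""
    y_k, y_km1, out = 0.0, 0.0, []
    for r_k in r:
        u_k = -N(y_k, y_km1) + 0.6 * y_k + 0.2 * y_km1 + r_k
        y_next = f(y_k, y_km1) + u_k
        y_km1, y_k = y_k, y_next
        out.append(y_next)
    return out

def max_gap(a, b):
    return max(abs(p - q) for p, q in zip(a, b))

# Assumed reference signal (hypothetical choice for illustration).
r = [math.sin(2.0 * math.pi * k / 25.0) for k in range(200)]
y_ref = simulate_reference(r)
gap_ideal = max_gap(simulate_closed_loop(f, r), y_ref)
gap_rough = max_gap(simulate_closed_loop(lambda x, y: 0.9 * f(x, y), r), y_ref)

if __name__ == "__main__":
    print("perfect identification:", gap_ideal)
    print("10% identification error:", gap_rough)
```

With exact identification the nonlinearity cancels and the plant follows the reference model to within rounding error; any residual identification error appears directly as a disturbance on the reference dynamics.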

C.  Adaptive  Control  of  Manipulators 


We are interested in how a model reference adaptive control scheme using neural
networks can be applied to GNC problems. One feature of Narendra’s approach is that it uses
knowledge  of  the  system’s  structure  to  configure  the  neural  net  control  system.  In  order 
to  consider  model  reference  control  for  the  interceptor,  we  must  first  consider  the  structure 
of  the  interceptor’s  equations  of  motion.  When  we  do  this  (see  below),  we  notice  that  the 
dynamical  equations  of  the  interceptor  are  similar  in  structure  to  those  that  arise  in  robotic 
manipulator  control.  Thus  it  is  reasonable  to  consider  the  adaptive  control  schemes  that 
have  been  applied  in  robotics. 

The  equations  of  motion  for  an  n-jointed  manipulator  can  be  written  as 

$$\dot{\theta}_1 = \theta_2$$

$$\dot{\theta}_2 = T(\theta_1, \theta_2) + B(\theta_1, \theta_2)\,u$$

where $\theta_1(t)$ is the vector of joint angles; $\theta_2(t)$ is the vector of joint angular velocities; $T(\theta_1, \theta_2)$
and $B(\theta_1, \theta_2)$ are matrices whose elements are nonlinear functions of $\theta_1$ and $\theta_2$; and $u$ is the
torque vector applied to the joint actuators. It is assumed that $B$ has full rank. By making
the substitution $\theta_2 = \dot{\theta}_1$ we obtain

$$\ddot{\theta}_1 = T(\theta_1, \dot{\theta}_1) + B(\theta_1, \dot{\theta}_1)\,u$$

Often this equation is further reduced to the form

$$M(\theta_1)\,\ddot{\theta}_1 + V(\theta_1, \dot{\theta}_1) + G(\theta_1) = u$$

where $M(\theta_1)$ is identified as the inertial mass matrix; $V(\theta_1, \dot{\theta}_1)$ represents centrifugal and
Coriolis forces; and $G(\theta_1)$ represents the effect of gravitational forces [9].

A  popular  approach  for  adaptive  control  of  an  n-jointed  manipulator  with  the  model 
described  above  is  the  so-called  computed-torque  method.  This  scheme  assumes  a  control 
law  given  by 

$$u = \hat{M}(\theta_1)(\ddot{\theta}_d + K_v \dot{e} + K_p e) + \hat{V}(\theta_1, \dot{\theta}_1) + \hat{G}(\theta_1)$$

In this equation $\theta_d$ is the desired joint position trajectory, $e = \theta_d - \theta_1$ is the measured joint
error, and $K_v$ and $K_p$ are diagonal gain matrices. In this case, if an appropriate identification
scheme is developed so that $\hat{M} \to M$, $\hat{V} \to V$, and $\hat{G} \to G$, then the closed loop system is
described by

$$\ddot{e} + K_v \dot{e} + K_p e = 0.$$

Thus,  it  is  possible  to  drive  the  joint  error  to  zero  if  the  identification  proceeds  properly. 
This  scheme  has  been  shown  to  be  successful  both  theoretically  and  experimentally. 

One  idea  for  using  ANNs  to  control  manipulators  is  to  use  neural  nets  to  identify  the 
matrices $M$, $V$, and $G$. The nets would then be used in the computed-torque law to compute
the  required  input  torque  needed  to  achieve  a  desired  motion.  Below  we  suggest  a  similar 
approach  for  the  interceptor. 
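
To make the computed-torque idea concrete, here is a minimal Python sketch for a single joint. Everything specific in it is an assumption made for illustration: the one-link arm, its parameters, the viscous-friction term standing in for the velocity-dependent forces, the gains, and the sinusoidal desired trajectory. The model terms are taken as perfectly identified, which is the limiting case discussed above.

```python
import math

# Hypothetical one-joint arm: point mass on a rigid link with viscous friction.
MASS, LENGTH, FRICTION, GRAVITY = 1.0, 0.5, 0.1, 9.81

def M(theta):
    return MASS * LENGTH * LENGTH

def V(theta, omega):
    # Stand-in velocity-dependent term (a single joint has no Coriolis force).
    return FRICTION * omega

def G(theta):
    return MASS * GRAVITY * LENGTH * math.sin(theta)

def computed_torque(theta, omega, t, kp=25.0, kv=10.0):
    """u = M (theta_d'' + Kv e' + Kp e) + V + G, with theta_d(t) = sin t assumed."""
    theta_d, omega_d, alpha_d = math.sin(t), math.cos(t), -math.sin(t)
    e, e_dot = theta_d - theta, omega_d - omega
    return M(theta) * (alpha_d + kv * e_dot + kp * e) + V(theta, omega) + G(theta)

def simulate(t_end=5.0, dt=1e-3, theta=0.5, omega=0.0):
    """Forward-Euler simulation; returns the final tracking error magnitude."""
    t = 0.0
    for _ in range(int(round(t_end / dt))):
        u = computed_torque(theta, omega, t)
        alpha = (u - V(theta, omega) - G(theta)) / M(theta)  # plant: M a + V + G = u
        theta, omega, t = theta + dt * omega, omega + dt * alpha, t + dt
    return abs(math.sin(t) - theta)

if __name__ == "__main__":
    print("final tracking error:", simulate())
```

Because the model terms cancel exactly here, the error obeys the linear second-order equation given above and decays at a rate set by the gains; with identified (approximate) terms the cancellation would only be approximate.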

D.  Adaptive  Control  of  GNC  Models 


The  equations  of  motion  that  arise  in  many  GNC  problems  are  similar  to  those  describing 
an  n-jointed  manipulator.  However,  due  to  the  presence  of  both  inertial  reference  frames  and 
body-fixed frames, there are some differences. Specifically, usually $\theta_2 \neq \dot{\theta}_1$. For instance, in
[1] it is noted that the interceptor model has the general form

$$\dot{\theta}_1 = T_1(\theta_1)\,\theta_2$$

$$\dot{\theta}_2 = T_2(\theta_2)\,\theta_2 + B u$$


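For systems modelled in this way we can still proceed, if it is possible to assume that (i)
$T_1^{-1}(\theta_1)$ exists and (ii) the partial derivative of $T_1$ with respect to time is well-defined. In
this case we proceed as follows. Given the system above, then $\theta_2 = T_1^{-1}(\theta_1)\,\dot{\theta}_1$ and

$$\begin{aligned}
\ddot{\theta}_1 &= \frac{dT_1}{dt}(\theta_1)\,\theta_2 + T_1(\theta_1)\,T_2(\theta_2)\,\theta_2 + T_1(\theta_1) B u \\
&= \left[\frac{dT_1}{dt}(\theta_1) + T_1(\theta_1)\,T_2\big(T_1^{-1}(\theta_1)\dot{\theta}_1\big)\right] T_1^{-1}(\theta_1)\,\dot{\theta}_1 + T_1(\theta_1) B u
\end{aligned}$$

We can then write

$$M(\theta_1)\,\ddot{\theta}_1 + V(\theta_1, \dot{\theta}_1) = u$$

where we can identify

$$M(\theta_1) = [T_1(\theta_1) B]^{-1}$$

$$V(\theta_1, \dot{\theta}_1) = -[T_1(\theta_1) B]^{-1} \left[\frac{dT_1}{dt}(\theta_1) + T_1(\theta_1)\,T_2\big(T_1^{-1}(\theta_1)\dot{\theta}_1\big)\right] T_1^{-1}(\theta_1)\,\dot{\theta}_1$$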

This is now in essentially the same form as the system equations of the robotic
manipulator. Thus we may consider the following variation of the computed torque method.
Let

$$u = \hat{M}(\theta_1)(\ddot{\theta}_d + K_v \dot{e} + K_p e) + \hat{V}(\theta_1, \dot{\theta}_1).$$

Again we have defined $\theta_d$ as the desired position trajectory, $e = \theta_d - \theta_1$ as the measured
error, and $K_v$ and $K_p$ as diagonal gain matrices. In this case, if an appropriate identification
scheme is developed so that $\hat{M} \to M$ and $\hat{V} \to V$ then the closed loop system will be described
by

$$\ddot{e} + K_v \dot{e} + K_p e = 0.$$
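
The transformation to manipulator form can be checked numerically on a scalar example. The sketch below is an illustration only: the functions `T1` and `T2`, the input gain `B`, the gains, and the desired trajectory are all hypothetical choices (with `T1` chosen so that it is never zero), and the model terms are taken as exactly known rather than identified by a network. In the vector case the divisions become matrix inverses.

```python
import math

B = 1.0  # hypothetical scalar input gain

def T1(th1):
    return 2.0 + math.cos(th1)        # hypothetical, never zero so the inverse exists

def dT1(th1):
    return -math.sin(th1)             # derivative of T1 with respect to th1

def T2(th2):
    return -0.5 - 0.1 * th2 * th2     # hypothetical

def M(th1):
    return 1.0 / (T1(th1) * B)

def V(th1, th1_dot):
    th2 = th1_dot / T1(th1)           # theta_2 = T1^{-1} theta_1'
    T1_dot = dT1(th1) * th1_dot       # time derivative of T1 by the chain rule
    return -M(th1) * (T1_dot + T1(th1) * T2(th2)) * th2

def theta1_ddot(th1, th2, u):
    """Second derivative of theta_1 computed directly from the original model."""
    th1_dot = T1(th1) * th2
    return dT1(th1) * th1_dot * th2 + T1(th1) * (T2(th2) * th2 + B * u)

def control(th1, th2, t, kp=25.0, kv=10.0):
    th1_dot = T1(th1) * th2
    e, e_dot = math.sin(t) - th1, math.cos(t) - th1_dot
    return M(th1) * (-math.sin(t) + kv * e_dot + kp * e) + V(th1, th1_dot)

def simulate(t_end=5.0, dt=1e-3, th1=0.3, th2=0.0):
    """Forward-Euler simulation of the original two-state model."""
    t = 0.0
    for _ in range(int(round(t_end / dt))):
        u = control(th1, th2, t)
        th1, th2, t = (th1 + dt * T1(th1) * th2,
                       th2 + dt * (T2(th2) * th2 + B * u),
                       t + dt)
    return abs(math.sin(t) - th1)

if __name__ == "__main__":
    print("final tracking error:", simulate())
```

Substituting `u = M * a + V` into the original model returns the commanded acceleration `a`, which is the cancellation the closed-loop error equation relies on.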

In  order  for  such  a  scheme  to  work  a  number  of  issues  must  be  resolved.  Most  importantly, 
we  must  be  sure  of  an  identification  scheme  that  will  converge.  Based  on  our  preliminary 
results  with  a  six  degree-of-freedom  model  [1],  we  believe  that  a  neural  network  can  do 
this  job.  Other  issues  involve  the  existence  of  the  inverses  used  in  the  development  above. 
Current  work  is  concentrating  on  establishing  that  the  assumptions  are  in  fact  satisfied  for 
realistic  problems,  such  as  the  interceptor.  We  are  also  conducting  simulations  of  the  method 
for  a  simple  two-dimensional  system  given  by 


$$\dot{\theta}_1 = (\cos[\ldots] + \sin\theta_2)\,\theta_2$$

$$\dot{\theta}_2 = [\ldots] + u.$$


III.  Reinforcement  Learning  with  Neural  Nets 

Reinforcement  learning  in  neural  nets  is  an  approach  to  the  problem  of  credit  assignment 
during  learning.  As  opposed  to  gradient  descent  techniques  such  as  backpropagation,  a 
reinforcement  learning  scheme  uses  a  single  reinforcement  signal  from  the  environment  to 
adjust  the  network  weights.  In  this  section  we  describe  reinforcement  learning  and  propose 
a  multilayer  neural  network  with  real-valued  outputs  which  learns  using  a  combination  of 
reinforcement  learning  and  backpropagation.  This  method  combines  several  ideas  from  the 
literature. We illustrate the use of the method with an example of the control of a nonlinear
system. This develops and extends our previous work in this area [1].

A.  Background

A  key  issue  in  the  development  of  a  neural  network  application  is  the  type  of  learning 
algorithm  that  is  used.  Most  algorithms  currently  being  studied,  such  as  backpropagation, 
are gradient descents in the weight space. This is one approach to what has been called the
“credit assignment” problem: how to reinforce weights that contribute to desirable
behavior of the system. Another approach to this credit assignment problem is to use
a  single  reinforcement  signal  from  the  environment  (system)  to  decide  how  to  adjust  the 
weights.  This  is  decidedly  different  from  backpropagation,  which  essentially  attempts  to 
form  an  error  signal  at  the  output  of  each  neuron  in  the  network.  In  reinforcement  learning 
we  have  only  one  error  signal,  which  is  used  by  all  the  neurons. 

Figure 2: Reinforcement learning. (Diagram labels: Neural Net, Plant, Control Action, System State, Reinforcement.)

To describe reinforcement learning consider Figure 2. Here we suppose the neural net
generates control actions, based on measurements of the current system state. It then receives
a  reinforcement  signal  from  the  environment  which  evaluates  the  effect  of  the  action.  The 
weights  are  then  adjusted  based  upon  the  reinforcement.  In  the  learning  algorithm,  the 
only  information  given  to  individual  neurons,  regardless  of  their  location  in  the  net,  is  the 
reinforcement  signal.  There  is  no  error  that  is  backpropagated  through  the  net.  In  a  typical 
algorithm,  only  those  weights  which  contributed  to  the  action  of  the  net  will  be  adjusted. 
Specifically,  there  is  an  eligibility  function  associated  with  each  weight  which  indicates  how 
recently  the  weight  has  been  changed  or  how  recently  the  neurons  associated  with  a  weight 
have  fired.  Only  weights  that  are  “eligible”  are  updated  during  learning.  Learning  initially 
proceeds  randomly,  but  as  the  net  learns  which  actions  lead  to  desirable  reinforcement  in 
response  to  specific  system  states,  the  weights  begin  to  adjust  in  such  a  way  that  a  bias  is 
built up toward these actions. Thus this type of system is in fact a stochastic automaton.

Past work in reinforcement learning followed work in stochastic automata theory [10] and
psychology  [11].  More  recently,  a  research  group  based  around  Barto  and  Sutton  has  been 
very  active.  Initial  results  on  using  reinforcement  learning  control  for  the  pole  balancing 
problem  can  be  found  in  Barto,  Sutton,  and  Anderson  [12].  A  key  feature  of  this  work  was 
that  it  used  an  input  decoder  to  partition  the  state  space  into  “boxes”.  They  used  a  network 
made  up  of  what  they  called  “neuron-like”  elements  which  were  basically  perceptrons  with 
stochastic  outputs.  The  network  applied  control  forces  to  a  pole-cart  system  in  an  attempt 
to  balance  the  pole.  Learning  (adjustment  of  the  weights,  based  on  a  reinforcement  signal 
from  the  system)  took  place  only  when  the  pole  fell  over.  The  system  was  able  to  learn  which 
actions  would  keep  the  pole  balanced.  They  also  introduced  the  idea  of  trying  to  predict  the 
value  of  the  reinforcement  signal.  This  allowed  faster  learning.  One  limitation  in  the  work 
of  Barto,  Sutton,  and  Anderson  was  that  it  allowed  only  binary  outputs.  Franklin  [13]  was 
able  to  extend  their  ideas  to  allow  real-valued  outputs  and  has  applied  this  to  the  control 
of  a  simulated  robot  arm.  Anderson  [14]  has  also  considered  the  use  of  multilayer  networks 
with  reinforcement  learning.  Again,  this  work  uses  binary  outputs,  but  the  input  decoder 
is  replaced  by  a  neural  net  which  learns  to  distinguish  the  system  state  without  quantizing 
it  into  boxes.  Also,  the  learning  algorithms  are  based  on  the  theory  of  temporal  differences 
which  has  been  introduced  by  Sutton  [15]  and  include  a  combination  of  reinforcement  learning 
theory  and  gradient  descent  rules. 

B.  Reinforcement  Learning  Using  Multilayered  Nets  with  Real-Valued  Outputs

Figure  3  shows  a  proposed  configuration  we  have  developed  for  the  control  of  a  nonlinear 
system.  This  scheme  is  a  combination  of  Franklin’s  ideas  and  Anderson’s  ideas.  The  system 
uses  multilayer  neural  nets  with  real-valued  outputs.  Specifically,  there  are  two  parts  of 
the  control  system:  an  action  part  and  a  predictor.  The  control  output  (u)  is  a  normal 
random variable with mean equal to the output of the action net ($\mu$) and with standard
deviation given by the predicted reinforcement ($\sigma$). The idea is that if the predictor expects
the proposed action $\mu$ to result in a large (bad) reinforcement signal then the actual control
action is computed as a random variable centered at $\mu$, but with a large variance. Thus the
system is less likely to take an action close to $\mu$. On the other hand, if $\sigma$ is small this means
the predictor expects a more favorable reinforcement and thus allows the control action to
be closer to $\mu$.

Both  the  action  net  and  the  predictor  net  are  standard  feedforward  nets  with  outputs 
defined  by 

$$z = \sum_i c(i)\,w(i) + \sum_j b(j)\,v(j)$$

$$v(j) = f\Big(\sum_k a(j,k)\,w(k)\Big)$$

where $z$ is the net's output, the $v(j)$'s are the outputs of the hidden layer, the $w(i)$'s are the
inputs to the net, and $f(\cdot)$ is a sigmoidal nonlinearity. Note that in our scheme, the action
net uses only the system states ($x_k$) as inputs. The predictor net receives as inputs the
system states and either the control action ($u$) or the expected control action ($\mu$), depending
on whether the net is being used in a learning mode or in an operational mode, respectively.

Figure 3: Proposed control scheme.

The weights of the predictor net are continuously adjusted using the error between the
output $\hat{r}_{k+1}$ and the actual reinforcement $r_{k+1}$. Thus the predictor learns the reinforcement
resulting from a given action ($u_k$) within a given context ($x_k$). Training of this net is done
using standard backpropagation with an error signal $e = r - \hat{r}$. This gives

$$b(i)_{new} = b(i)_{old} + \eta_b\, e\, v(i)$$

$$c(i)_{new} = c(i)_{old} + \eta_c\, e\, w(i)$$

$$a(i,j)_{new} = a(i,j)_{old} + \eta_a\, e\, [b(i)\, v(i)(1 - v(i))]\, w(j)$$

Once the weights of the predictor are adjusted on a given trial, it is then used in an
operational mode to compute the variance $\sigma_{k+1}$ used for computing the next control action. Note
that after learning we have $\sigma_{k+1} = \hat{r}_{k+1} = r_{k+1}$.

Learning  in  the  action  layer  is  a  combination  of  conventional  backprop  and  reinforcement 
learning. The same type of backpropagation equations given above for the predictor are used,
except that the error signal is modified to be

$$e = R_v\,(u - \mu)$$

where

$$R_v = \begin{cases} 1 & \text{if } \sigma_{k+1} > r_{k+1} \\ 0 & \text{otherwise.} \end{cases}$$


Rv  is  a  flag  which  indicates  a  desirable  system  response  to  the  most  recent  action.  Specifically, 
this  signal  provides  a  mechanism  for  positive  reinforcement.  The  idea  behind  this  algorithm 
is  the  following.  Suppose  that  a  given  control  action  u  results  in  an  actual  reinforcement 
that  is  smaller  than  the  predicted  reinforcement.  Then  we  should  adjust  the  weights  so  that 
for the same set of inputs the mean signal ($\mu$) is moved closer to the actual control action $u$
that  gave  a  better  reinforcement.  However,  if  the  reinforcement  is  larger  than  expected,  then 
we  make  no  changes,  but  simply  allow  the  stochastic  search  for  a  better  action  to  proceed 
on  the  next  trial. 
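
The scheme described in this section can be summarized in a short Python sketch. It is a simplified reading of the text under several assumptions, not the implementation used for the results reported here: the class and function names are hypothetical, a single learning gain `eta` is used for all weights, the weights are initialized uniformly at random, and the absolute value of the predictor output is used as the standard deviation so that it is never negative.

```python
import math
import random

class Net:
    """z = sum_i c[i] w[i] + sum_j b[j] v[j],  v[j] = sigmoid(sum_k a[j][k] w[k])."""

    def __init__(self, n_in, n_hidden, rng):
        self.a = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.c = [rng.uniform(-0.5, 0.5) for _ in range(n_in)]

    def forward(self, w):
        v = [1.0 / (1.0 + math.exp(-sum(ajk * wk for ajk, wk in zip(row, w))))
             for row in self.a]
        z = sum(ci * wi for ci, wi in zip(self.c, w)) + sum(bj * vj for bj, vj in zip(self.b, v))
        return z, v

    def update(self, w, e, eta):
        """One backpropagation step driven by the scalar error signal e."""
        _, v = self.forward(w)
        for j in range(len(self.b)):
            delta = self.b[j] * v[j] * (1.0 - v[j])   # uses b[j] before it is changed
            for k in range(len(w)):
                self.a[j][k] += eta * e * delta * w[k]
            self.b[j] += eta * e * v[j]
        for i in range(len(w)):
            self.c[i] += eta * e * w[i]

def control_step(action_net, predictor_net, x, rng):
    """Operational mode: the predictor sees the expected action mu."""
    mu, _ = action_net.forward(x)
    sigma, _ = predictor_net.forward(x + [mu])
    u = rng.gauss(mu, abs(sigma))       # abs() is an assumption, see lead-in
    return u, mu, sigma

def learn_step(action_net, predictor_net, x, u, mu, sigma, r, eta=0.01):
    """Learning mode: the predictor sees the action u that was actually applied."""
    r_hat, _ = predictor_net.forward(x + [u])
    predictor_net.update(x + [u], r - r_hat, eta)
    r_v = 1.0 if sigma > r else 0.0     # reward only better-than-predicted outcomes
    action_net.update(x, r_v * (u - mu), eta)
```

A closed-loop experiment would call `control_step`, apply `u` to the plant, measure the reinforcement `r`, and then call `learn_step`, repeating at every time step. The state `x` is passed as a Python list.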

C.  Example  and  Comments 


This scheme was applied to the following example plant from Narendra’s paper [8].

$$y_{k+1} = \frac{y_k\, y_{k-1}\,(y_k + 2.5)}{1 + y_k^2 + y_{k-1}^2} + u_k$$

In  our  example,  the  goal  was  to  train  the  system  output  to  follow  a  reference  signal.  In 
order  to  do  this  the  configuration  of  Figure  3  was  modified  to  allow  input  of  the  desired 
reference  signal  to  each  net.  The  reinforcement  signal  for  the  example  was  taken  to  be 
the  error  between  the  system  output  and  the  reference  signal.  The  predictor  provided  an 
estimate  of  this  error.  Training  of  the  action  net  occurred  whenever  the  actual  error  was 
smaller  than  the  predicted  error.  In  general  the  scheme  gave  good  results.  Although  the  net 
architecture  is  somewhat  different,  the  predictor  could  identify  the  system  dynamics  with 
the  same  results  as  the  example  in  Narendra’s  paper.  We  also  found  the  control  action  to 
be  successful.  Figure  4  shows  a  sample  result  after  about  20,000  time  steps  for  the  case  of  a 
square  wave  reference  signal.  In  this  trial  both  the  predictor  net  and  the  action  net  had  five 
hidden  layer  nodes  and  all  the  learning  gains  were  set  to  be  0.01. 

Figure 4: Tracking response.

We feel there are several aspects of reinforcement learning schemes which make them
potentially useful for the control of nonlinear systems. In particular, because the training
is stochastic in nature, the neural net will have learned to respond to a variety of input
conditions. Thus we expect it to be fairly robust with respect to parameter variations or
unmodelled dynamics. We are encouraged by our preliminary results, but more work remains
to be done. We are particularly interested in how these ideas could be applied to response
shaping and we are currently working to develop a system which learns in response to a
time-varying reinforcement signal. Such a signal could be bounded above and gradually
grow  to  zero  to  reflect  overshoot  and  settling  time  requirements,  respectively.  We  are  also  in 
the  process  of  performing  a  detailed  analysis  of  the  behavior  of  the  proposed  control  scheme, 
including  additional  simulations  of  a  two-input,  two-output  system.  Our  goal  is  to  realize  a 
successful  application  of  the  scheme  to  the  full  six  degree-of-freedom  model  of  the  interceptor. 

IV.  Optimal  Control  Using  Neural  Nets 

In this section we describe the use of artificial neural networks for solving optimal
control problems for discrete-time linear systems with quadratic cost functions. The result is
obtained by formulating the optimal control problem as a quadratic programming problem
with inequality constraints and then applying a result by Kennedy and Chua [16]. We first
describe the result that allows us to use a neural net to solve optimization problems. Then
we show how to formulate the linear quadratic regulator (LQR) problem as a nonlinear
programming problem. It is then possible to directly apply Kennedy and Chua's result to find
the optimal control solution using a neural net.

A.  Neural  Nets  for  Nonlinear  Programming 

Consider  the  general  nonlinear  programming  problem  with  inequality  constraints: 

$$\min_{v \in R^q} \phi(v)$$

subject to

$$f_i(v) \ge 0$$

for $i = 1, \ldots, p$. All the standard assumptions about constraint qualification, continuity,
existence of first and second partials, etc., are assumed to apply. It has been shown that the
canonical nonlinear programming circuit of Kennedy and Chua solves this nonlinear
programming problem [16]. The circuit, shown in Figure 5, is in the form of a Hopfield neural
network and is described by the following equations.

$$C_i \frac{dv_i}{dt} = -\frac{\partial \phi}{\partial v_i} - \sum_{j=1}^{p} g_j(f_j(v))\,\frac{\partial f_j}{\partial v_i}$$

for $i = 1, \ldots, q$. Here the $g_j(\cdot)$'s are nonlinear resistors which are required to be passive
monotone nondecreasing functions.
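
The behavior of such a circuit can be imitated in software by integrating gradient-flow dynamics of this kind. The Python sketch below is a hedged illustration, not a circuit simulation: all capacitances are set to one, the resistor characteristic is assumed to be $g(f) = s \cdot \min(0, f)$ (zero while a constraint is satisfied, proportional to the violation otherwise), forward-Euler integration is used, and the small test problem is hypothetical.

```python
def gradient_flow(grad_phi, constraints, v0, s=100.0, dt=1e-3, steps=20000):
    """Integrate dv_i/dt = -dphi/dv_i - sum_j g(f_j(v)) * df_j/dv_i.

    constraints is a list of (f, grad_f) pairs, each meaning f(v) >= 0.
    """
    v = list(v0)
    for _ in range(steps):
        dv = [-g for g in grad_phi(v)]
        for f, grad_f in constraints:
            current = s * min(0.0, f(v))      # assumed g_j: active only when violated
            for i, d in enumerate(grad_f(v)):
                dv[i] -= current * d
        v = [vi + dt * di for vi, di in zip(v, dv)]
    return v

# Hypothetical test problem: minimize (v1 - 2)^2 + (v2 - 1)^2
# subject to 1 - v1 - v2 >= 0.  The exact constrained minimizer is (1, 0).
def grad_phi(v):
    return [2.0 * (v[0] - 2.0), 2.0 * (v[1] - 1.0)]

constraint = (lambda v: 1.0 - v[0] - v[1], lambda v: [-1.0, -1.0])

solution = gradient_flow(grad_phi, [constraint], [0.0, 0.0])

if __name__ == "__main__":
    print(solution)
```

With a finite slope `s` the equilibrium sits slightly outside the feasible region, so the answer is an approximation whose accuracy improves as `s` grows, at the price of a smaller stable integration step.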




Figure  5:  Neural  network  for  optimization. 


B.  Optimal  Control  Solution 


Optimal  control  problems  for  linear  systems  with  quadratic  cost  functions  have  been 
described  in  a  variety  of  references.  For  a  typical  development  see  Kailath  [17].  In  this 
section  we  restrict  our  analysis  to  the  discrete-time,  deterministic  problem.  Specifically,  we 
consider  the  following  finite-horizon  problem. 

$$\min J = \frac{1}{2} x_N^T S x_N + \frac{1}{2} \sum_{k=0}^{N-1} \left( x_k^T Q x_k + u_k^T R u_k \right)$$

subject to the plant dynamics

$$x_{k+1} = A x_k + B u_k$$

for $k = 0, \ldots, N - 1$, with $x_0$ given. We assume $x \in R^n$ and $u \in R^p$. Also assume the pair
$(A, B)$ is controllable.

Now, define a “supervector” $v \in R^{(N+1)n+Np}$ by

$$v = \begin{pmatrix} x_0 \\ x_1 \\ \vdots \\ x_N \\ u_0 \\ u_1 \\ \vdots \\ u_{N-1} \end{pmatrix}$$

Then  notice  that  the  optimal  control  problem  can  be  written  more  compactly  as 


$$\min_v J$$

subject to

$$Tv = 0.$$

Here the $Nn$ rows of $T$ encode the plant dynamics, one block row for each $k$.
Note that this optimization problem involves only equality constraints. In order to apply
the  result  of  Kennedy  and  Chua  it  is  necessary  to  transform  the  Nn  equality  constraints  into 
Nn  +  1  inequality  constraints  by  applying  the  following  result  from  Taha  [18]. 




Theorem Given $m$ equality constraints,

$$\sum_{j=1}^{n} a_{ij} x_j = b_i$$

for $i = 1, \ldots, m$, these are equivalent to the following $m$ inequality constraints

$$\sum_{j=1}^{n} a_{ij} x_j \le b_i$$

for $i = 1, \ldots, m$, along with the additional inequality constraint

$$\sum_{j=1}^{n} \Big( \sum_{i=1}^{m} a_{ij} \Big) x_j \ge \sum_{i=1}^{m} b_i$$

That is, we add a constraint involving a column sum of the elements of the A matrix.
Using  this  fact  we  can  now  rewrite  the  LQR  problem  in  the  following  form 


$$\min_v J$$

subject to

$$f_i = \sum_{j=1}^{(N+1)n+Np} a_{ij} v_j \ge 0$$

for $i = 1, \ldots, Nn + 1$, where the $a_{ij}$'s are elements of the matrix $\mathcal{A}$, defined by

[matrix illegible in the source]

In this matrix $a$, $\mathbf{1}$, and $0$ are defined as the following.

$$\mathbf{1} = (1, 1, \ldots, 1)^T$$

[definitions of $a$ and $0$ illegible in the source]

Using  this  formulation  of  the  problem  we  now  have  a  direct  correspondence  between  the 
optimal  control  problem  and  the  nonlinear  programming  problem,  which  can  be  solved  with 
the  neural  net  given  above. 
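
The conversion from equality to inequality constraints can be illustrated for a scalar plant. The Python sketch below is built on assumptions: the state and input are scalars ($n = p = 1$), the rows of the constraint matrix are written with the sign convention $a x_k - x_{k+1} + b u_k = 0$, and the helper names are hypothetical. It is not a reconstruction of the constraint matrix used in this report.

```python
def build_T(a, b, N):
    """Rows encode a*x_k - x_{k+1} + b*u_k = 0 for v = [x_0..x_N, u_0..u_{N-1}]."""
    T = []
    for k in range(N):
        row = [0.0] * (2 * N + 1)
        row[k] = a            # coefficient of x_k
        row[k + 1] = -1.0     # coefficient of x_{k+1}
        row[N + 1 + k] = b    # coefficient of u_k
        T.append(row)
    return T

def to_inequalities(T):
    """Tv = 0 becomes (-T)v >= 0 row by row, plus (column sums of T)v >= 0."""
    rows = [[-t for t in row] for row in T]
    rows.append([sum(col) for col in zip(*T)])
    return rows

def satisfies(rows, v, tol=1e-12):
    """True if every inequality row . v >= 0 holds up to a rounding tolerance."""
    return all(sum(r * x for r, x in zip(row, v)) >= -tol for row in rows)

if __name__ == "__main__":
    rows = to_inequalities(build_T(0.5, 1.0, 2))
    # x0 = 1, u0 = 0.2 gives x1 = 0.7; u1 = -0.1 gives x2 = 0.25
    print(satisfies(rows, [1.0, 0.7, 0.25, 0.2, -0.1]))   # consistent trajectory
    print(satisfies(rows, [1.0, 0.7, 0.30, 0.2, -0.1]))   # violates the dynamics
```

A vector $v$ passes all $Nn + 1$ inequalities only if it satisfies the plant dynamics at every step: a positive residual is caught by one of the first $Nn$ rows and a negative one by the summed row.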


C.  Implementation 


Given  the  problem  as  formulated  above  it  is  necessary  to  map  the  constraints  and  all  the 
appropriate  derivatives  into  the  neural  network  circuit.  This  involves  computing 

$$\frac{\partial J}{\partial v_i}, \qquad \frac{\partial f_i}{\partial v_j}.$$

To compute the partials of $f_i$ with respect to the variable $v_j$, the easiest thing to do is
first form the matrix $\mathcal{A}$. Then the derivative of the inequality $f_i$ with respect to the variable
$v_j$ is simply the $(i,j)$th element of $\mathcal{A}$ (that is, $a_{ij}$). However, the matrix $\mathcal{A}$ has some definite
structure  that  can  add  insight  into  the  neural  net  implementation  of  the  equations.  A  topic 
for  future  study  is  the  nature  of  this  structure  and  its  implications. 

To  compute  the  partials  of  the  cost  functional  J  with  respect  to  the  variable  Vj,  first 
suppose  that  the  weight  matrices  Q,  S,  and  R  are  defined  as 


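$$Q = \begin{pmatrix} q_{11} & \cdots & q_{1n} \\ \vdots & & \vdots \\ q_{n1} & \cdots & q_{nn} \end{pmatrix}, \quad
S = \begin{pmatrix} s_{11} & \cdots & s_{1n} \\ \vdots & & \vdots \\ s_{n1} & \cdots & s_{nn} \end{pmatrix}, \quad
R = \begin{pmatrix} r_{11} & \cdots & r_{1p} \\ \vdots & & \vdots \\ r_{p1} & \cdots & r_{pp} \end{pmatrix}$$

Then it is straightforward to show the following.

$$\frac{\partial J}{\partial v_i} = \begin{cases}
\sum_{j=1}^{n} q_{i-ln,\,j}\, v_{ln+j} & i = ln + 1, \ldots, (l+1)n, \quad l = 0, \ldots, N - 1 \\
\sum_{j=1}^{n} s_{i-Nn,\,j}\, v_{Nn+j} & i = Nn + 1, \ldots, (N+1)n \\
\sum_{j=1}^{p} r_{i-(N+1)n-lp,\,j}\, v_{(N+1)n+lp+j} & i = (N+1)n + lp + 1, \ldots, (N+1)n + (l+1)p, \quad l = 0, \ldots, N - 1
\end{cases}$$

where $q_{ij}$, $s_{ij}$, and $r_{ij}$ are the elements of the matrices $Q$, $S$, and $R$, respectively. As in
the case of the partials of the $f_i$ with respect to the $v_j$'s, there is some obvious structure inherent
in these equations which we plan to study in the future.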
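
The block structure of these partial derivatives is easy to see in code. The Python sketch below evaluates the cost and its gradient with respect to the supervector and can be checked against finite differences. It assumes symmetric weight matrices and the ordering $v = (x_0, \ldots, x_N, u_0, \ldots, u_{N-1})$; the function names are hypothetical.

```python
def quad(Mx, z):
    """Quadratic form z^T Mx z."""
    return sum(z[i] * Mx[i][j] * z[j] for i in range(len(z)) for j in range(len(z)))

def matvec(Mx, z):
    return [sum(m * zj for m, zj in zip(row, z)) for row in Mx]

def split(v, n, p, N):
    """Unpack the supervector into states x_0..x_N and inputs u_0..u_{N-1}."""
    xs = [v[k * n:(k + 1) * n] for k in range(N + 1)]
    base = (N + 1) * n
    us = [v[base + k * p: base + (k + 1) * p] for k in range(N)]
    return xs, us

def cost(v, Q, S, R, n, p, N):
    xs, us = split(v, n, p, N)
    J = 0.5 * quad(S, xs[N])
    for k in range(N):
        J += 0.5 * (quad(Q, xs[k]) + quad(R, us[k]))
    return J

def grad(v, Q, S, R, n, p, N):
    """Gradient of J: Q x_k blocks, then S x_N, then R u_k blocks (symmetric weights)."""
    xs, us = split(v, n, p, N)
    g = []
    for k in range(N):
        g += matvec(Q, xs[k])
    g += matvec(S, xs[N])
    for k in range(N):
        g += matvec(R, us[k])
    return g
```

Each block of the gradient depends on only one block of $v$, so the corresponding part of the network is block diagonal.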

We  are  currently  working  to  develop  this  approach  to  optimal  control.  One  focus  has  been 
to  develop  some  numerical  examples  of  the  technique  using  P-SPICE  to  simulate  the  net.  To 
date  this  has  not  been  successful,  primarily  because  of  hardware  and  software  limitations. 
However,  we  do  expect  to  obtain  successful  simulations  in  the  near  future.  This  will  allow 
us  to  compare  the  method  to  conventional  solutions  of  optimal  control  problems.  Other 
directions  for  extending  these  ideas  are  detailed  in  the  final  section  below. 

V.  Iterative  Learning  Control  Using  Neural  Nets 


In  this  section  we  describe  some  results  which  apply  neural  nets  to  the  problem  of  iterative 
learning control. Although this was not specifically tasked as part of the objectives of the
Minigrant, we report these results because they were obtained during the period of the
Minigrant  research.  Also,  we  were  led  to  reconsider  the  iterative  learning  control  problem 
as  a  result  of  our  work  on  the  reinforcement  learning  scheme,  which  is  a  stochastic  type 
of  iterative  learning  controller.  Further,  these  results  represent  basic  research  in  the  area 
of  neural  network  control  of  nonlinear  dynamical  systems,  an  area  which  encompasses  the 
interceptor  control  problem. 

A.  Iterative  Learning  Control 

Iterative learning control is a relatively new approach to the problem of designing for specific
transient response performance. Originally suggested and developed by Arimoto
and  his  co-workers  [19],  learning  control  is  an  iterative  approach  to  generating  the  optimal 
system  input  so  that  the  system  output  is  as  close  as  possible  to  the  desired  output.  The 
method  can  be  used  to  improve  the  transient  behavior  of  systems  that  are  repetitive  in 
nature  and  operate  over  a  fixed  time  interval.  An  example  of  such  a  system  is  an  antenna 
servomechanism  in  a  radar  which  performs  a  given  scan  pattern  over  and  over.  The  trajectory 
which  the  antenna  follows  is  completed  in  a  finite  time  and  then  it  is  repeated.  Learning 
control  is  also  well-suited  to  problems  in  which  a  system  must  be  able  to  follow  different  types 
of  inputs.  For  example,  in  a  flexible  manufacturing  system  a  robot  may  be  placed  on  an 
assembly  line  and  programmed  to  paint  a  straight  line  on  a  car  door  as  it  passes  the  robot’s 
station.  Later,  as  production  needs  change,  the  manufacturing  facility  may  be  reconfigured 
and  perhaps  the  robot  is  needed  to  execute  a  circular  trajectory  to  paint  whitewall  tires. 
Thus  the  robot  is  required  to  execute  a  given  trajectory  many  times  for  a  specific  task,  but 
may  be  configured  to  perform  several  different  tasks,  depending  upon  production  needs. 

Figure 6: Learning control configuration. (Diagram labels: $u_k$, $u_{k+1}$, $y_d$.)

To understand the approach of learning control consider Figure 6. Each time the system
operates the input to the system, $u_k$, is stored, along with the resulting system output, $y_k$.
That  is,  we  store  the  complete  trajectory  over  the  entire  interval  of  interest.  The  learning 
controller  then  evaluates  the  performance  error  as  compared  to  some  desired  signal  yj  and 
computes  a  new  input,  u*+i,  which  is  stored  for  use  the  next  time  the  system  operates.  The 
new  input  is  chosen  in  such  a  way  as  to  guarantee  that  the  performance  error  will  be  reduced 
on  the  next  trial.  The  task  in  learning  control  is  to  specify  the  algorithm  for  generating  the 
next  input,  given  the  current  input  and  output,  so  that  convergence  to  the  desired  output 
(in  the  sense  of  some  norm)  is  attained.  Ideally,  the  convergence  property  of  the  learning 
control  algorithm  would  require  a  minimal  knowledge  of  the  system  parameters  and  would 
be independent of the desired response $y_d$. Thus it would be possible to derive the best
control  signal  for  any  of  several  desired  responses,  in  the  presence  of  modelling  uncertainty, 
simply  by  using  input-output  data  obtained  from  actual  operation.  Note  this  key  difference 
between  learning  control  and  optimal  control  design  techniques.  In  conventional  optimal 
control  methodologies  the  optimal  input  is  computed  a  priori  using  a  model  of  the  system. 
Also  note  that  learning  control  differs  from  conventional  adaptive  control  in  that  the  learning 
algorithm  is  executed  off-line,  at  the  end  of  each  trial.  Most  adaptive  control  schemes  are 
on-line algorithms and involve changes of the controller's parameters. Because of its iterative
nature,  learning  control  can  be  described  as  training.  Given  a  desired  response,  we  train  the 
system  to  follow  it  with  as  little  transient  error  as  possible. 
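
As a concrete illustration of the trial-to-trial idea described above, the following minimal Python sketch applies a simple proportional-type learning update, $u_{k+1}(t) = u_k(t) + \gamma e_k(t+1)$, to a first-order linear plant. Everything specific here is an assumption made for illustration and does not come from this report: the plant coefficients `a` and `b`, the learning gain `gain`, the zero initial input, the triangular desired trajectory, and the update rule itself, which is one common choice in the learning control literature rather than the scheme developed here.

```python
import math


def run_plant(u, a=0.5, b=1.0):
    """One trial of a hypothetical plant y[t+1] = a*y[t] + b*u[t], with y[0] = 0."""
    y = [0.0]
    for u_t in u:
        y.append(a * y[-1] + b * u_t)
    return y


def triangle(n):
    """Desired trajectory: n+1 samples rising linearly from 0 to 1 and back to 0."""
    half = n / 2.0
    return [1.0 - abs(t - half) / half for t in range(n + 1)]


def learn(y_d, trials=30, gain=0.5):
    """Repeat the trial, storing u_k and y_k, and update the input off-line."""
    n = len(y_d) - 1
    u = [0.0] * n                      # initial input u_0 (assumed zero here)
    errors = []                        # 2-norm of the tracking error on each trial
    for _ in range(trials):
        y = run_plant(u)               # run the system and record its output
        e = [y_d[t] - y[t] for t in range(n + 1)]
        errors.append(math.sqrt(sum(v * v for v in e[1:])))
        # Off-line update at the end of the trial: u_{k+1}(t) = u_k(t) + gain*e_k(t+1)
        u = [u[t] + gain * e[t + 1] for t in range(n)]
    return u, errors


y_d = triangle(20)
u_final, errors = learn(y_d)
print("error norm on first trial:", errors[0])
print("error norm on last trial: ", errors[-1])
```

Note that the update uses only stored input/output data from the previous trial; the plant model appears in the sketch only to generate that data.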

B.  A  Learning  Control  Scheme 

In  previous  work  [20]  we  have  noted  that  the  real  usefulness  of  iterative  learning  control 
may  be  for  problems  in  which  we  wish  to  control  the  transient  response  behavior  of  nonlinear 
or  time-varying  systems.  In  this  case  it  makes  sense  to  consider  learning  controllers  which 
also have a nonlinear or time-varying structure. A non-trivial question, however, is what type
of  nonlinear  system  should  be  considered.  The  class  of  all  nonlinear  systems  is  very  large 
and  it  is  not  clear  what  structure  would  work  best  for  learning  control  applications.  One 
answer  to  this  question  is  to  consider  learning  controllers  based  on  artificial  neural  networks. 
ANNs,  with  their  nonlinear  structure  and  their  ability  to  learn  internal  representations  and 
recall  associations,  are  good  candidates  for  nonlinear  learning  control. 

Previously  we  have  developed  a  method  for  learning  control  which  required  the  same 
number  of  neurons  in  both  the  input  layer  and  the  output  layer  as  the  number  of  time 
steps  in  the  interval  of  interest  [20].  Here  we  propose  to  use  the  approach  of  Narendra 
and  Parthasarathy  to  develop  a  nonlinear  learning  controller  which  takes  into  account  the 
dynamic relationship between the input/output data. The advantage of this method over
our earlier technique is that the number of inputs to the neural net of the learning controller
need be no greater than twice the order of the plant. This is true no matter how large we
make $N$, the length of the learning control trials. In contrast, our previous method would
require $N$ inputs and $N$ outputs, and such a network will have convergence problems as $N$ increases.

The  proposed  learning  control  scheme  is  shown  in  Figure  7.  We  call  this  a  dynamical 
learning  control  scheme  because  the  final  learning  controller  actually  acts  like  a  dynamical 
system,  using  past  outputs  as  inputs. 


Figure 7: Dynamical learning controller. Stage 1: collect I/O data, $u_k(t)$ and $y_k(t)$, during
the $k$th trial. Stage 2: train the inverse and forward nets, using I/O data from the $k$th trial.
Stage 3: compute $u_{k+1}(t)$ by cascading $NN_1$ and $NN_2$ and adjusting $NN_1$ with $NN_2$ held fixed.


Notice that in this system there are two neural networks. One, $NN_2$, is used to identify
the dynamics of the plant. The other, $NN_1$, is used to identify the inverse dynamics of the
plant. It is assumed that the order of the plant is known and that each neural net block
shown in the figure is actually configured as in Figure 1. The reason for using two nets rather
than just one net is to avoid the problem of projecting through a gradient of the plant to
get the correct error for learning the inverse dynamics.

The  following  procedure  is  used  to  compute  a  new  input  to  the  system  at  the  end  of  each 
trial. 


1. Set the initial input $u_0(t) = y_d(t)$ and set $k = 0$.

2. Stage 1: Run the plant with $u_k$. Collect and store in memory $u_k(t)$ and $y_k(t)$.

3. Stage 2: Parallel learning. Train the inverse and forward nets using the input/output
data from the $k$th trial. Training is on a point-by-point basis, using backpropagation.

4. Stage 3: Series learning. Cascade the inverse and forward nets. Apply the desired
output $y_d(t)$ as the input to the inverse net. Train to force the output of the forward
net to match the desired output. Use backpropagation, but hold the forward net fixed.
Only the inverse net is adjusted.

5. Series learning and parallel learning are alternated until convergence of error signals.

6. Recover the input for the next trial, $u_{k+1}$, from the output of the inverse net after
learning is complete.

7. Set $k = k + 1$ and go to Step 2 (Stage 1).


A  key  aspect  of  this  algorithm  is  the  alternation  between  Stage  2  and  Stage  3  during  learning. 
This  ensures  that  we  have  adjusted  the  net  at  each  trial  to  invert  our  best  approximation  to 
the  plant  (in  the  direction  of  the  desired  output)  while  preserving  the  integrity  of  the  inverse 
model  for  the  actual  input/output  data. 

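To illustrate the technique, we simulated the scheme for the plant mentioned above, from
Narendra and Parthasarathy’s paper [8]:

$$y_{k+1} = \frac{y_k \, y_{k-1} \, (y_k + 2.5)}{1 + y_k^2 + y_{k-1}^2} + u_k$$

where the subscript $k$ in this equation indexes the time step rather than the trial.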


The  performance  of  the  learning  control  scheme  is  shown  in  Figure  8  (the  desired  signal 
is  a  triangular  waveform).  As  can  be  seen,  within  about  ten  trials  the  actual  performance 
matches  the  desired  performance  almost  exactly. 
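
The example plant can be simulated directly from its difference equation, $y_{k+1} = y_k y_{k-1}(y_k + 2.5)/(1 + y_k^2 + y_{k-1}^2) + u_k$. The short Python sketch below is only an illustration: the function names `plant_step` and `simulate`, the zero initial conditions, and the test input are assumptions, and no learning controller is included.

```python
def plant_step(y_k, y_km1, u_k):
    """One step: y_{k+1} = y_k*y_{k-1}*(y_k + 2.5) / (1 + y_k^2 + y_{k-1}^2) + u_k."""
    return y_k * y_km1 * (y_k + 2.5) / (1.0 + y_k ** 2 + y_km1 ** 2) + u_k


def simulate(u, y0=0.0, ym1=0.0):
    """Return [y_0, y_1, ..., y_n] for the input sequence u (assumed initial conditions)."""
    y_k, y_km1 = y0, ym1
    ys = [y_k]
    for u_k in u:
        # The right-hand side is evaluated with the old values before assignment.
        y_k, y_km1 = plant_step(y_k, y_km1, u_k), y_k
        ys.append(y_k)
    return ys


# Hypothetical test input: a short pulse followed by zeros.
outputs = simulate([0.5, 0.5, 0.0, 0.0])
print(outputs)
```

In a learning control experiment, `simulate` would play the role of Stage 1: it is run once per trial with the current input sequence, and the resulting output sequence is stored.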


Figure 8: Dynamical learning controller performance. Top graph: desired and actual
response. Bottom graph: control input signal.


VI. SPATS for Learning in Neural Nets


In this section we describe an approach we have developed for the analysis and design of
learning in neural nets and in systems containing neural nets as subsystems. Throughout,
we restrict our attention to recursive ANNs, such as the Hopfield net. However, much of
the discussion also applies to feedforward nets operated in feedback configuration. The
approach  introduced  in  this  section  is  to  apply  well-developed  techniques  from  the  theory  of 
singular  perturbations  and  time  scales  (SPATS)  to  standard  neural  network  architectures. 
The  basic  idea  is  to  cast  a  neural  network  system  into  the  standard  framework  of  SPATS  by 
identifying  the  “learning”  and  “computing”  processes  in  the  neural  net  as  “slow”  and  “fast” 
phenomena,  respectively.  As  in  the  previous  section,  this  topic  was  not  explicitly  tasked 
in the objectives of the Minigrant. Again, however, these ideas were developed during the
period of the Minigrant, and we feel they are relevant to the interceptor control problem
because they bear on the general problem of control of nonlinear systems using
neural networks.

A.  Singular  Perturbations  and  Time  Scales 

Consider a (usually nonlinear) dynamical system described by

$$\frac{dx_1}{dt} = f(x_1, x_2)$$

$$\epsilon \frac{dx_2}{dt} = g(x_1, x_2, u)$$

with external input $u$, boundary conditions $x_1(0)$ and $x_2(0)$, and $\epsilon$ a small positive
parameter. Systems which can be described in this way are said to be singularly perturbed as a
result of the parameter $\epsilon$. Typically we identify $\epsilon$ as the time constant of the “fast” state
$x_2$ and we identify $x_1$ as the “slow” state of the system. This separation of the system into
fast and slow modes results in multiple time scales for use in describing the system behavior.
Other  phenomena  associated  with  such  systems  include  order  reduction,  loss  of  some  initial 
conditions, and boundary layer formation (region of rapid transition). The theory of singular
perturbations and time scales (SPATS) is well-developed [21]. There are three aspects
of  the  theory  that  we  will  mention:  degeneration,  asymptotic  expansion,  and  time-scale 
decomposition. 

(i) Degeneration: By setting the small parameter $\epsilon = 0$ we obtain the degenerate (low-order)
problem

$$\frac{dx_1^{(0)}}{dt} = f(x_1^{(0)}, x_2^{(0)})$$

$$0 = g(x_1^{(0)}, x_2^{(0)}, u)$$

where $x_1^{(0)}(t)$ and $x_2^{(0)}(t)$ represent the degenerate solutions of the slow and fast modes of
the system, respectively. In general, because the order has been reduced, this problem can
be solved more easily than the full problem. Hence, in degeneration, our main interest is to
find conditions under which the full problem tends to the degenerate problem, because then
we may concentrate on the lower-order problem. A theorem due to Tikhonov [21] relates
the solutions of the two problems, under the restriction of various assumptions.

(ii) Asymptotic Expansion: To obtain the full solution of the singularly perturbed system,
suppose the solution has the form

$$x_1(t, \epsilon) = x_{1o}(t, \epsilon) + x_{1c}(\tau, \epsilon)$$

$$x_2(t, \epsilon) = x_{2o}(t, \epsilon) + x_{2c}(\tau, \epsilon)$$

where $x_{1o}(t, \epsilon)$ and $x_{2o}(t, \epsilon)$ (the outer solutions), and $x_{1c}(\tau, \epsilon)$ and $x_{2c}(\tau, \epsilon)$ (the boundary
layer correction solutions), are all assumed to have asymptotic expansions, and $\tau = (t - t_0)/\epsilon$
is the stretched time variable. The outer solutions are close to the exact solutions ($x_1(t, \epsilon)$ and
$x_2(t, \epsilon)$, respectively) for values away from $t = t_0$ or outside the boundary layer, while the
boundary layer corrections are significant only near $t = t_0$ or inside the boundary layer. Thus
we ask that all the terms in the expansions for $x_{1c}(\tau, \epsilon)$ and $x_{2c}(\tau, \epsilon)$ tend to zero as
$\tau \to \infty$. By computing these asymptotic expansions
it is possible to separate the analysis of the singularly perturbed system into two parts: fast
and slow. The complete details about when it is possible to compute the expansions and
how to actually obtain them can be found in the literature [21].


(iii) Time Scale Decomposition: Often one is interested in controlling a singularly perturbed
system. One approach to this is to decompose the problem into two control problems: one
for the fast subsystem and one for the slow subsystem. In this case the external input might
be a state feedback control law of the form

$$u = K_s x_1 + K_f x_2$$

where $K_s$ and $K_f$ are the gain matrices for the slow and fast subsystems, respectively. The
highlight  of  this  approach  is  that  it  is  often  computationally  much  simpler  to  obtain  the 
composite  controls  from  the  lower-order  subsystems  than  to  obtain  a  completely  coupled 
feedback  control  for  the  original  higher-order  system.  We  believe  that  it  is  this  aspect  of 
SPATS  that  is  particularly  applicable  to  the  design  of  convergent  learning  algorithms  in 
neural  networks. 
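
To make the degeneration idea in item (i) concrete, the following Python sketch integrates a small linear singularly perturbed system and its degenerate (reduced-order) counterpart with forward Euler steps. The particular system ($f = -x_1 + x_2$, $g = -x_2 + u$), the constant input, the value of $\epsilon$, the zero initial conditions, and the step size are all assumptions chosen for illustration; they are not taken from this report.

```python
def simulate_full(eps, u=1.0, t_end=2.0, dt=1e-3):
    """Full system: dx1/dt = -x1 + x2,  eps*dx2/dt = -x2 + u,  x1(0) = x2(0) = 0."""
    x1, x2 = 0.0, 0.0
    history = [(0.0, x1, x2)]
    steps = int(round(t_end / dt))
    for n in range(1, steps + 1):
        dx1 = -x1 + x2
        dx2 = (-x2 + u) / eps
        x1, x2 = x1 + dt * dx1, x2 + dt * dx2
        history.append((n * dt, x1, x2))
    return history


def simulate_degenerate(u=1.0, t_end=2.0, dt=1e-3):
    """Degenerate problem (eps = 0): 0 = -x2 + u gives x2 = u, so dx1/dt = -x1 + u."""
    x1 = 0.0
    history = [(0.0, x1, u)]
    steps = int(round(t_end / dt))
    for n in range(1, steps + 1):
        x1 = x1 + dt * (-x1 + u)
        history.append((n * dt, x1, u))
    return history


eps = 0.01
full = simulate_full(eps)
reduced = simulate_degenerate()

# Inside the boundary layer the fast state is still far from its degenerate value ...
print("x2 at t = 0.005:", full[5][2])
# ... but after a few multiples of eps it has settled, and x1 follows the reduced model.
print("x2 at t = 0.05: ", full[50][2])
print("x1 full vs. degenerate at t = 2:", full[-1][1], reduced[-1][1])
```

The degenerate model cannot satisfy the initial condition on $x_2$, which is the loss of an initial condition mentioned above; the mismatch is confined to the short boundary layer near $t = 0$.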

B.  Hopfield  Networks  with  Learning 

Figure 9: Recursive neural network.

Next consider the Hopfield-type recursive neural network shown in Figure 9. We assume
that the dynamic behavior of this interacting system of neurons is governed by the following
set of nonlinear differential equations [22]:

$$C_i \frac{dx_i}{dt} = -\frac{x_i}{R_i} + \sum_{j=1}^{N} w_{ij} h(x_j) + y_i$$

for $i = 1, \ldots, N$. In this equation we identify

$N$: number of neurons.

$x$: output potential of a neuron.

$C$: cell capacitance.

$R$: membrane resistance.

$w$: synaptic weight.

$h(x)$: sigmoidal function describing the firing rate.

$y$: external input to the net.

(i) Computing: Typically, such a system is used in an operational or “computing” mode,
with  the  weight  vectors  fixed  to  give  the  network  some  desired  property.  For  example,  the 
Hopfield  network  might  be  used  as  an  associative  memory.  Because  the  network  is  described 
by  a  system  of  nonlinear  differential  equations,  it  contains  a  number  of  equilibrium  points. 
When  run  in  an  operational  mode  the  network  receives  an  input  vector  which  acts  as  a  forcing 
function  (or  initial  condition)  for  the  system  equations.  Eventually  the  system’s  trajectory 
converges  to  an  equilibrium,  which  defines  the  output  pattern.  Because  the  equilibria  of  such 
nonlinear  systems  usually  have  a  non-trivial  basin  of  attraction  it  is  possible  to  converge  to 
the  same  equilibrium  point  if  we  input  a  vector  which  is  different  from  the  original  input  but 
inside  the  region  of  attraction  [23].  In  this  way  the  network  acts  as  a  memory,  associating 
a  particular  output  pattern  with  a  particular  input  pattern.  This  feature  leads  to  obvious 
applications for pattern recognition problems in image and signal processing. Other applications
of Hopfield nets are in the area of optimization. In this case the weights are preset to
values  that  are  determined  by  the  optimization  problem  one  wishes  to  solve.  If  this  is  done 
properly,  the  equilibrium  state  of  the  net  will  define  the  solution  to  the  problem  [16]. 

A  key  question  in  the  use  of  Hopfield  nets  is  how  to  pick  the  weights  so  the  net  solves  a 
given  optimization  problem  or  stores  a  desired  equilibrium  point.  One  answer  to  the  question 
has been given in [24]. In this work it is shown how to design the weights a priori so that
the system has a desired set of equilibria. The solution involves solving a linear programming
problem (which can be done by a second net).

(ii) Learning: An alternative approach to selecting the weights in a Hopfield net to produce
a  desired  equilibrium  point  is  to  introduce  learning.  That  is,  we  design  an  algorithm  which 
adjusts  the  weights  online  until  the  network  output  is  satisfactory.  We  call  this  a  “learning” 
mode  of  operation  for  the  Hopfield  neural  network  and  will  refer  to  the  “Hopfield  net  with 
learning”  or  “the  learning  Hopfield  network.”  Note  that  learning  is  not  a  feature  in  most 
analyses  of  the  Hopfield  network,  although  it  commonly  arises  in  applications  of  feedforward 
neural  networks. 

The next obvious question is what learning algorithm should be used. For the
continuous Hopfield network described above, a possible weight adjustment algorithm might be

$$\lambda \frac{dw_{ij}}{dt} = x_j \left[ x_i - \sum_{k=1}^{N} w_{ik} x_k \right] e_i.$$

Here $\lambda$ could be thought of as the time constant of the change of the synaptic weight, and
$e_i = (x_i^d - x_i)$ is the error between the desired equilibria and the actual equilibria. $\lambda$ is also
called the learning gain and is chosen so that the learning proceeds at a slower rate than
the time constant of the neural net. This is a logical extension of the idea of learning in
feedforward nets. We see the learning as a loop operating around the input/output behavior
of the net. As in feedforward nets, choosing the learning gain would likely be one of the
most difficult tasks in developing a Hopfield neural network which can learn. Further, a
more fundamental problem is that of determining the best structure for a learning algorithm
that can ensure convergence of the learning process, while maintaining overall system
stability. The problem is hard because of the highly coupled nonlinearities inherent in the
process.
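
The computing mode described in item (i) of this subsection can be sketched numerically. The Python fragment below integrates the network equations, $C\,dx_i/dt = -x_i/R + \sum_j w_{ij} h(x_j) + y_i$, with the weights held fixed until the state settles. The choice $h = \tanh$, the unit values of $R$ and $C$, the weight matrix, the input vector, the two initial states, and the Euler step are assumptions made only for this illustration.

```python
import math


def hopfield_derivative(x, w, y, r=1.0, c=1.0):
    """Right-hand side dx_i/dt = (-x_i/R + sum_j w_ij*h(x_j) + y_i) / C with h = tanh."""
    n = len(x)
    return [
        (-x[i] / r + sum(w[i][j] * math.tanh(x[j]) for j in range(n)) + y[i]) / c
        for i in range(n)
    ]


def compute(x0, w, y, dt=0.01, steps=5000):
    """Run the net in computing mode (weights fixed) with forward Euler integration."""
    x = list(x0)
    for _ in range(steps):
        dx = hopfield_derivative(x, w, y)
        x = [x[i] + dt * dx[i] for i in range(len(x))]
    return x


# Hypothetical two-neuron net with small symmetric weights.
w = [[0.0, 0.5],
     [0.5, 0.0]]
y = [0.2, -0.1]

# Two different initial states end up at the same equilibrium (output pattern).
x_a = compute([1.0, -1.0], w, y)
x_b = compute([-0.5, 0.8], w, y)
print("equilibrium from first start: ", x_a)
print("equilibrium from second start:", x_b)
```

With weights this small the sketch has a single equilibrium, so every start reaches it; larger weights can give several equilibria, each with its own region of attraction, which is the situation exploited for associative memory.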

C.  An  Example  of  SPATS  in  a  Learning  Hopfield  Net 


(i)  Interpretation  of  Computing  and  Learning:  Consider  a  Hopfield  net  with  learning  as 
described  in  the  previous  subsection.  We  can  interpret  the  computing  mode  of  operation 
defined  by  the  system  equations  as  a  “fast”  synaptic  event  associated  with  a  small  time 
constant  ( RC ),  whereas  the  learning  (synaptic  weight  change)  can  be  thought  of  as  a  “slow” 
process  associated  with  a  large  time  constant  (A).  Thus  the  computing  and  learning  processes 
together  constitute  a  “time-scale”  behavior  in  the  recursive  neural  network.  This  makes  it 
natural  to  consider  applying  SPATS  techniques  to  the  convergence  analysis  and  design  of 
recursive  neural  networks  with  learning. 

In order to fit the learning and computing equations into the standard framework of
SPATS we recast the net equations as

$$\frac{dw}{dt} = f(w, x, e)$$

$$\epsilon \frac{dx}{dt} = g(w, x, y),$$

with the initial conditions $x(0)$ and $w(0)$. Here the neuron states $x$ (representing the
computing process) and the network weights $w$ (representing the learning process) have been
redefined as fast and slow variables of order $n$ and $n^2$, respectively, $\epsilon$ is a small positive
parameter representing the time constant associated with the fast state $x$, $e$ is the error
vector, and $y$ is the input vector of the net. Note that we have absorbed the learning gain
$\lambda$ into $f(\cdot)$. Also note that $\epsilon$ is in some sense an artificial constant that can be chosen by
the designer. The key point is that learning should proceed more slowly than computing. Hence
there should be a singularly perturbed character inherent in the process. This is introduced
via the parameter $\epsilon$. We believe that this set of equations, called the singularly perturbed
neural network equations, will exhibit all the characteristic features of singularly perturbed
systems and, consequently, we expect to be able to take advantage of this in the analysis and
design of ANNs and ANN-based systems.

(ii) Example: To illustrate that the Hopfield net with learning does in fact exhibit a
singularly perturbed character, consider a two-neuron Hopfield net. Let the net be described
by the computing equations

$$\epsilon \dot{x}_1 = -x_1 + w_{11} h(x_1) + w_{12} h(x_2) + y_1$$

$$\epsilon \dot{x}_2 = -x_2 + w_{21} h(x_1) + w_{22} h(x_2) + y_2$$

and let the weight update equations be

$$\dot{w}_{11} = x_1 e_1$$

$$\dot{w}_{12} = x_1 e_2$$

$$\dot{w}_{21} = x_2 e_1$$

$$\dot{w}_{22} = x_2 e_2$$

Here the error is defined by

$$e_i = x_i^d - x_i$$

with $x_i^d$ the desired output state for the $i$th neuron, in response to the input signal $y_i$. The
weight update equations are simpler than the general form suggested above. They were
constructed to be similar in structure to the Delta rule used for training perceptrons.

Figure 10: State trajectory of example system.

Figure 10 shows a typical trajectory for this set of equations when driven with the
input/output pair $y = (1, 1)$, $x^d = (0, 0)$. It can be seen that the system approaches the desired
equilibrium state. We found this to be true for all the training pairs $(y, x^d)$ that we tried.
This is interesting because we did not choose the weights a priori to ensure convergence
to the correct output. Rather, the system learned the correct weight values on its own.
Further, define $\bar{w}_{ij}$ to be the steady-state value of the weights $w_{ij}$ after the complete system
converges. Then let the system be operated independent of the weight adjustment equations,
using the $\bar{w}_{ij}$ as constant weights. In this case, the output of the net will still converge to
the correct values. This suggests that such an approach to learning can be effective. A final
comment on the example system is indicated in Figure 11. Here it is seen that the system
exhibits the classic SPATS characteristic of multiple time scales. We see the initial region
of rapid transition, corresponding to the fast dynamics of the computing portion of the net,
and we also see the long time constant of the learning equations. Thus the Hopfield net with
learning is in fact a singularly perturbed system. Future research activities, as described
below, will consider how to exploit this fact for analysis and design of neural networks.
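
The two-neuron example can be reproduced qualitatively with a few lines of Python. The sketch below integrates the computing equations $\epsilon \dot{x}_i = -x_i + \sum_j w_{ij} h(x_j) + y_i$ together with the weight updates $\dot{w}_{ij} = x_i e_j$, $e_i = x_i^d - x_i$, for the training pair $y = (1, 1)$, $x^d = (0, 0)$. The report does not state the sigmoid, the value of $\epsilon$, the initial conditions, or the integration method, so the logistic function, $\epsilon = 0.1$, zero initial states and weights, and forward Euler stepping used here are assumptions.

```python
import math


def h(x):
    """Assumed sigmoid: the logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate(y=(1.0, 1.0), x_d=(0.0, 0.0), eps=0.1, dt=0.01, t_end=100.0):
    """Integrate the fast neuron states x and the slow weights w together."""
    x = [0.0, 0.0]
    w = [[0.0, 0.0], [0.0, 0.0]]
    history = [(0.0, x[0], w[1][0])]          # (time, x_1, w_21)
    steps = int(round(t_end / dt))
    for n in range(1, steps + 1):
        e = [x_d[i] - x[i] for i in range(2)]
        # Fast "computing" dynamics: eps * dx_i/dt = -x_i + sum_j w_ij*h(x_j) + y_i
        dx = [(-x[i] + w[i][0] * h(x[0]) + w[i][1] * h(x[1]) + y[i]) / eps
              for i in range(2)]
        # Slow "learning" dynamics: dw_ij/dt = x_i * e_j
        dw = [[x[i] * e[j] for j in range(2)] for i in range(2)]
        x = [x[i] + dt * dx[i] for i in range(2)]
        w = [[w[i][j] + dt * dw[i][j] for j in range(2)] for i in range(2)]
        history.append((n * dt, x[0], w[1][0]))
    return x, w, history


x_final, w_final, history = simulate()
print("final neuron states:", x_final)
print("final weights:      ", w_final)
```

With these assumed choices the state first moves quickly toward a quasi-equilibrium and then drifts slowly toward the desired value as the weights adapt, which is the two-time-scale behavior described in the text. The `history` list records $x_1$ and $w_{21}$ so that both time scales can be plotted.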

VII.  Future  Research  Directions 

The  investigations  leading  to  the  results  described  in  the  previous  five  sections  also  led  to 
new  questions.  In  this  final  section  we  briefly  describe  three  of  these  questions.  We  currently 
have research proposals pending with both NSF and AFOSR to continue our investigations
in  each  of  these  areas. 


Figure 11: State $x_1$ and weight $w_{21}$ of the example system.


A. Learning in Feedforward Neural Nets

One problem we have encountered in our research deals with learning and generalization
in neural nets. In most experiments we must use random training pairs to ensure adequate
coverage of the input/output space. However, there is no good basis for such an approach.
We have developed some preliminary ideas for overcoming this problem by viewing it from
a control-theoretic perspective. We discuss two ideas below.

(i) Persistence of Excitation for Neural Nets: Suppose we are given a discrete-time, nonlinear
system defined by the input-output map

$$x_{k+1} = f(x_k, u_k)$$

where $x_k$ is the output of the system and $u_k$ is the input to the system. Also suppose we
have a feedforward neural net defined by

$$\hat{x}_{k+1} = \hat{f}(x_k, u_k, w_k)$$

where $\hat{x}_k$ is the output of the neural network and $w_k$ is a vector of the weights of the net
at time $t = k$. It is assumed that the input signal $u_k$ is available to the net. Further, we
suppose that the weights are updated according to the algorithm

$$w_{k+1} = g(x_k, w_k, e_k)$$

where $e_k = x_k - \hat{x}_k$. This is depicted in Figure 12. Note that the learning algorithm does not
necessarily need to be backpropagation, but does represent a dynamical system with respect
to the weights (just as backpropagation does). This interpretation is somewhat limited in
that it precludes consideration of some types of learning schemes, such as ones based on
competitive learning [25], clustering [26], or random mappings within the net architecture
[27,2,28]. However, it is an accurate depiction of many learning algorithms.


System: $x_{k+1} = f(x_k, u_k)$
Net: $\hat{x}_{k+1} = \hat{f}(x_k, u_k, w_k)$
Learning: $w_{k+1} = g(x_k, w_k, e_k)$


Figure  12:  Learning  as  a  nonlinear  control  problem. 


The  problem  we  are  interested  in  is  this:  for  the  configuration  shown  in  Figure  12,  with 
g  fixed,  how  should  the  training  inputs  uk  be  chosen  to  ensure  that  the  net  will  learn  the 
training  pairs  and  will  also  give  the  correct  output  for  inputs  it  has  never  seen?  Typically 
this  is  done  by  choosing  uk  randomly,  according  to  some  distribution.  An  alternate  approach 
is  to  introduce  the  issue  of  persistence  of  excitation. 
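
The configuration of Figure 12, and the role played by the training input, can be illustrated with a deliberately simple stand-in for the net. In the Python sketch below the "system" is a hypothetical scalar linear map, the "net" is linear in its two weights, and the learning rule $g$ is a normalized gradient (projection) update. None of these choices come from this report, and a real feedforward net would be nonlinear in its weights. The sketch compares a constant training input with a random one.

```python
import random


def system(x, u):
    """Hypothetical plant: x_{k+1} = f(x_k, u_k) = 0.5*x_k + u_k."""
    return 0.5 * x + u


def net(x, u, w):
    """Stand-in for the net: xhat_{k+1} = fhat(x_k, u_k, w_k) = w[0]*x_k + w[1]*u_k."""
    return w[0] * x + w[1] * u


def learn(inputs):
    """Run the loop w_{k+1} = g(x_k, w_k, e_k) over a sequence of training inputs."""
    x, w = 0.0, [0.0, 0.0]
    e = 0.0
    for u in inputs:
        x_next = system(x, u)
        e = x_next - net(x, u, w)              # prediction (modelling) error
        norm2 = x * x + u * u
        if norm2 > 0.0:
            # Normalized gradient step: make the net exact on the current sample.
            w = [w[0] + e * x / norm2, w[1] + e * u / norm2]
        x = x_next
    return w, e


def weight_error(w, w_true=(0.5, 1.0)):
    """Distance between the learned weights and the weights that reproduce the plant."""
    return ((w[0] - w_true[0]) ** 2 + (w[1] - w_true[1]) ** 2) ** 0.5


rng = random.Random(0)
w_const, e_const = learn([1.0] * 200)
w_rand, e_rand = learn([rng.uniform(-1.0, 1.0) for _ in range(200)])
print("constant input: last error", e_const, " weight error", weight_error(w_const))
print("random input:   last error", e_rand, " weight error", weight_error(w_rand))
```

In this sketch both inputs drive the prediction error toward zero, but only the random input brings the weights to the values that reproduce the plant for inputs it has not seen. The constant input leaves a residual weight error because the training data eventually explores only one direction of the input space.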

The problem of persistence of excitation has been studied extensively in the adaptive
control community. The following example illustrates the concept [29]. Consider a linear
plant with known structure, but unknown parameters

$$\dot{y} = -a_p y + b_p u$$

The task is to find a feedback control input $u$ so that the output of the closed-loop system
tracks the reference model

$$\dot{y}_m = -a_m y_m + b_m r(t)$$

In [29] the following control law is suggested. Let

$$u(t) = a_y(t) y(t) + a_r(t) r(t)$$

where the parameters $a_y(t)$ and $a_r(t)$ are adjusted according to the adaptation law

$$\dot{a}_y = -\mathrm{sgn}(b_p) \gamma e y$$

$$\dot{a}_r = -\mathrm{sgn}(b_p) \gamma e r$$

Here $e = y - y_m$ is the tracking error, $\gamma$ is an adaptation gain, and the sign of $b_p$ must be
known. It can be shown that with this control law and adaptation algorithm

$$\lim_{t \to \infty} e(t) = 0$$

If we examine this control system we see that the ideal choice of the controller gains is
$a_y^* = (a_p - a_m)/b_p$ and $a_r^* = b_m/b_p$. A logical question at this point is whether the
parameters converge to the optimal values when the error converges to zero. The (surprising)
answer is: sometimes. It turns out that it is possible to have the error converge to zero even
when  the  controller  parameters  converge  to  some  value  other  than  the  optimal  gains.  For  the 
example  above  this  can  occur  when  the  reference  input  r(t)  is  a  simple  step  input.  However, 
if  the  input  to  the  system  is  a  sinusoidal  signal,  then  it  can  be  shown  that  the  parameters 
converge  to  their  optimal  values  as  the  tracking  error  goes  to  zero.  We  describe  this  by  saying 
that  the  sinusoidal  input  is  persistently  exciting  for  the  first-order  system,  but  the  step  input 
is  not.  In  general,  it  turns  out  that  to  identify  2n  parameters  in  a  linear  system  (of  order  n) 
it  is  necessary  to  include  n  sinusoids  of  different  frequency  in  the  reference  signal.  (We  must 
know  the  order  of  the  system  and  the  sign  of  the  coefficient  of  the  highest-order  derivative 
of  the  forcing  function  of  the  plant.) 
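
The difference between a step and a sinusoidal reference can be observed in simulation. The Python sketch below integrates the plant, the reference model, and the adaptation law of this example with forward Euler steps. The numerical values ($a_p = -1$, $b_p = 3$, $a_m = b_m = 4$, $\gamma = 2$, a step of height 4, and $r(t) = 4 \sin 3t$), the zero initial conditions, and the integration step are assumptions chosen for illustration, not values stated in this report.

```python
import math


def simulate(reference, a_p=-1.0, b_p=3.0, a_m=4.0, b_m=4.0, gamma=2.0,
             t_end=30.0, dt=1e-3):
    """Model-reference adaptive control of  dy/dt = -a_p*y + b_p*u.

    Reference model: dy_m/dt = -a_m*y_m + b_m*r(t)
    Control law:     u = a_y*y + a_r*r
    Adaptation:      da_y/dt = -sgn(b_p)*gamma*e*y,  da_r/dt = -sgn(b_p)*gamma*e*r
    """
    sgn = 1.0 if b_p > 0 else -1.0
    y = y_m = a_y = a_r = 0.0
    steps = int(round(t_end / dt))
    for n in range(steps):
        r = reference(n * dt)
        e = y - y_m                              # tracking error
        u = a_y * y + a_r * r
        dy = -a_p * y + b_p * u
        dy_m = -a_m * y_m + b_m * r
        da_y = -sgn * gamma * e * y
        da_r = -sgn * gamma * e * r
        y, y_m = y + dt * dy, y_m + dt * dy_m
        a_y, a_r = a_y + dt * da_y, a_r + dt * da_r
    return y - y_m, a_y, a_r


# Ideal gains a_y* = (a_p - a_m)/b_p and a_r* = b_m/b_p for the assumed values.
a_y_star = (-1.0 - 4.0) / 3.0
a_r_star = 4.0 / 3.0

for name, ref in [("step", lambda t: 4.0),
                  ("sinusoid", lambda t: 4.0 * math.sin(3.0 * t))]:
    e, a_y, a_r = simulate(ref)
    print(name, "final error:", e,
          "gain errors:", a_y - a_y_star, a_r - a_r_star)
```

Printing the gain errors next to the tracking error separates the two notions of convergence discussed here: the tracking error can vanish for either reference, while the gains are only guaranteed to reach their ideal values when the reference is persistently exciting.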

It is possible to extend these ideas to some classes of nonlinear systems. For instance, in
[29] it is shown how to achieve adaptive control, with parameter convergence, of the system

$$\dot{y} = -a_p y - c_p f(y) + b_p u$$

where $f(\cdot)$ is a known nonlinear function and $a_p$, $b_p$, and $c_p$ are unknown parameters. It
is also shown in [29] how to develop adaptive controllers (based on sliding-mode control
principles) for systems of the form

$$y^{(n)} + \sum_{i=1}^{n} a_i f_i(x, t) = b u$$

where $x = (y, \dot{y}, \ldots, y^{(n-1)})^T$ is the state vector, the $f_i(\cdot)$’s are known nonlinear functions
and the $a_i$’s and $b$ are unknown parameters. Unfortunately, even for these relatively tame
systems  it  is  only  possible  to  give  very  general  conditions  for  parameter  convergence  and 
persistence  of  excitation.  For  instance,  it  is  not  yet  possible  to  give  a  nice  result  on  the 
number  of  sinusoids  required  in  the  input  signal. 

Now  consider  the  implications  these  observations  have  for  the  problem  of  learning  in 
neural  nets.  First,  although  not  apparent  from  the  brief  comments  above,  it  is  likely  that 
any  successful  learning  scheme  will  involve  feedback  of  the  modelling  error  into  the  input 
signal $u_k$ shown in Figure 12. This is true in the linear case and in the nonlinear examples
given  in  [29].  However,  our  original  interpretation  of  learning  in  a  neural  net  shown  in 
Figure  12  did  not  consider  such  feedback,  but  rather  assumed  an  arbitrary  plant  input  signal 
for  training.  This  suggests  that  our  learning  algorithm  should  affect  not  only  the  weight 
update,  but  also  the  plant  input.  That  is,  we  should  control  the  plant  at  the  same  time  we 
are  identifying  it. 

Second,  even  with  feedback  in  place,  it  will  be  necessary  that  we  have  some  idea  about 
what  constitutes  a  persistently  exciting  input  signal  for  learning  with  neural  nets.  Because 
this  question  does  not  have  a  satisfactory  answer  for  even  the  simple  nonlinear  example  given 
above  (where  the  unknown  weights  are  linearly  related  to  the  nonlinear  functions,  as  opposed 
to  the  neural  net  case  where  the  unknown  weights  are  embedded  in  the  nonlinearity),  it  will 
likely  be  hard  to  find  a  satisfactory  answer  for  the  neural  net  case.  This  suggests  the  need 
for  more  basic  research  on  the  topic  of  nonlinear  adaptive  control. 

A  third  comment  is  that  if  our  objective  is  to  control  a  system  using  a  neural  network  it 
may  not  be  essential  that  we  achieve  convergence  of  the  weights  to  the  exact  values  of  the 
optimal  net  which  models  a  system  or  its  inverse.  If  the  control  objective  is  achieved,  then  it 
may  not  be  important  that  the  parameters  have  converged.  However,  from  the  perspective 
of  learning,  our  main  interest  is  when  the  weights  of  the  net  converge  to  values  that  are 
correct  in  the  sense  of  providing  the  best  possible  representation  of  the  desired  nonlinear 
mapping  over  the  desired  region  of  the  input  space. 

(ii) A Stochastic Approach to Learning: Searching for a persistently exciting input for a
system is a deterministic approach to the learning problem. The goal is to find a deterministic
sequence of inputs that will excite the system to produce a type of basis for the output space,
so we can ensure adequate representation of the input-output space. An alternate approach
is to obtain the necessary coverage of the input-output space through the use of random
input signals. This is the approach used in almost every example of training given in the
literature. One shortcoming of this approach when used with most training algorithms,
such  as  backpropagation,  is  that  the  training  algorithm  does  not  take  into  account  the  fact 
that  the  input  sequence  is  random.  In  this  section  we  discuss  an  approach  to  stochastic 
training  based  on  optimal  filter  theory  (i.e.,  Kalman  filtering)  which  does  take  into  account 
the  randomness  of  the  input  sequence. 

The results we describe were first reported by Wabgaonkar and Stubberud [30]. An
implicit  assumption  in  this  work  is  that  there  exists  a  network  that  will  solve  the  problem 
and  that  we  know  the  number  of  neurons  in  its  hidden  layer.  In  practice  this  may  not  be 
true.  However,  a  practical  learning  system  based  on  this  approach  would  probably  assume 
some  number  of  neurons,  perform  the  estimation,  increase  the  number,  and  iterate  until  the 
error  is  reduced  to  an  acceptable  level.  Thus  this  is  a  reasonable  assumption  and  is  implicit 
in  many  other  papers  on  neural  nets  (in  the  Narendra  paper  [8],  for  example). 

We  briefly  summarize  the  Kalman  filter  [31].  Consider  a  dynamical  system  defined  by 

$$x_{k+1} = A x_k + w_k$$
$$z_k = H x_k + v_k$$

Here $z_k$ is our output measurement, the system matrices $A$ and $H$ are known, and $w_k$ and
$v_k$ are uncorrelated white-noise sequences representing input noise and measurement noise,
respectively, with known covariance matrices. The Kalman filtering problem is to find an
estimate $\hat{x}_k$ of the system state $x_k$ such that the mean-square value of the error
$e_k = x_k - \hat{x}_k$ is minimized. The solution to this problem is well-known and is given by

$$\hat{x}_k = \bar{x}_k + K_k (z_k - H \bar{x}_k)$$

where $\bar{x}_k$ is the estimate predicted from the previous step and $K_k$ is a time-varying
gain matrix obtained by solving a Riccati equation. Note that
this  problem  statement  was  for  a  linear  system.  For  nonlinear  systems  a  similar  solution  to 
the  state  estimation  problem  can  be  developed.  This  is  called  an  extended  Kalman  filter 
and  involves  a  linearization  of  the  nonlinear  system. 
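
The predict/update cycle summarized above can be illustrated with a minimal scalar sketch in Python. It is only an illustration under stated assumptions: the scalars `a`, `h`, `q` and `r` stand in for the matrices $A$ and $H$ and for the two noise covariances, and the function names and numerical values are hypothetical, not taken from [31].

```python
def kalman_step(x_est, p, z, a=1.0, h=1.0, q=0.0, r=1.0):
    """One predict/update cycle of a scalar Kalman filter.

    x_est, p : previous state estimate and its error variance
    z        : new measurement
    a, h     : scalar stand-ins for the system matrices A and H (assumed values)
    q, r     : variances of the input noise w_k and measurement noise v_k
    """
    # Predict: propagate the estimate and its variance through the model.
    x_pred = a * x_est
    p_pred = a * p * a + q
    # Gain from the scalar form of the Riccati recursion.
    k = p_pred * h / (h * p_pred * h + r)
    # Update: correct the prediction with the measurement residual.
    x_new = x_pred + k * (z - h * x_pred)
    p_new = (1.0 - k * h) * p_pred
    return x_new, p_new, k


def run_filter(measurements, x0=0.0, p0=1.0, **model):
    """Apply kalman_step to each measurement and return the estimate history."""
    x, p = x0, p0
    history = []
    for z in measurements:
        x, p, _ = kalman_step(x, p, z, **model)
        history.append(x)
    return history


if __name__ == "__main__":
    # Constant state observed repeatedly; the estimate moves toward 5.0.
    estimates = run_filter([5.0] * 20)
    print(estimates[0], estimates[-1])
```

With the default values (no input noise, unit measurement variance, unit initial variance) the filter reduces to a running average that includes the initial guess, which makes the behavior of the gain easy to check by hand.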

Now, suppose that we wish to learn an input-output mapping $y_k = f(u_k)$. As mentioned
earlier, we assume that a solution exists using a standard feedforward net (one hidden layer
with a fixed number of neurons) with nonlinearities in the neurons defined by

$$g^j(x) = a_0^j + b_1^j \cos(2\pi a_2^j x) + a_1^j \sin(2\pi a_2^j x)$$

Notice that the form of the assumed nonlinearity causes the net to perform a type of Fourier
series approximation of the unknown function. Next define a vector $x$ composed of all the
weights and biases, together with all the parameters $a_0^j$, $b_1^j$, $a_1^j$, and $a_2^j$. If a real net exists
that implements the nonlinear function of interest, then the vector $x$ of the real net will be
a constant, so the net could be described by

$$x_{k+1} = x_k$$
$$d_k = y_k + v_k = f_k(x_k, u_k) + v_k$$

where $x$ is the state of the true net, $y$ is the actual output, $u$ is the actual input, and $v$ is
viewed as random measurement noise (which has some assumed statistics) which corrupts
the actual output, giving the desired or measured output $d_k$.

Given  this  model  of  the  unknown  function  it  is  straightforward  to  derive  an  extended 
Kalman  filter  for  the  system.  Simulation  results  arising  from  such  an  analysis  indicate  that 
the  method  works  quite  well  [30].  However,  these  results  considered  only  one-  and  two- 
neuron  systems.  It  will  be  important  to  see  how  well  the  method  will  work  for  systems  with 
a  larger  number  of  neurons.  Also,  it  is  not  clear  that  the  use  of  sinusoidal  nonlinearities  is 
optimal.  More  work  must  be  done  to  determine  the  best  choice  of  neuron  transfer  functions. 
The  important  thing  to  note  is  that  by  using  a  Kalman  filter  to  estimate  the  parameters 
of  the  net  (i.e.,  the  weights)  we  are  able  to  develop  a  training  algorithm  which  takes  into 
consideration  the  random  nature  of  the  input  during  training. 
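
To make the idea of treating the weights as a constant state concrete, the following Python sketch trains a single weight with a scalar extended Kalman filter. It is a toy illustration under explicit assumptions, not the method of [30]: the one-weight `tanh` neuron is a hypothetical stand-in for the sinusoidal nonlinearity discussed above, the targets are noise-free so the run is repeatable, and all names and numerical values are invented for the example.

```python
import math


def neuron(w, u):
    """Hypothetical one-weight 'net': y = tanh(w * u) (assumed form)."""
    return math.tanh(w * u)


def ekf_train(inputs, targets, w0=0.5, p0=1.0, r=0.01):
    """Estimate the constant weight w from (u_k, d_k) pairs with a scalar EKF.

    State model:        w_{k+1} = w_k
    Measurement model:  d_k = tanh(w_k * u_k) + v_k,  var(v_k) = r
    """
    w, p = w0, p0
    for u, d in zip(inputs, targets):
        y = neuron(w, u)
        # Linearize the measurement about the current estimate: dy/dw.
        h = u * (1.0 - y * y)
        # Kalman gain, then state and variance update.
        k = p * h / (h * p * h + r)
        w = w + k * (d - y)
        p = (1.0 - k * h) * p
    return w, p


if __name__ == "__main__":
    w_true = 1.0
    us = [0.2 * (1 + i % 5) for i in range(50)]   # inputs cycle over 0.2 ... 1.0
    ds = [neuron(w_true, u) for u in us]          # noise-free targets
    w_hat, p_hat = ekf_train(us, ds)
    print(w_hat, p_hat)
```

Because the state equation is the identity, the prediction step is trivial and only the measurement update does any work; in a net with many weights, `w`, `h` and `p` would become a vector, a Jacobian row and a covariance matrix.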

(iii) Proposed Research Activities: In light of the discussion presented above, the following
research questions are posed on two specific problems.

1. Persistence of Excitation and Feedback in Learning: We propose to study the development
of learning algorithms for feedforward networks using traditional adaptive control
techniques as a point of departure. We consider two different tracks of investigation.

(a)  Persistence  of  excitation:  We  propose  to  study  the  role  of  persistence  of  excitation 
in  the  generation  of  training  data.  We  first  restrict  ourselves  to  feedforward  nets 
with  one  hidden  layer  and  sigmoidal  nonlinearities.  We  assume  that  such  a  net 
accurately  describes  an  unknown  nonlinear  dynamical  system  for  some  set  of 
optimal  weights.  We  then  ask  the  question:  What  is  the  nature  of  a  persistently 
exciting  input  for  learning  in  a  scheme  such  as  in  Figure  12  so  the  weights  will 
converge  to  the  optimal  weights?  We  will  attempt  to  answer  this  question  in  as 
general  a  way  as  possible,  but  expect  that  the  answers  will  have  to  be  given  for 
specific  classes  of  functions  to  be  learned. 

(b) Feedback: As noted above, most adaptive control schemes use feedback of the
identification error or the tracking error into the input. Such feedback is unusual in the
learning algorithms typically used for training feedforward nets. We propose to
investigate the use of feedback as an aid to training.

2. Kalman Filtering Approach to Learning: We propose to study in more depth the optimal
filtering approach of [30]. Specifically we will consider the choice of nonlinearity
to use in the net. We will consider this from the perspective of the representation
problem from approximation theory. In particular, we will consider a marriage of the
Kalman filtering approach to learning to the radial basis function approach to the
representation problem.

B. Learning in Recursive Neural Nets

The  discussion  in  section  VI  regarding  singular  perturbations  and  time  scales  applied  to 
neural  nets  suggests  the  following  research  questions. 

1. SPATS Analysis of Neural Networks: We have seen how a Hopfield net with learning
could be interpreted as a singularly perturbed system and we presented an example to
show that such a system does in fact produce multiple time-scale behavior. The first
step is to perform a complete analysis of the general N-neuron Hopfield net with learning
from the perspective of SPATS. In particular we will demonstrate that the Hopfield
net with learning satisfies the assumptions necessary to compute both degenerate solutions
and asymptotic expansions for the system. We will also compute such solutions
and expansions for representative examples. It is expected that such an analysis will
offer insight into how to ensure convergence of the net with learning. Note that the
analysis requires that we assume a structure for the learning algorithm. Initially, we
will begin with the Delta rule-type equation given in the example above. However, as
the research progresses we expect to gain insight into the type of learning law that will
be most effective. This insight can be gained by first applying SPATS analysis techniques
to the preliminary learning algorithm. This also leads to the following problem.
Suppose we are given a system

e^r  = 

The question is what function $g(\cdot, \cdot, \cdot)$ satisfying

$$\frac{dw}{dt} = g(w, x, e)$$

will cause $e \to 0$ (or some other criterion)? This problem is an inverse singular perturbation
problem, and is novel in the study of SPATS. However, this problem arises
in the design of learning algorithms for a Hopfield net. We propose to address this
problem as part of our future research activities.

2.  SPAT-Based  Design  of  Learning  in  Neural  Networks:  One  feature  of  SPATS  is  that  it 
is  often  possible  to  decouple  the  fast  and  slow  modes  of  the  singularly  perturbed  system 
for  the  purpose  of  analysis  and  design.  We  will  attempt  to  do  this  for  the  Hopfield 
network  with  learning.  We  can  then  design  stabilizing  controls  separately  for  both 
the  computing  (fast)  modes  and  the  learning  (slow)  modes.  If  this  is  successful  it 
may  be  possible  to  design  learning  algorithms  that  guarantee  convergence  without  the 
usual  problems  of  local  minima.  At  the  least,  we  should  be  able  to  develop  sufficient 
conditions  for  such  convergence. 

3. Systems Containing Neural Networks: The third goal of the research is to consider the
design of neural network controllers for dynamical systems. Consider Figure 13, where
we suppose that the controller C is a continuous Hopfield network with
learning. In this case the closed-loop system will also exhibit a singularly perturbed
character. There are two special cases that can arise.


Figure  13:  Unity  feedback  control  system. 


(a)  Normal  Plant:  In  this  case  we  suppose  that  our  plant  is  a  linear  or  nonlinear 
system.  If  we  also  suppose  that  the  neural  net  is  learning  at  some  rate  that  is 
significantly  slower  than  the  closed-loop  dynamics,  then  we  can  view  the  closed 
loop  system  as  a  singularly  perturbed  system.  We  can  then  decompose  the  entire 
closed-loop  system  equations  into  their  fast  and  slow  modes. 

(b)  Singularly  Perturbed  Plant:  In  this  case  we  will  have  a  closed-loop  system  which 
exhibits  time  scale  behavior.  This  is  true  for  two  sub-classes:  (1)  the  neural  net 
is  used  in  an  operational  mode  and  (2)  the  neural  net  is  used  in  a  learning  mode. 
In either case the closed-loop system will have both fast and slow modes.

We propose to use SPATS to specify how to design both the learning and computational
parts of a neural network to ensure learning convergence and closed-loop stability for
the different cases enumerated above.

C.  Kalman  Filtering  and  Real-Time  Adaptive  LQR  Control 

In  future  work  we  expect  to  develop  the  application  of  neural  nets  to  solving  optimal 
control  problems  in  more  detail.  In  particular,  the  following  topics  are  currently  under 
study. 

1. Further investigation of the structure of the neural net. We are interested in the fact
that the neural net will be sparsely interconnected for distant-horizon problems due to
the structural issues we mentioned above.

2. As noted previously, we are developing numerical examples using P-SPICE to illustrate
the  technique  and  we  will  compare  these  examples  to  conventional  solutions  of  optimal 
control  problems. 

3.  We  hope  to  apply  the  method  to  solving  general  Riccati  equation  problems.  This  will 
allow  us  to  extend  the  method  from  the  LQR  problem  to  other  quadratic  cost  problems 
such  as  LQG  and  Kalman  filtering.  The  following  logic  leads  us  to  believe  this  is 
possible.  We  have  shown  that  a  neural  net  can  be  used  to  solve  an  LQR  optimal  control 
problem.  However,  the  conventional  approach  to  solving  such  a  problem  involves  the 
solution  of  a  Riccati  equation.  Thus  we  can  say  that  the  neural  net  is  solving  a  Riccati 
equation.  Now  consider  a  Kalman  filtering  problem.  It  is  well  known  that  this  problem 
is  also  solved  by  solving  a  Riccati  equation.  Hence  we  propose  the  following  approach  to 
solving  the  Kalman  filtering  problem.  First,  map  from  the  Kalman  filtering  problem  to 
its  associated  Riccati  equation.  Then  map  from  the  Riccati  equation  to  its  associated 
optimal  control  problem.  Then  solve  the  optimal  control  problem  using  a  neural  net. 
Finally,  map  back  from  the  neural  net  solution  of  the  optimal  control  problem,  to  the 
Riccati  equation  solution,  and  then  to  the  Kalman  filtering  solution.  This  approach  is 
currently  being  considered  by  an  unfunded  graduate  student. 

4.  Finally,  we  are  interested  in  using  the  technique  in  real-time  adaptive  control  schemes. 
In  [7]  we  showed  how  to  identify  the  state-space  model  of  a  linear  system  using  a 
neural  net.  We  can  use  this,  together  with  the  optimal  control  result  given  above,  to 
suggest  the  configuration  shown  in  Figure  14.  As  shown,  the  identification  net  passes 
the  model  to  the  optimal  control  net,  which  computes  the  optimal  feedback  gains.  We 
are  interested  in  studying  the  closed-loop  stability  and  convergence  properties  of  this 
scheme. 

Figure 14: Real-time, adaptive control using neural networks.

ABSTRACT


Techniques were investigated to modify the BSTING fire
control system to make it more responsive and provide
continuously updated aiming corrections. The ultimate goal
was a "point and shoot" capability while still retaining a
simple, gun-mounted stand-alone system. A correction scheme was
devised that accounted for the fact that azimuth and elevation
rates are measured along the gun axis rather than the required
line-of-sight axis. Also investigated was the error that
results if this correction is not included. Another technique
investigated locked the gun to the laser beam during the first
two seconds while the initial solution is computed and then the
gun was unlocked to allow continuously updated solutions. An
investigation was made to see how system response could be
improved by using adaptive Kalman filtering and increasing the
data sampling rates. An analysis was performed to determine the
sensitivity of aiming corrections to the accuracy of the sensor
measurements. A brief analysis showed the conditions where
elevation angle must be included in the FCS algorithms and the
"small angle" approximation is not appropriate. A new fire
control system using externally provided measurements was
designed and evaluated.

CHAPTER I
INTRODUCTION 
Objectives  and  Approach 

While  the  current  BSTING  system  has  shown  significant 
promise,  its  greatest  limitation  is  that  it  requires  several 
seconds  for  measuring  parameters,  performing  computations, 
implementing  the  aiming  corrections  and  relocating  the  laser 
spot  on  the  target  before  the  weapon  is  ready  to  fire.  Also, 
once  the  solution  is  implemented,  the  process  must  be  repeated 
before  the  gun  can  be  realigned  with  respect  to  the  laser  beam. 
This limitation arises mainly because, while the laser is used to measure
the range and range rate, gun elevation and azimuth rates are
measured by the gun itself, which requires the gun to be locked to
the laser beam while measurements are made.

Ideally it would be desirable to make measurements,
compute corrections and execute them continuously. In other
words, the system would provide almost a "point and shoot"
capability.

OBJECTIVES


The purpose of this research was to determine if it is
possible to design a gun-mounted, stand-alone system that
would allow continuous gun aiming solutions rather than require
discrete time periods for tracking, computation and aiming.

Also it is desirable that the system provide and implement
aiming corrections in a minimum amount of time.

First,  an  investigation  was  done  to  see  if  the  current 
BSTING  algorithms  could  be  modified  to  provide  a  continuous 
aiming  solution.  Secondly,  a  fire  control  system  using 
externally  provided  measurements  was  designed  and  evaluated. 
Finally,  an  investigation  was  made  to  see  how  system  response 
could  be  improved  (i.e.  capability  approaching  "point  and 
shoot")  through  (a)  use  of  adaptive  Kalman  filtering  and  (b) 
increasing  data  sampling  rates. 

APPROACH 

As  an  initial  step,  an  investigation  was  made  to  see  if 
the  current  BSTING  fire  control  algorithms  could  be 
theoretically  modified  to  provide  continuous  tracking, 
computation and aiming. A more adaptable and capable BSTING-type
system is desirable because of the inherent simplicity of
the system with its minimum of required measurements and
completely "stand-alone" characteristics. This initial design
incorporated a correction technique that compensated for
the fact that in the BSTING system, azimuth and elevation rates
are measured along the gun boresight rather than the laser beam
line-of-sight as is required for most accurate corrections.

In  order  to  reduce  the  number  of  parameters  that  had 
to  be  investigated  and  optimized,  a  brief  investigation  was  done 
to  determine  the  sensitivity  of  aiming  corrections  to  the 
accuracy  of  the  sensor  measurements.  Because  the  correction  in 
the  azimuth  plane  due  to  movement  of  the  gun  platform  with 
respect  to  the  target  is  most  critical,  and  the  largest  source 
of  error,  this  parameter  was  used  in  determining  sensitivities. 

The  previous  theoretical  analysis  showed  it  was  possible 
to  adapt  the  BSTING  algorithms  to  provide  continuously 
updated aiming corrections. However, this result was based on
"unnoisy" data. Thus it was next necessary to include the noisy
sensor measurements that result when the gunner is aiming the
gun from a moving platform at a stationary target to see if
the conclusions were valid under more realistic conditions. The
noisy measurement data was simulated by generic noise profiles
superimposed on the theoretically required measurements. The
noise profiles were based on data obtained during the BSTING
flight test program (August-September 1990).

As mentioned previously, in the current BSTING system, the
azimuth and elevation rates are measured along the gun
boresight axis, though computation of corrections is based on
sensor measurements along the laser beam axis. To compensate
for  this  difference,  measurements  are  now  made  when  the  gun  is 
locked  to  the  laser  beam  axis.  An  investigation  was  made  to 
determine  the  aiming  error  that  results  if  the  gun  is 
"unlocked"  from  the  laser  beam  and  the  azimuth  and  elevation 
rate  sensors  move  with  the  gun  when  making  measurements. 
Subsequently,  this  analysis  was  extended  to  include  a  technique 
where  the  gun  was  locked  to  the  laser  beam  during  the  first  two 
seconds  while  the  initial  solution  is  computed.  This  follows 
the  technique  used  in  the  current  BSTING  system.  After  the 
initial  computation  was  determined,  then  the  gun  was  "unlocked" 
so  that  measurements  could  be  used  to  provide  a  continuously 
updated  solution  using  these  computed  aiming  corrections  as 
initial  conditions. 

The simple BSTING algorithms do not use the elevation angle
in  gravity  correction.  Thus  an  analysis  was  done  to  determine 
the  magnitude  of  the  error  resulting  from  neglecting  this 
measurement.  The  flight  conditions  where  the  elevation  angle 
measurement  should  be  included  were  identified. 

An  alternate  fire  control  system  was  designed.  This  FCS 
uses  external  measurement  of  parameters  in  lieu  of  (or  in  addition 
to)  sensors  mounted  on  the  gun  itself  as  is  the  case  with 
the  current  BSTING  system.  The  goal  was  a  more  responsive  fire 
control  system  that  can  provide  "point  and  shoot"  capability. 
Initially,  this  FCS  was  designed  around  ideal,  that  is  unnoisy, 
sensor  measurements.  Subsequently,  the  analysis  was  extended 
using  more  realistic  and  noisy  data.  Again  the  noise  profiles 
used  were  based  on  data  obtained  during  the  BSTING  flight  test 
program. 

Finally, a rather rigorous analysis was conducted to
determine if system response could be improved by
(a) optimizing the filter gains, (b) using adaptive Kalman
filtering and (c) increasing data sampling rates. These
techniques were applied to the BSTING algorithms.

CHAPTER II

MODIFICATIONS  TO  BSTING  SYSTEM 
Continuous  Solution  -  Theoretical 

As  an  initial  step,  an  investigation  was  made  to  see  if 
the  current  BSTING  fire  control  algorithms  could  be  modified  to 
provide  continuous  tracking,  computation  and  aiming.  A  more 
adaptable  and  capable  BSTING-type  system  is  desirable  because 
of the inherent simplicity of the system with its minimum of
required  measurements  and  completely  "stand-alone"  character. 

ANALYSIS  OF  BSTING  AZIMUTH  AIMING  ERROR 

Currently, the BSTING system makes range and azimuth/
elevation measurements while the gun is "locked" to the laser
beam. Thus the azimuth and elevation rates measured by the
gun mounted sensor are the same as the desired LOS (laser)
rate. However, if the two are "unlocked" (required to implement
the correction) as would be required for a continuous computing
system, the gun mounted rate sensors would not be measuring the
correct rates and therefore the resulting aiming corrections
would be in error. This error would be directly related to the
rate of change of the elevation and azimuth corrections.

The  correction  in  the  azimuth  plane  due  to  motion  of  the 
gun  platform  is  much  more  critical  to  system  accuracy  than  the 
rate  corrections  in  the  elevation  plane,  at  least  for  straight 
and  level  flight  maneuvers.  In  the  BSTING  system,  the  azimuth 
plane  is  the  plane  parallel  to  the  gun  boresight  axis,  whereas 
the  elevation  plane  is  normal  to  the  ground.  Thus  the  azimuth 
rate  is  directly  related  to  the  platform's  horizontal  velocity. 
Because  azimuth  rate  errors  are  the  greatest  source  of  error, 
this parameter will be used in the subsequent analysis. Refer
to Figure II-1 for the correction in the azimuth plane.

Figure II-1

Correction in Azimuth Plane
(Platform Moving at Velocity $V_{a/c}$ at Slant Range RSL)

From the figure

$$AZ_{gun} = AZ_{laser} + AZ_{corr} \qquad \text{(II-1)}$$

$$AZ_{corr} = \frac{RSL}{V_{muz}\cos(EL_{corr})}\,\dot{AZ}_{laser} \qquad \text{(II-2)}$$

where: $V_{muz}$ is the muzzle velocity

$\dot{AZ}_{laser}$ is the azimuth rate measured by the rate sensor
along the centerline of the laser beam

$EL_{corr}$ is the aiming correction in the elevation plane

Now when the rate sensor is mounted on the gun, the correction
would be

$$AZ'_{corr} = \frac{RSL}{V_{muz}\cos(EL_{corr})}\,\dot{AZ}_{gun} \qquad \text{(II-3)}$$



For straight and level flight past a stationary target,
the azimuth rate along the laser centerline can be determined
using the following figure:

Figure II-2

Determination of Azimuth Rate in Laser Beam Plane

$$\dot{AZ}_{laser} = \frac{V_{a/c}\cos(AZ_{laser})}{RSL} = \frac{V_{a/c}\,R_{min}}{RSL^2} \qquad \text{(II-4)}$$


If the azimuth correction were based on the gun mounted rate
gyro measurement, as is the case for the current BSTING system,
the measured azimuth rate would be

$$\dot{AZ}_{gun} = \frac{V_{a/c}\cos(AZ_{gun})}{RSL} = \frac{V_{a/c}\cos(AZ_{laser} + AZ_{corr})}{RSL} \qquad \text{(II-5)}$$

If $\cos(EL_{corr}) \approx 1$, which is the case for low altitudes,
then the error in correction is given by

$$AZ_{corr} - AZ'_{corr} = \frac{RSL}{V_{muz}}\{\dot{AZ}_{laser} - \dot{AZ}_{gun}\} = \frac{V_{a/c}}{V_{muz}}\{\cos(AZ_{laser}) - \cos(AZ_{laser} + AZ_{corr})\} \qquad \text{(II-6)}$$

where $AZ_{corr}$ is given by equation II-2.



The  error  in  azimuth  aiming  correction  due  to  having 
the  rate  sensor  mounted  on  the  gun  (BSTING  system)  rather  than 
on  the  laser  beam  (required  for  maximum  accuracy)  axis  was 
estimated  using  equation  II-6  for  several  flight  scenarios. 
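
For readers who want to experiment with equation II-6, the short Python sketch below evaluates the error for a single geometry. It is an illustration under assumptions rather than a transcription of the report's program: the unit conversions follow the DIFFPR.BAS listing, the positive sign convention for the azimuth rate follows equations II-4 and II-5 as written (the BASIC listing uses a negative rate), and the muzzle velocity of 2800 ft/s and the slant range are hypothetical values chosen only for the example.

```python
import math

KNOTS_TO_FPS = 88.0 / 60.0 / 0.8684   # same conversion as in DIFFPR.BAS
METERS_TO_FEET = 3.28                 # same conversion as in DIFFPR.BAS


def azimuth_error(az_laser, v_ac, rsl, v_muz):
    """Equation II-6 with cos(EL_corr) ~ 1.

    az_laser : line-of-sight azimuth, radians
    v_ac     : platform velocity, ft/s
    rsl      : slant range, ft
    v_muz    : muzzle velocity, ft/s
    Returns the aiming error AZ_corr - AZ'_corr in radians.
    """
    az_rate_laser = v_ac * math.cos(az_laser) / rsl   # Eq. II-4
    az_corr = rsl * az_rate_laser / v_muz             # Eq. II-2 with cos(EL_corr) = 1
    return v_ac / v_muz * (math.cos(az_laser) - math.cos(az_laser + az_corr))


if __name__ == "__main__":
    v_muz = 2800.0                                    # hypothetical muzzle velocity, ft/s
    rsl = 500.0 * METERS_TO_FEET * math.sqrt(2.0)     # assumed slant range at 45 degrees
    for knots in (75.0, 125.0, 175.0, 200.0):
        err = azimuth_error(math.radians(45.0), knots * KNOTS_TO_FPS, rsl, v_muz)
        print(knots, "kts:", err * 1000.0, "mrad")
```

Note that with $\cos(EL_{corr}) \approx 1$ the slant range cancels out of $AZ_{corr}$, so in this simplified form the error depends only on the azimuth angle and the ratio of platform velocity to muzzle velocity.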


SCENARIOS

CASE   AIRCRAFT VELOCITY   MINIMUM SLANT RANGE   ALTITUDE ABOVE TARGET
1      75 kts              500 mtrs              250 ft
2      125 kts             1000 mtrs             250 ft
3      175 kts             1500 mtrs             250 ft
4      200 kts             2000 mtrs             250 ft

The results of this analysis are shown in Figure II-3.

Figure II-3

Azimuth Aiming Error Due to Mounting Rate Sensor
Along Gun Boresight Rather Than Laser Beam Centerline
(Flight starts at 45 degrees azimuth and ends a 9
(curves recoverable from legend: 175 kts, 1500 mtrs; 200 kts, 2000 mtrs)

From Figure II-3 it can be seen that the aiming error when
rates are measured along the gun boresight versus along the laser beam
centerline in the azimuth plane is substantial, especially at
high platform velocities and large azimuth angles.


THEORETICAL  TECHNIQUE  TO  REDUCE  AIMING  ERROR 

If equation II-1 is differentiated with respect to time,

$$\dot{AZ}_{gun} = \dot{AZ}_{laser} + \dot{AZ}_{corr}$$

If $k = V_{muz}\cos(EL_{corr})/RSL$, then from equation II-2

$$\dot{AZ}_{laser} = k\,AZ_{corr}$$

therefore

$$\dot{AZ}_{gun} = k\,AZ_{corr} + \dot{AZ}_{corr}$$

or

$$\dot{AZ}_{corr} = -k\,AZ_{corr} + \dot{AZ}_{gun} \qquad \text{(II-7)}$$

which has the solution

$$AZ_{corr} = Q e^{-kt} + \dot{AZ}_{gun}/k \qquad \text{(II-8)}$$

where $t$ is the sampling period time interval.

As a first try for Q, assume

at $t = 0$, $AZ_{corr} = AZ_{corrp}$ (the previous correction)

therefore

$$Q = AZ_{corrp} - \dot{AZ}_{gun}/k$$

finally

$$AZ_{corr} = AZ_{corrp}\,e^{-kt} - (e^{-kt} - 1)\,\dot{AZ}_{gun}/k \qquad \text{(II-9)}$$



Figure II-4 shows the improvement in azimuth aiming correction
when the azimuth correction defined by equation II-9 is used.

The error can be further reduced if a "correct" initial
correction is used. For example, this can be determined
by locking the laser beam with respect to the gun during an initial computation
period and using the result as the initial condition. The initial
correction then becomes

$$AZ_{corr} = AZ_{initial}\,e^{-kt} - (e^{-kt} - 1)\,\dot{AZ}_{gun}/k \qquad \text{(II-10)}$$

Subsequently, the previous solution would be used (i.e.
equation II-9). The improvement in aiming error is shown in
Figure II-5.
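
The recursion described by equations II-9 and II-10 can be sketched in a few lines of Python. This is an illustrative sketch only: the function names, the constant gun rate and the value of `k` are hypothetical, and `k` is held fixed here although in the report it varies with slant range.

```python
import math


def update_correction(az_corr_prev, az_rate_gun, k, dt):
    """Equation II-9: advance the azimuth correction over one sampling interval dt."""
    decay = math.exp(-k * dt)
    return az_corr_prev * decay - (decay - 1.0) * az_rate_gun / k


def track(az_corr_initial, gun_rates, k, dt):
    """Start from an initial correction (Equation II-10), then apply II-9 repeatedly."""
    az_corr = az_corr_initial
    history = []
    for rate in gun_rates:
        az_corr = update_correction(az_corr, rate, k, dt)
        history.append(az_corr)
    return history


if __name__ == "__main__":
    # Hypothetical values: constant gun azimuth rate of 0.05 rad/s, k = 2.0 1/s,
    # and a 0.2 s sampling interval.
    corrections = track(0.0, [0.05] * 50, k=2.0, dt=0.2)
    print(corrections[0], corrections[-1])
```

For a constant gun rate the correction relaxes toward the steady-state value `az_rate_gun / k`, and a good initial correction simply shortens that transient, which is the motivation given above for locking the gun to the laser during the initial computation period.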

Figure II-4

Improvement in Azimuth Aiming Correction When
Equation II-9 is Used
(AZ CORR = AZ CORR PREVIOUS; horizontal axis: AZIMUTH - LOS (degrees);
curves: 75 kts, 500 mtrs; 125 kts, 1000 mtrs; 175 kts, 1500 mtrs; 200 kts, 2000 mtrs)

Figure II-5

Improvement in Azimuth Aiming Correction When
Equations II-9 and II-10 are Used
(vertical axis: DIFFERENCE - milliradians; horizontal axis: AZIMUTH - LOS (degrees);
curves: 75 kts, 500 mtrs; 125 kts, 1000 mtrs; 175 kts, 1500 mtrs; 200 kts, 2000 mtrs)

The logic of this correction technique was applied to the
BSTING fire control algorithms. First, for the first three
seconds, range, azimuth rate and elevation rate were used to
compute the azimuth and elevation corrections. The gun and
laser beam moved together during this period. Next, one second
was used to implement the solution and reaim the beam onto the
target. Finally, the gun was allowed to move independent of the
laser beam, but still used to keep the beam on the target. The
correction algorithm described by equations II-9 and II-10 was
used. The value of $AZ_{initial}$ was the value of the aiming
correction solution at the end of the calculation period. The
following scenario was used:

A/C VELOCITY   RANGE AT CLOSEST APPROACH   ALTITUDE
150 knots      1000 mtrs                   250 feet

Figure II-6 shows the gun and laser beam azimuth rates.
Incidentally, this correction technique was not applied to the
elevation rates since the difference in elevation rate between
gun and laser is significantly less, though the correction
could be applied here as well.

Figure II-6

Azimuth Rates "Measured" Along Gun Boresight and Laser Beam
Difference Results Because of the Constantly
Changing Aiming Correction
(horizontal axis: TIME - seconds; curves: GUN, LASER)

Figure II-7 shows the azimuth and elevation corrections
that would be applied with the current BSTING algorithm. Figure
II-8 shows the azimuth and elevation corrections that would be
available using continuously updated rate measurements.

The  analysis  shows  that  at  least  theoretically  the  BSTING 
fire  control  system  could  be  modified  by  revising  the 
algorithms  so  that  they  could  provide  continuously  updated 
aiming  corrections. 


Figure II-7

Azimuth and Elevation Corrections Using Current BSTING
Algorithms (Non-noisy Measurements)
(vertical axis: milliradians)

Figure II-8

Azimuth and Elevation Corrections Using Current Modified
BSTING Algorithms (Non-noisy Measurements)

COMPUTER PROGRAM: DIFFPR.BAS

PURPOSE: Computes Azimuth Aiming Error Due to Mounting Rate
Sensor Along Gun Boresight Rather Than Laser Beam
Centerline.

RESULTS: Figure II-3

10 OPEN "O", #1, "B:AZDIFF.DAT"
20 PI = 3.1416
30 INPUT "Aircraft Velocity (knots) = ?", VAC
40 INPUT "Range at Closest Approach (meter) = ?", RMIN
45 RMIN = RMIN*3.28 ' meters to feet
50 INPUT "Altitude (feet) = ?", H
60 INPUT "Muzzle Velocity (fps) = ?", VMUZ
70 VA = VAC*88/60/.8684 ' knots to feet per second
80 INPUT "Time Increment = ?", DT
90 RBASE = RMIN - VA*T ' along-track distance to closest approach (T starts at 0)
100 RSL = SQR(RMIN^2 + RBASE^2 + H^2)
110 AZRATE = -VA*RMIN/RSL/RSL
120 AZLOS = RBASE/RSL
130 AZLOS = ATN(AZLOS/SQR(1 - AZLOS^2)) ' arcsine of RBASE/RSL
140 AZLOS1 = AZLOS*180/PI
150 AZCORR = RSL*AZRATE/VMUZ
160 IF AZLOS < 0 THEN GOTO 210
170 DIFF = VA*(COS(AZLOS) - COS(AZLOS + AZCORR))/VMUZ ' equation II-6
180 PRINT T, RSL, AZLOS1, AZRATE*1000, AZCORR*1000, DIFF*1000
185 PRINT #1, AZLOS1, DIFF*1000
190 T = T + DT
200 GOTO 90
210 END
COMPUTER PROGRAM: ACTCORR.BAS

PURPOSE: Computes the Theoretical Continuous Azimuth
         Aiming Solution Using Either Equation II-9 or II-10
         (Depending on the Initial Correction Used)

RESULTS: Figures II-4 and II-5

10 OPEN "I", #1, "B:AZDIFF75.DAT"
20 OPEN "O", #2, "B:AZDIFF75.OUT"
30 PI = 3.1416
40 INPUT "Muzzle Velocity (fps) = ?", VMUZ
50 INPUT "Elevation Correction (degrees) = ?", ELCORR
60 ELCORR = ELCORR*PI/180
65 INPUT #1, T, RSL, AZLOS, AZRATE, AZDOTGUN, AZCORR, DIFF
70 AZCORR1 = AZCORR ' initial correction from the first record
80 INPUT #1, T, RSL, AZLOS, AZRATE, AZDOTGUN, AZCORR, DIFF
90 K = VMUZ*COS(ELCORR)/RSL
95 AZCORR1 = AZCORR1*EXP(-.2*K) - (EXP(-.2*K) - 1)*AZDOTGUN/K
122 DIFF1 = AZCORR1 - AZCORR
130 PRINT AZLOS, AZCORR*1000, AZCORR1*1000, DIFF*1000, DIFF1*1000
134 PRINT #2, AZLOS, DIFF1*1000
140 GOTO 80
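Read in isolation, the update in line 95 of ACTCORR.BAS has the same form as the exact one-step solution of a first-order lag, dA/dt = -K*A + AZDOTGUN, with K and AZDOTGUN held constant over a 0.2-second step. The Python sketch below reproduces only that recursion. The function names and sample values are illustrative assumptions, and because Equations II-9 and II-10 lie outside this section, the correspondence to them is assumed rather than confirmed.

```python
import math

def update_correction(az_corr, az_dot_gun, k, dt=0.2):
    """One step of the recursion used in line 95 of the listing."""
    decay = math.exp(-dt * k)
    return az_corr * decay - (decay - 1.0) * az_dot_gun / k

def run(az_corr0, az_dot_gun, k, steps, dt=0.2):
    """Repeat the update with constant inputs (an assumption made for illustration)."""
    a = az_corr0
    for _ in range(steps):
        a = update_correction(a, az_dot_gun, k, dt)
    return a

if __name__ == "__main__":
    k = 2840.0 / 3289.5          # hypothetical: muzzle velocity / slant range, 1/s
    u = -0.09                    # hypothetical gun azimuth rate, rad/s
    print(run(0.0, u, k, 200), u / k)
```

With constant inputs the recursion settles at AZDOTGUN/K, which is why the two printed values agree after enough steps.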
COMPUTER PROGRAM: AZDATAPR.BAS

PURPOSE: Computes the Input Data Used for the Theoretical
         Continuous Aiming Solution Calculated by Equations
         II-9 and II-10.

RESULTS: Inputs for Programs Used for Figures II-4 and II-5

10 OPEN "O", #1, "B:AZDIFF75.DAT"
20 PI = 3.1416
30 INPUT "Aircraft Velocity (knots) = ?", VAC
40 INPUT "Range at Closest Approach (meter) = ?", RMIN
50 RMIN = RMIN*3.28 ' meters to feet
60 INPUT "Altitude (feet) = ?", H
70 INPUT "Muzzle Velocity (fps) = ?", VMUZ
80 VA = VAC*88/60/.8684 ' knots to feet per second
90 INPUT "Time Increment = ?", DT
100 RBASE = RMIN - VA*T
110 RSL = SQR(RMIN^2 + RBASE^2 + H^2)
120 AZRATE = -VA*RMIN/RSL/RSL
130 AZLOS = RBASE/RSL
140 AZLOS = ATN(AZLOS/SQR(1 - AZLOS^2))
150 AZLOS1 = AZLOS*180/PI
160 AZCORR = RSL*AZRATE/VMUZ
170 IF AZLOS < 0 THEN GOTO 240
180 AZDOTGUN = -VA*COS(AZLOS + AZCORR)/RSL
190 DIFF = VA*(COS(AZLOS) - COS(AZLOS + AZCORR))/VMUZ
200 PRINT T, RSL, AZLOS1, AZRATE*1000, AZDOTGUN*1000, AZCORR*1000, DIFF ' line truncated in the source listing
210 PRINT #1, T, RSL, AZLOS1, AZRATE, AZDOTGUN, AZCORR, DIFF
220 T = T + DT
230 GOTO 100
240 END
CHAPTER III

SENSITIVITY ANALYSIS
BSTING System

A brief investigation was done to determine the sensitivity
of aiming corrections to the accuracy of the sensor measurements.
Because the correction in the azimuth plane due to movement of
the gun platform with respect to the target is the most critical
and the largest source of error, this parameter was used in
determining sensitivities.

Since the algorithms used in the BSTING System probably represent
the simplest possible technique for determining the
correction, they were used in this investigation. In the BSTING
system, the azimuth aiming correction is

$$\sin(AZ_{corr}) = -\frac{RSL \times \dot{AZ}}{V_{muz}} \tag{III-1}$$

where: $RSL$ = Slant Range to Target
       $\dot{AZ}$ = Azimuth Rate Measured by Rate Sensor
       $V_{muz}$ = Muzzle Velocity

During measurements the gun is locked to the laser beam so that

$$\dot{AZ} = \dot{AZ}_{laser} = \dot{AZ}_{gun}$$
Therefore the azimuth correction is sensitive to three
parameters: slant range, azimuth rate and muzzle velocity.
This dependence can be expressed by the relationship:

$$\Delta AZ_{corr} = \frac{\partial Corr}{\partial RSL}\,\Delta RSL + \frac{\partial Corr}{\partial \dot{AZ}}\,\Delta \dot{AZ} + \frac{\partial Corr}{\partial V_{muz}}\,\Delta V_{muz} \tag{III-2}$$


Using equation III-1, the sensitivity coefficients can be
expressed as:

PARAMETER                                    DESCRIBING EQUATION

$\partial Corr / \partial RSL$               $TERM \times \dot{AZ} / V_{muz}$

$\partial Corr / \partial \dot{AZ}$          $TERM \times RSL / V_{muz}$

$\partial Corr / \partial V_{muz}$           $-TERM \times RSL \times \dot{AZ} / V_{muz}^{2}$

where

$$TERM = \frac{1}{\sqrt{1 - (RSL \times \dot{AZ} / V_{muz})^{2}}}$$

The above sensitivities were computed for several
scenarios. In each case the helicopter flew past the target
on a straight and level trajectory. The scenarios used are
shown in table III-1.
TABLE III-1

SCENARIOS USED IN SENSITIVITY ANALYSIS

SCENARIO    VELOCITY    RANGE AT CLOSEST     ALTITUDE
NO.         (knots)     APPROACH (meters)    (feet)

1           75          500                  250
2           150         500                  250
3           175         1000                 250
4           175         1500                 250
5           175         2000                 250
6           175         2500                 250

The resulting sensitivities are shown in figures III-1 through
III-3.
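As a rough cross-check of the coefficient expressions, the Python sketch below evaluates the three partial derivatives at one point of a straight and level flyby. It follows the arithmetic of the coefficient table and the unit conversions used in the program listings. The function names, the sample point, and the 2860 ft/s muzzle velocity (the value that appears in the SENSPR.BAS listing) are illustrative assumptions, not a reproduction of the report's results.

```python
import math

KNOTS_TO_FPS = 88.0 / 60.0 / 0.8684   # knots -> mph -> ft/s, as in the listings
METERS_TO_FEET = 3.28

def flyby_state(v_knots, rmin_m, alt_ft, t):
    """Slant range (ft) and azimuth-rate magnitude (rad/s) t seconds into the pass."""
    v = v_knots * KNOTS_TO_FPS
    rmin = rmin_m * METERS_TO_FEET
    rbase = rmin - v * t
    rsl = math.sqrt(rmin**2 + rbase**2 + alt_ft**2)
    az_rate = v * rmin / rsl**2
    return rsl, az_rate

def sensitivities(rsl, az_rate, v_muz):
    """Partial derivatives of the correction with respect to RSL, azimuth rate and V_muz."""
    x = rsl * az_rate / v_muz
    term = 1.0 / math.sqrt(1.0 - x * x)
    d_rsl = term * az_rate / v_muz
    d_rate = term * rsl / v_muz
    d_vmuz = -term * rsl * az_rate / v_muz**2
    return d_rsl, d_rate, d_vmuz

if __name__ == "__main__":
    rsl, az_rate = flyby_state(175.0, 1000.0, 250.0, t=0.0)   # scenario 3, start of pass
    d_rsl, d_rate, d_vmuz = sensitivities(rsl, az_rate, v_muz=2860.0)
    print(d_rsl * 1000.0, d_rate, d_vmuz * 1000.0)   # mr/ft, mr per mr/s, mr/fps
```

The rate coefficient is dimensionally a time (slant range divided by muzzle velocity), so it grows with range, which is the trend the sensitivity figures are described as showing.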

In order to determine the error contribution of each term
in equation III-2, it was necessary to estimate the true
measurement accuracy of each of the three sensors used in
computing the azimuth correction. The true accuracy includes
not only the inherent accuracy of the sensor itself but also
the actual accuracy that can be achieved when the sensor is
aimed by the gunner on the moving platform. This "total"
sensor accuracy was determined using representative data from
the BSTING flight tests conducted at the Naval Weapons Center
during August and September of 1990.
A muzzle velocity variation of 10 meters/second (32.8 feet/second)
is assumed, as suggested by Reference III-1. According
to the reference, the main cause of variation in muzzle
velocity is variation in propellant temperature. Thus
this error could be reduced by including a sensor that measures
the temperature in the ammunition container. Referring to
figure III-2, a 10 m/s error in muzzle velocity would result in
no more than about a 1-milliradian error in azimuth correction.

Review of flight test data showed that when the gunner had
the laser beam aimed at the target, the range error was no more
than 10 feet. Figure III-4 shows the results from Pass
No. 6 on August 6, 1990, which are typical. This is unfiltered
data, so the filtered data actually used in
calculating corrections is even better. Combining this with
the sensitivities in figure III-1, the maximum error in azimuth
aiming correction due to range measurements is probably no more
than 0.5 milliradian.

After subtracting the muzzle velocity error (1 milliradian)
and the range measurement error (0.5 milliradian) from the overall
4-milliradian criterion, 2.5 milliradians can be allotted
to azimuth rate sensor measurements. Using a sensitivity of
3 milliradians/mr-per-second (see figure III-3), a rate sensor
accuracy of about 0.8 milliradians-per-second would be required.
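The allocation above is a straight subtraction followed by one division. A minimal Python sketch of that arithmetic is given below, using only the numbers quoted in the text; the function name is hypothetical, and a worst-case linear sum of the error terms is assumed, as the text implies.

```python
def rate_sensor_budget(total_mr, muzzle_mr, range_mr, sens_mr_per_mrps):
    """Return (milliradians left for the rate sensor, required accuracy in mr/s)."""
    remaining = total_mr - muzzle_mr - range_mr
    return remaining, remaining / sens_mr_per_mrps

if __name__ == "__main__":
    remaining, required = rate_sensor_budget(4.0, 1.0, 0.5, 3.0)
    print(remaining, round(required, 2))
```

With the quoted values, 2.5 milliradians remain and 2.5 / 3 is roughly 0.83, which the text rounds to about 0.8 milliradians-per-second.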
[Figure III-1: plot of sensitivity (milliradians/foot) versus azimuth (degrees); legend: 75, 500; 150, 500; 175, 1000; 175, 1500; 175, 2000; 175, 2500]

Figure III-1

Sensitivity of the Correction in Azimuth Due to Variations
in Slant Range Measurements. The Two Numbers in the Legend
are Platform Velocity in knots and Slant Range in meters.
(Flight starts at 45 degrees azimuth and ends at 0-5 degrees)

[Figure III-2: plot of sensitivity (milliradians/fps) versus azimuth (degrees); legend: 75, 500; 150, 500; 175, 1000; 175, 1500; 175, 2000; 175, 2500]

Figure III-2

Sensitivity of the Correction in Azimuth Due to Variations in
Muzzle Velocity Used in Computations. The Two Numbers in the
Legend are Platform Velocity in knots and Slant Range in meters.
(Flight starts at 45 degrees azimuth and ends at 0-5 degrees)

[Figure III-3: plot of sensitivity (milliradians/mr-per-second)]

Figure III-3

Sensitivity of the Correction in Azimuth Due to Variations
in Azimuth Rate Measurements. The Two Numbers in the Legend
are Platform Velocity in knots and Slant Range in meters.
(Flight starts at 45 degrees azimuth and ends at 0-5 degrees)
Greater accuracy is required as range increases. For
example, in the BSTING flight tests the range was usually
limited to 1000 meters; from figure III-3, an
accuracy of about 2 milliradians-per-second was therefore acceptable.

Figure III-5 shows the filtered azimuth rate used in the
azimuth correction versus that actually required for a
representative case, again for Pass 6 on August 7, 1990. This
shows that the gunner is able to aim the gun/laser so that a
2-milliradians-per-second rate sensing accuracy is possible.

While longer ranges were not used during the flight tests,
a qualitative statement can be made. Since longer ranges
result in reduced azimuth rates, it is easier to keep the laser
spot on target, so rate sensing errors should be substantially
less. This should, at least partially, compensate
for the requirement that rate sensing accuracy be better
at longer ranges.


REFERENCES 

III-1 - Fire Control Analysis of Selected Crew Served Weapons.
Thomas B. Knapp and K.P. Pfleger, U.S. Army Armament
Research, Development and Engineering Center,
TR ARFSD-TR-90032, March 1991.
Figure III-5

Comparison of Actual Azimuth Rate with Filtered
Azimuth Rate Sensor Data

(Pass Number 6 - 8/7/90)
COMPUTER PROGRAM: SENSPR.BAS

PURPOSE: Computes the sensitivities of the azimuth correction
         to variations in (1) muzzle velocity, (2) slant range
         measurements and (3) azimuth rate measurements.

RESULTS: Figures III-1, III-2 and III-3

10 OPEN "O", #1, "B:CORRRSL2.DAT"
20 OPEN "O", #2, "B:CORRRTE2.DAT"
30 OPEN "O", #3, "B:CORRMUZ2.DAT"
40 PI = 3.1416
50 G = 32.17
60 INPUT "Aircraft Velocity (knots) = ?", VAC
70 VAC = VAC*88/60/.8684 ' knots to feet per second
80 INPUT "Range at Closest Approach (meters) = ?", RMIN
90 RMIN = RMIN*3.28 ' meters to feet
100 INPUT "Altitude (feet) = ?", H
110 RBASE = RMIN - VAC*T
120 RSL = SQR(RMIN^2 + RBASE^2 + H^2)
130 AZ = RBASE/RSL
140 AZ = ATN(AZ/SQR(1 - AZ^2))
150 AZ1 = AZ*180/PI
160 VMUZ = 2860
170 AZRATE = VAC*RMIN/RSL/RSL
180 COEF = RSL*AZRATE/VMUZ
190 COEF = 1/SQR(1 - COEF^2) ' TERM in the coefficient table
200 DCDRSL = COEF*AZRATE/VMUZ
210 DCDRATE = COEF*RSL/VMUZ
220 DCDVMUZ = COEF*RSL*AZRATE/VMUZ^2
230 PRINT T, AZ1, DCDRSL*1000, DCDRATE, DCDVMUZ*1000
240 PRINT #1, AZ1, DCDRSL*1000
250 PRINT #2, AZ1, DCDRATE
260 PRINT #3, AZ1, DCDVMUZ*1000
270 IF AZ1 < 0 THEN GOTO 400
280 T = T + 1
281 GOTO 110
400 END
CHAPTER IV


NOISE  SIMULATION  AND  FILTER  OPTIMIZATION 
Initial  Investigation 

It  was  shown  in  Chapter  II  that  it  was  possible  to  use 
the  BSTING  concept  for  obtaining  continuously  updated  aiming 
corrections,  at  least  when  theoretical  "unnoisy"  range  and 
azimuth/elevation  sensor  measurements  are  used.  Next,  it  is 
necessary  to  see  if  this  same  conclusion  can  be  made  in  the 
actual  BSTING  environment  with  noisy  sensor  measurements. 

The  technique  used  to  simulate  the  real  situation  was  to 
superimpose  noise  on  the  theoretical  range  and  azimuth/elevation 
rates.  The  noise  to  be  superimposed  was  determined  from  actual 
data  obtained  during  the  BSTING  flight  test  program.  Again  Pass 
No.  6  on  August  7,  1990  was  used.  The  noise  was  obtained  using 
the  following  steps. 

a.  A  tracking  period  was  chosen  from  the  flight  test  records 
where  it  was  obvious  that  the  gunner  was  doing  a  good  job  of 
keeping  the  laser  spot  on  the  target. 

b.  For  this  period  chosen,  mean  values  of  range  and  azimuth/ 
elevation  rates  were  determined  using  a  second-order  curve 
fit  to  the  noisy  flight  test  data. 
c. These mean values were then subtracted from the noisy data
to obtain the "generic" noise required.
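Steps a through c amount to fitting a second-order polynomial to the selected tracking period and keeping the residuals. The Python sketch below shows one way to do that with an ordinary least-squares fit solved through the normal equations. The function names and the synthetic data are assumptions made for illustration; the report does not say how its curve fit was computed.

```python
def quadratic_fit(t, y):
    """Least-squares coefficients (a, b, c) of y ~ a + b*t + c*t**2."""
    # Build the 3x3 normal equations, augmented with the right-hand side.
    s = [sum(ti**k for ti in t) for k in range(5)]
    rhs = [sum(yi * ti**k for ti, yi in zip(t, y)) for k in range(3)]
    m = [[s[i + j] for j in range(3)] + [rhs[i]] for i in range(3)]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            f = m[r][col] / m[col][col]
            for k in range(col, 4):
                m[r][k] -= f * m[col][k]
    # Back substitution.
    coef = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        acc = m[i][3] - sum(m[i][j] * coef[j] for j in range(i + 1, 3))
        coef[i] = acc / m[i][i]
    return tuple(coef)

def generic_noise(t, y):
    """Residuals of the data about the fitted second-order mean."""
    a, b, c = quadratic_fit(t, y)
    return [yi - (a + b * ti + c * ti * ti) for ti, yi in zip(t, y)]

if __name__ == "__main__":
    t = [0.22 * i for i in range(20)]                       # hypothetical sample times
    y = [2.0 + 3.0 * ti - 0.5 * ti * ti + (0.1 if i % 2 else -0.1)
         for i, ti in enumerate(t)]                          # synthetic "noisy" record
    print(generic_noise(t, y)[:4])
```

Because the fit includes a constant term, the extracted residuals average to zero over the fitted period, which is what makes them usable as noise to superimpose on a theoretical signal.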

The resulting generic noise is shown in figures IV-1 through

[Figure IV-1: plot of azimuth rate noise versus time (seconds)]

Figure IV-1

Azimuth Rate Generic Noise from Actual Flight Test Data.
A brief analysis was done to determine if the azimuth rate
filtering could be improved to give better results than the
filter algorithms now used in the current BSTING
system. Incidentally, some optimization had been done when the
original BSTING algorithms were developed. Since the azimuth
correction is the one most sensitive to noise, it was the only
one considered here. Further, optimization of either the range or
elevation rate filters would not significantly affect results.
(Note: later the results of using an adaptive filter that
constantly provides an optimum filter will be presented.)

The filter algorithm in the azimuth plane is:

AZDHAT = TERM2*(1 - 2.0*RHATD*DTA/RHAT)*AZDHAT + KPA*GXHAT    (IV-1)

Where:

AZDHAT is the filtered azimuth rate
RHAT is the filtered slant range
RHATD is the filtered slant range rate
GXHAT is the latest azimuth rate measurement
DTA is the time interval between data samples
TERM2 = 0.8 in the current BSTING algorithm
KPA = 0.2 in the current BSTING algorithm

The filter for the slant range measurements is:

RHAT = C1*RHAT + C2*RHATD + KPR*SENRNG    (IV-2)
RHATD = RHAT*(TERM3 - KVR) + C3*RHATD + KVR*SENRNG
TERM3 = (ELDHAT^2 + AZDHAT^2)*DTR

Where (in addition to terms previously defined):

SENRNG is the latest range measurement
ELDHAT is the filtered elevation rate
C1 = 0.9
C2 = C1*DTR
DTR is the time interval between data samples
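The two filters can be written as single-step update functions. The Python sketch below follows equations IV-1 and IV-2 as stated. The values of KPR and KVR and the expression C3 = 1 - KVR*DTR are taken from the FILTER.BAS listing rather than from the definitions above, and the function names and test values are illustrative assumptions.

```python
def azimuth_filter_step(azdhat, gxhat, rhat, rhatd, dta, term2=0.8, kpa=0.2):
    """Equation IV-1: blend the previous filtered rate with the latest measurement."""
    return term2 * (1.0 - 2.0 * rhatd * dta / rhat) * azdhat + kpa * gxhat

def range_filter_step(rhat, rhatd, senrng, azdhat, eldhat, dtr,
                      c1=0.9, kpr=0.1, kvr=0.1):
    """Equation IV-2: update filtered range, then filtered range rate."""
    c2 = c1 * dtr
    c3 = 1.0 - kvr * dtr            # as assigned in the FILTER.BAS listing
    term3 = (eldhat**2 + azdhat**2) * dtr
    new_rhat = c1 * rhat + c2 * rhatd + kpr * senrng
    # The listing uses the freshly updated range when updating the range rate.
    new_rhatd = new_rhat * (term3 - kvr) + c3 * rhatd + kvr * senrng
    return new_rhat, new_rhatd

if __name__ == "__main__":
    y = 0.0
    for _ in range(50):             # constant 10 mr/s measurement, range held fixed
        y = azimuth_filter_step(y, 10.0, rhat=3000.0, rhatd=0.0, dta=0.22)
    print(y)
```

When the range rate is zero and TERM2 + KPA = 1, the azimuth filter is a plain exponential smoother, so a constant measurement is eventually reproduced without bias.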

Only  one  scenario  was  used  in  this  evaluation  and  it  used 
the  following  parameters,  again  for  a  straight  and  level  flyby 
from  an  azimuth  of  45  to  -45  degrees: 

PLATFORM  VELOCITY  -  175  knots 

MINIMUM  SLANT  RANGE  -  1000  meters 

ALTITUDE  ABOVE  TARGET  -  250  feet 

Figure  IV-4  shows  the  noisy  azimuth  rate  data  when 
the  generic  azimuth  noise  was  added  to  the  theoretical 
azimuth  rate.  This  data  would  be  the  raw  azimuth  rate 
measurements  that  would  be  subjected  to  different  filter 
parameters  in  the  subsequent  optimization  investigation. 
Figure IV-4

"Raw" Azimuth Rate Measurements

In the current BSTING system, the values of KPA and TERM2
are 0.2 and 0.8 respectively. The filtered azimuth rate using
these initial coefficients is shown in figure IV-5.

Figure IV-5

Filtered Results - KPA = 0.2, TERM2 = 0.8, GXHAT(0) = 0

The first improvement consisted of changing the values of
KPA and TERM2 to 0.1 and 0.9 respectively. This has the effect
of placing more dependence on the "old" correction rather than
the "latest" measurement value. This change in coefficients also
results in a more sluggish response. Another change made was to
change the initial value for the azimuth rate. In the current
algorithm, the initial value was assumed to be zero (GXHAT =
0). This was changed so that the initial value was taken to be
equal to the first azimuth rate measurement. The results of
using these new coefficients and initial value for GXHAT are
shown in figure IV-6. These changes resulted in a reduced
difference between filtered and actual azimuth rates. Also, the
change in initial condition led to a reduced initial
difference, and the filtered data began to oscillate about the
actual azimuth rate sooner.
[Figure IV-6: plot of filtered azimuth rate versus time (seconds)]

Figure IV-6

Filtered Results - KPA = 0.1, TERM2 = 0.9,
GXHAT(0) = 1st Measurement

Since these changes improved system response, the
coefficients were further changed to 0.05 and 0.95. The results
of these changes are shown in figure IV-7. Again the initial
value was taken to be one-half the initial azimuth rate sensor
measurement. This showed a further reduction in oscillation
about the nominal rates. However, the sluggishness of the
response was too great.
Figure IV-7

Filtered Results - KPA = 0.05, TERM2 = 0.95,
GXHAT(0) = 1st Measurement

The next excursion was to see the effect of increasing the
sampling rate. Here the increased sampling rate was simulated
by using the same noisy measurements, but the interval between
the data points was simply halved. The results of this increase
in sampling rate are shown in figure IV-8. The increased sampling
rate results in the azimuth rate homing in on the nominal
value faster (i.e. about 2.5 versus 5 seconds).

The results of this preliminary investigation showed that there
could be benefit in increasing the sampling rate.
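The trade-off between smoothing and sluggishness can be illustrated by counting how many samples the azimuth filter needs to reach 90 percent of a constant input when started from zero. The Python sketch below does this for the three coefficient pairs discussed above. It neglects the range-rate term of equation IV-1, uses the 0.22-second sample interval from the FILTER.BAS listing, and picks the 90 percent threshold arbitrarily, so it is an idealized step response rather than a reproduction of the flight-test cases.

```python
def samples_to_settle(term2, kpa, fraction=0.9):
    """Samples needed for y = term2*y + kpa*1.0 to reach `fraction`, starting at 0."""
    y, n = 0.0, 0
    while y < fraction:
        y = term2 * y + kpa * 1.0
        n += 1
    return n

if __name__ == "__main__":
    DT = 0.22   # seconds per sample, from the FILTER.BAS listing
    for kpa, term2 in [(0.2, 0.8), (0.1, 0.9), (0.05, 0.95)]:
        n = samples_to_settle(term2, kpa)
        # Settling time at the nominal rate and at twice the sampling rate.
        print(kpa, term2, n, round(n * DT, 2), round(n * DT / 2.0, 2))
```

Under these assumptions the 0.1/0.9 pair needs 22 samples, which is about 4.8 seconds at the nominal interval and about 2.4 seconds when the interval is halved, in line with the "about 2.5 versus 5 seconds" quoted above. The 0.05/0.95 pair needs roughly twice as many samples again, consistent with the observation that its response was too sluggish.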
Figure IV-8

Filtered Results - KPA = 0.1, TERM2 = 0.9,
GXHAT(0) = 1st Measurement, Twice Sampling Rate
COMPUTER PROGRAM: FILTER.BAS

PURPOSE: Computes the Filtered Azimuth Rate. Used in
         Optimization of Filter Coefficients and Investigating
         Effect of Increasing the Sampling Rate

10 OPEN "I", #1, "C:RAWDATA.DAT" ' illegible in source; assumed from the RAWDATPR.BAS output file
20 OPEN "O", #2, "C:AZRATE.CS4"
30 KPA = .1
40 KPE = .1
50 KPR = .1
60 KVR = .1
70 C1 = .9
80 TERM1 = .9
90 TERM2 = .9
100 DTR = .22
110 C2 = DTR*C1
120 C3 = 1! - KVR*DTR
130 INPUT #1, TIME, RSL, SENRNG, AZRATE, GXHAT
140 R(1) = SENRNG
150 RHAT = SENRNG
160 AZDHAT = GXHAT/1000 ' initial value = first measurement
170 GOSUB 370
180 INPUT #1, TIME, RSL, SENRNG, AZRATE, GXHAT
190 R(2) = SENRNG
200 RHAT = .5*(R(2) + RHAT)
210 RHATD = .5*(SENRNG - RHAT)/DTR ' source prints RHAT and SENRG here
220 GOSUB 400
230 GOSUB 370
240 INPUT #1, TIME, RSL, SENRNG, AZRATE, GXHAT
250 RHAT = (R(1) + R(2) + SENRNG)/3!
260 RHATD = .5*(SENRNG - R(1))/DTR
270 GOSUB 400
280 GOSUB 370
290 INPUT #1, TIME, RSL, SENRNG, AZRATE, GXHAT
300 RHAT = RHAT*C1 + RHATD*C2 + SENRNG*KPR
310 TERM3 = (ELDHAT^2 + AZDHAT^2)*DTR
320 RHATD = RHAT*(TERM3 - KVR) + RHATD*C3 + SENRNG*KVR
330 GOSUB 400
340 GOSUB 370
350 GOTO 290
360 END
370 PRINT TIME, RHAT, AZDHAT*1000
380 PRINT #2, TIME, AZDHAT*1000
390 RETURN
400 AZDHAT = TERM2*(1! - 2!*RHATD*DTR/RHAT)*AZDHAT + KPA*GXHAT/1000
410 RETURN
COMPUTER PROGRAM: RAWDATPR.BAS

PURPOSE: Computes the Slant Range and Azimuth Rate for a Given
         Flight Trajectory. Adds Generic Noise to Theoretical
         Range and Azimuth Rate. Used for Input for Filter
         Optimization Program - FILTER.BAS

RESULTS: Figure IV-4

10 PI = 3.1416
20 OPEN "I", #1, "C:NOISE.DAT"
30 OPEN "O", #2, "C:AZRATE.THE"
40 OPEN "O", #3, "C:AZNOISE.ACT"
50 OPEN "O", #4, "C:RAWDATA.DAT"
60 INPUT "Aircraft Velocity (knots) = ?", VAC
70 INPUT "Range at Closest Approach = ?", RMIN
80 INPUT "Altitude (feet) = ?", H
90 VAC = VAC*88/60/.8684 ' knots to feet per second
100 RMIN = RMIN*3.28 ' meters to feet
110 RBASE = RMIN - VAC*T
120 RSL = SQR(RMIN^2 + H^2 + RBASE^2)
130 AZ = RBASE/RSL
140 AZ = ATN(AZ/SQR(1 - AZ^2))
150 AZRATE = VAC*RMIN/RSL/RSL
160 INPUT #1, T, AZNOISE, ELNOISE, RNGNOISE
170 AZRATEN = AZRATE*1000 + AZNOISE ' noisy azimuth rate, mr/s
180 RNGN = RSL + RNGNOISE ' noisy slant range, feet
190 PRINT T, RSL, RNGN, -AZRATE*1000, -AZRATEN
200 PRINT #4, T, RSL, RNGN, -AZRATE*1000, -AZRATEN
210 PRINT #2, T, -AZRATE*1000
220 PRINT #3, T, -AZRATEN
230 IF T > 24.5 GOTO 250
240 GOTO 110
250 END
CHAPTER V

MEASUREMENTS ALONG GUN BORESIGHT VERSUS LASER BEAM AXIS

In the current BSTING system, the azimuth and elevation
rates are measured along the boresight axis, though computation
of corrections is based on measurements along the laser beam
axis. To compensate for this difference, measurements are made
when the gun is locked to the laser beam axis. In Chapter II a
method for correcting for the difference in measurement axis
was proposed. This would allow the gun to be unlocked from the
laser beam while the measurements are made, a requirement if
a continuous aiming correction is to be achieved.

An investigation was made to determine the aiming error
that results if the gun is "unlocked" from the laser beam and
the azimuth and elevation rate sensors move with the gun when
making measurements. These sensors are then measuring not only
the angular rates of the aircraft with respect to the target,
but also the rate of change of the azimuth and elevation
corrections. This results from the fact that new azimuth and
elevation corrections are constantly being made. Since the
range and range rate measurements are made along the LOS, i.e.
the laser beam, they are not affected. The object here was to
determine the magnitude of the error in aiming that results
because azimuth and elevation rate measurements are made along
the gun barrel axis and not the LOS.


The technique used here included the following steps:

a. The "theoretical" azimuth and elevation rates were
generated assuming that the laser beam was aimed at the target.

b. The "theoretical" azimuth and elevation corrections needed to
keep the beam on target when the gun is used for aiming were
calculated.

c. Next the "unlocked" gun azimuth rates were computed using
the relationship below, where $AZ_{corr}$ is constantly changing.

$$\dot{AZ}_{gun} = \frac{V_{a/c}\,\cos(AZ_{laser} + AZ_{corr})}{RSL} \tag{V-1}$$

d. For the elevation rate of the gun, first the elevation of
the laser beam during target tracking was computed at each
measurement interval.

$$EL_{laser} = \arcsin(\text{Altitude}/RSL) \tag{V-2}$$

The elevation rate was then determined by differencing
elevation angles over succeeding measurement intervals, i.e.:

$$ELDOT_{laser} = \frac{EL_{laser}(t + dt) - EL_{laser}(t)}{dt} \tag{V-3}$$
The elevation correction based on the LOS rate is then

$$EL_{corr} = \frac{RSL \times ELDOT_{laser}}{V_{muz}} \tag{V-4}$$

e. The gravity drop correction was then added to this
correction. The gravity drop correction was based on the
algorithms used in the current BSTING system and is detailed in
table V-1.

TABLE V-1

BSTING GRAVITY DROP CORRECTION

$$EL_{gd} = 16.85 \times (BETA \times TOF)^{2} / RSL$$

Where:

$$TOF = \frac{RSL}{(1 - C \times RSL/4)^{2}\; VP}$$

$$VP = \text{Range Rate} + V_{muz}\,\cos(EL_{corr})\,\cos(AZ_{corr})$$

$$C = 1.5 \times \text{density} / WCDA$$

$$WCDA = 5\text{E-}6\; RSL^{2} - 0.036938\; RSL + 700.4299$$

$$BETA = 3\text{E-}9\; RSL^{2} - 4.706\text{E-}5\; RSL + 1.0216$$

$$RHO = 0.076479$$

f. The total elevation of the gun becomes

$$EL_{gun} = EL_{laser} + (EL_{corr} + EL_{gd}) \tag{V-5}$$
g. The gun elevation rate can be determined again by
differencing:

$$ELDOT_{gun} = \frac{EL_{gun}(t + dt) - EL_{gun}(t)}{dt} \tag{V-6}$$

h. Noise was added to the range and azimuth/elevation
measurements to simulate the gunner's ability to keep the laser
beam on the target. Again this was based on data obtained
during the BSTING flight test program.

i. Since the range data is measured only along the laser beam
axis, the same noisy range data was used for both cases.
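The elevation part of the procedure (steps d through f) can be sketched in Python as follows. The polynomial coefficients and RHO come from Table V-1, and the 2840 ft/s muzzle velocity is the value that appears in the LASERPR.BAS listing. The leading constant is taken here as g/2 = 16.085 ft/s², which is an assumption: the table prints 16.85 and the listings print 16.083. All function names and sample values are hypothetical.

```python
import math

V_MUZ = 2840.0      # ft/s, value used in the LASERPR.BAS listing
RHO = 0.076479      # density constant from Table V-1
HALF_G = 16.085     # ASSUMPTION: g/2 with g = 32.17 ft/s^2 (table prints 16.85)

def gravity_drop(rsl, range_rate, el_corr=0.0, az_corr=0.0):
    """Gravity drop correction in radians, following the form of Table V-1."""
    wcda = 5e-6 * rsl**2 - 0.036938 * rsl + 700.4299
    beta = 3e-9 * rsl**2 - 4.706e-5 * rsl + 1.0216
    c = 1.5 * RHO / wcda
    vp = range_rate + V_MUZ * math.cos(el_corr) * math.cos(az_corr)
    tof = rsl / vp / (1.0 - c * rsl / 4.0) ** 2
    return HALF_G * (beta * tof) ** 2 / rsl

def laser_elevation(alt, rsl):
    """Equation V-2."""
    return math.asin(alt / rsl)

def difference_rate(curr, prev, dt):
    """Equations V-3 and V-6: rate from two successive angles."""
    return (curr - prev) / dt

def gun_elevation(alt, rsl_prev, rsl_curr, range_rate, dt):
    """Equations V-2 through V-5 for one measurement interval."""
    el_prev = laser_elevation(alt, rsl_prev)
    el_curr = laser_elevation(alt, rsl_curr)
    eldot = difference_rate(el_curr, el_prev, dt)
    el_corr = rsl_curr * eldot / V_MUZ                    # V-4
    el_gd = gravity_drop(rsl_curr, range_rate, el_corr)
    return el_curr + (el_corr + el_gd)                    # V-5

if __name__ == "__main__":
    rsl = math.sqrt(3280.0**2 + 250.0**2)   # closest approach, 1000 m at 250 ft
    print(gravity_drop(rsl, 0.0) * 1000.0)  # milliradians
```

Applying `difference_rate` to two successive outputs of `gun_elevation` gives the gun elevation rate of step g.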


The five scenarios used in this investigation are summarized
in table V-2.

TABLE V-2

SCENARIOS

CASE    ALTITUDE    VELOCITY    MINIMUM RANGE
        (feet)      (knots)     (meters)

1       250         150         1000
2       250         175         2000
3       250         100         1500
4       250         75          500
5       250         200         1000
The same noise was used in the analysis of each scenario.
Again this was based on noise extracted from data obtained
during the BSTING flight test program and represents the
gunner's ability to keep the laser spot aimed on the target.
Figure V-1 shows typical noisy azimuth and elevation
rate measurements as were used in the subsequent analysis.

These  examples  are  for  measurement  along  laser  beam  axis.  Noise 


Figure V-1

Representative Azimuth and Elevation Rate Measurement
(Case 1 Shown - Laser Beam Axis)
The investigation shows a significant difference in the
time it takes to reach the point where the algorithms provide
an aiming solution within the +/- 4 milliradian criterion. See
Table V-3. This parameter is important for a "snap and shoot"
fire control system. In each case the results are based on
azimuth/elevation rates measured along the gun boresight axis.


TABLE V-3

SETTLING TIMES

            TIME TO REACH +/- 4 mr CRITERION
            (seconds)
SCENARIO    AZIMUTH    ELEVATION

1           2          1.5
2           13         21
3           15         2
4           2          1
5           2          2

The results of using rate sensors on the laser axis
versus on the gunsight axis are shown in figures V-2 to V-6 for
the azimuth correction and figures V-7 to V-11 for the
elevation correction. In most cases the gunsight based
corrections are not that different from laser based
corrections. In some cases, they were actually better.
Table V-4 shows the maximum aiming errors (after the +/- 4
milliradian criterion was reached) for laser and boresight based
sensor measurements.

TABLE V-4

MAXIMUM ERROR IN AIMING CORRECTIONS

                MAXIMUM ERROR (milliradians)
                MEASUREMENT AXIS
        LASER BEAM                GUN BORESIGHT
CASE    AZIMUTH    ELEVATION      AZIMUTH    ELEVATION

1       3          4              3          4
2       4          3              5          7
3       3          2              8          3.5
4       1.5        3              1          3
5       4.5        4              3          3

Again it appears that rate measurements made along the
boresight do not result in aiming errors much different from those
that would be made along the laser beam axis.
Figure V-4

Altitude = 250 ft, Velocity = 100 kts, Minimum Range = 1500 mtr

Figure V-6

Altitude = 250 ft, Velocity = 200 kts, Minimum Range = 1000 mtr

Figure V-11

Altitude = 250 ft, Velocity = 200 kts, Minimum Range = 1000 mtr
COMPUTER PROGRAM: LASERPR.BAS

PURPOSE: Computes the Theoretical Range and Azimuth/Elevation
         Rates along the Laser Beam Axis.

RESULTS: Used as Input to LASNOPR.BAS

10 OPEN "O", #1, "B:LASAZCO1.OUT"
20 OPEN "O", #2, "B:LASAZDT1.OUT"
30 OPEN "O", #3, "B:LASELDT1.OUT"
40 OPEN "O", #4, "B:LASDATA1.OUT"
50 OPEN "O", #5, "B:LASELCO1.OUT"
60 INPUT "Altitude (feet) = ?", H
70 INPUT "Aircraft Velocity (knots) = ?", VAC
80 VAC = VAC*88/60/.8684
90 INPUT "Range at Closest Approach (meters) = ?", RMIN
100 RMIN = RMIN*3.28
110 RBASE = RMIN - VAC*T
120 RSL = SQR(H^2 + RMIN^2 + RBASE^2)
130 RDOT = -RBASE*VAC/RSL
140 AZDOT = -RMIN*VAC/RSL/RSL
150 AZ = ATN(RBASE/RMIN)*180/3.1416
160 BOTTOM = SQR(RMIN^2 + RBASE^2)
170 EL = ATN(H/BOTTOM)*180/3.1417
180 ELDOT = (EL - EL1)*3.1416/180/.22
190 EL1 = EL
200 ELCORR = RSL*ELDOT/2840
210 ELCORR = -ATN(ELCORR/SQR(1 - ELCORR^2))
220 ELCORR = ELCORR*1000
230 AZCORR = RSL*AZDOT/2840/COS(ELCORR/1000)
240 AZCORR = -ATN(AZCORR/SQR(1 - AZCORR^2))
250 AZCORR = AZCORR*1000
260 VP = RDOT + 2840*COS(ELCORR/1000)*COS(AZCORR/1000)
270 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
280 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
290 RHO = .076479
300 C = 1.5*RHO/WCDA
310 TOF = RSL/VP/(1 - C*RSL/4)^2
320 ELGD = 16.083*(BETA*TOF)^2/RSL ' gravity drop, radians
330 ELGD = ELGD*1000
335 ELCORR = ELCORR + ELGD
340 PRINT #1, T, AZCORR
350 PRINT #2, T, AZDOT*1000
360 PRINT #3, T, ELDOT*1000
370 PRINT T, RSL, AZ, EL, AZDOT*1000, ELDOT*1000, AZCORR, ELCORR, RDOT, ELGD
380 PRINT #4, T, RSL, AZ, EL, AZDOT*1000, ELDOT*1000, AZCORR, ELCORR, RDOT, ELGD
390 PRINT #5, T, ELCORR
400 IF AZ < -42.6 THEN GOTO 430 ' source prints H here; AZ assumed by analogy with GUNPR.BAS
410 T = T + .22
420 GOTO 110
430 END


COMPUTER PROGRAM: GUNPR.BAS

PURPOSE: Computes the Theoretical Range and Azimuth/Elevation
         Rates along the Gun Boresight

RESULTS: Used as Input to GUNNOPR.BAS

10 OPEN "O", #1, "B:GUNAZ1.OUT"
20 OPEN "O", #2, "B:GUNAZDT1.OUT"
30 OPEN "O", #3, "B:GUNELDT1.OUT"
40 OPEN "O", #4, "B:GUNDATA1.OUT"
50 OPEN "O", #5, "B:GUNEL1.OUT"
60 INPUT "Altitude (feet) = ?", H
70 INPUT "Aircraft Velocity (knots) = ?", VAC
80 VAC = VAC*88/60/.8684
90 INPUT "Range at Closest Approach (meters) = ?", RMIN
100 RMIN = RMIN*3.28
110 RBASE = RMIN - VAC*T
120 RSL = SQR(H^2 + RMIN^2 + RBASE^2)
130 RDOT = -RBASE*VAC/RSL
140 BOTTOM = SQR(RMIN^2 + RBASE^2)
150 EL = ATN(H/BOTTOM)
160 ELDOT = (EL - EL1)/.22
170 EL1 = EL
180 ELCORR = RSL*ELDOT/2840
190 ELCORR = -ATN(ELCORR/SQR(1 - ELCORR^2))
200 VP = RDOT + 2840*COS(ELCORR)*COS(AZCORR)
210 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
220 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
230 RHO = .076479
240 C = 1.5*RHO/WCDA
250 TOF = RSL/VP/(1 - C*RSL/4)^2
260 ELGD = 16.083*(BETA*TOF)^2/RSL
270 ELCORRT = ELGD + ELCORR
280 ELGUN = EL + ELCORRT
290 ELDGUN = (ELGUN - ELGUN1)/.22
300 ELGUN1 = ELGUN
310 AZ = ATN(RBASE/RMIN)
320 AZDOT = -RMIN*VAC/RSL/RSL
330 AZCORR = RSL*AZDOT/2840/COS(ELCORR)
340 AZCORR = -ATN(AZCORR/SQR(1 - AZCORR^2))
350 AZDGUN = -VAC*COS(AZ*PI/180 - AZCORR)/RSL ' as printed; PI is not assigned in this listing
360 AZGUN = AZ + AZCORR
370 PRINT #1, T, AZGUN*180/3.1416
380 PRINT #2, T, AZDGUN*1000
390 PRINT #3, T, ELDGUN*1000
400 PRINT T, EL, ELCORRT, ELGUN
410 PRINT #4, T, AZGUN*180/3.1417, ELGUN*180/3.1416, AZDGUN*1000, ELDGUN*1000
420 PRINT #5, T, ELGUN*180/3.1416
430 AZ1 = AZ*180/3.1416
440 IF AZ1 < -42.6 THEN GOTO 470
450 T = T + .22
460 GOTO 110
470 END


COMPUTER PROGRAM: LASNOPR.BAS

PURPOSE: Adds Noise to the Theoretical Range and Azimuth/
         Elevation Measurements along Laser Beam Axis in
         LASERPR.BAS

RESULTS: Used as Input to FILTLAS.BAS

10 OPEN "I", #1, "B:NOISE.DAT"
20 OPEN "I", #2, "B:LASDATA1.OUT"
30 OPEN "O", #3, "B:NOISYAZ1.LAS"
40 OPEN "O", #4, "B:NOISYEL1.LAS"
50 OPEN "O", #5, "B:NOISYRG1.LAS"
60 INPUT #1, T, AZNOISE, ELNOISE, RNGNOISE
70 INPUT #2, T, RSL, AZ, EL, AZDOT, ELDOT, AZCORR, ELCORR, RDOT, ELGD
80 NOISERSL = RSL + RNGNOISE
90 NOISEAZ = AZDOT + AZNOISE
100 NOISEEL = ELDOT + ELNOISE
110 PRINT #3, T, NOISEAZ
120 PRINT #4, T, NOISEEL
130 PRINT #5, T, NOISERSL
140 PRINT T, NOISEAZ, NOISEEL, NOISERSL
150 GOTO 60


COMPUTER PROGRAM: GUNNOPR.BAS

PURPOSE: Adds Noise to the Theoretical Range and Azimuth/
         Elevation Measurements along Boresight Axis Determined
         in GUNPR.BAS

RESULTS: Used as Input to FILTGUN.BAS

10 OPEN "I", #1, "B:NOISE.DAT"
20 OPEN "I", #2, "B:GUNDATA1.OUT"
30 OPEN "O", #3, "B:NOISYAZ1.GUN"
40 OPEN "O", #4, "B:NOISYEL1.GUN"
60 INPUT #1, T, AZNOISE, ELNOISE, RNGNOISE
70 INPUT #2, T, AZGUN, ELGUN, AZDOT, ELDOT
80 NOISEAZ = AZDOT + AZNOISE
100 NOISEEL = ELDOT + ELNOISE
110 PRINT #3, T, NOISEAZ
120 PRINT #4, T, NOISEEL
140 PRINT T, NOISEAZ, NOISEEL, NOISERSL
150 GOTO 60

190 RHAT = SENRNG
200 AZDHAT = GXHAT/1000
210 ELDHAT = GYHAT/1000
220 GOSUB 870
230 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
240 R(2) = SENRNG
250 RHAT = .5*(R(2) + RHAT)
260 RHATD = .5*(SENRNG - RHAT)/DTR
270 AZDHAT = (AZDHAT + GXHAT/1000)/2
280 ELDHAT = (ELDHAT + GYHAT/1000)/2
290 GOSUB 870
300 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
310 R(3) = SENRNG
320 RHAT = (2*RHAT + SENRNG)/3
330 RHATD = .5*(SENRNG - R(2))/DTR
340 AZDHAT = (2*AZDHAT + GXHAT/1000)/3
350 ELDHAT = (2*ELDHAT + GYHAT/1000)/3
360 GOSUB 870
370 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
380 R(4) = SENRNG
390 RHAT = (3*RHAT + SENRNG)/4
400 RHATD = .5*(SENRNG - R(3))/DTR
410 AZDHAT = (3*AZDHAT + GXHAT/1000)/4
420 ELDHAT = (3*ELDHAT + GYHAT/1000)/4
430 GOSUB 870
440 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
450 R(5) = SENRNG
460 RHAT = (4*RHAT + SENRNG)/5
470 RHATD = .5*(SENRNG - R(4))/DTR
480 AZDHAT = (4*AZDHAT + GXHAT/1000)/5
490 ELDHAT = (4*ELDHAT + GYHAT/1000)/5
500 GOSUB 870
510 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
520 R(6) = SENRNG
530 RHAT = (5*RHAT + SENRNG)/6
540 RHATD = .5*(SENRNG - R(5))/DTR
550 AZDHAT = (5*AZDHAT + GXHAT/1000)/6
560 ELDHAT = (5*ELDHAT + GYHAT/1000)/6
570 GOSUB 870
580 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
590 R(7) = SENRNG
600 RHAT = (6*RHAT + SENRNG)/7
610 RHATD = .5*(SENRNG - R(6))/DTR
620 AZDHAT = (6*AZDHAT + GXHAT/1000)/7
630 ELDHAT = (6*ELDHAT + GYHAT/1000)/7
640 GOSUB 870
650 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
660 R(8) = SENRNG
670 RHAT = (7*RHAT + SENRNG)/8
680 RHATD = .5*(SENRNG - R(7))/DTR
690 AZDHAT = (7*AZDHAT + GXHAT/1000)/8
700 ELDHAT = (7*ELDHAT + GYHAT/1000)/8
710 GOSUB 870
720 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
730 R(9) = SENRNG
740 RHAT = (8*RHAT + SENRNG)/9
750 RHATD = .5*(SENRNG - R(8))/DTR
760 AZDHAT = (8*AZDHAT + GXHAT/1000)/9
770 ELDHAT = (8*ELDHAT + GYHAT/1000)/9
780 GOSUB 870
790 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
800 RHAT = RHAT*C1 + RHATD*C2 + SENRNG*KPR
810 TERM3 = (ELDHAT^2 + AZDHAT^2)*DTR
820 RHATD = RHAT*(TERM3 - KVR) + RHATD*C3 + SENRNG*KVR
830 GOSUB 1050
840 GOSUB 870
850 GOTO 790
860 END
870 ELCORR = RHAT*ELDHAT/2840
880 ELCORR = -ATN(ELCORR/SQR(1 - ELCORR^2))
890 AZCORR = RHAT*AZDHAT/2840
900 AZCORR = -ATN(AZCORR/(1 - AZCORR^2))
920 VP = RHATD + 2840*COS(ELCORR)*COS(AZCORR)
930 WCDA = .000005*RHAT^2 - .036938*RHAT + 700.429
940 BETA = 3E-09*RHAT^2 - 4.706E-05*RHAT + 1.0216
950 RHO = .076479
960 C = 1.5*RHO/WCDA
970 TOF = RHAT/VP/(1 - C*RHAT/4)^2
980 ELGD = 16.085*(BETA*TOF)^2/RHAT
981 ELCORR = ELCORR + ELGD
990 PRINT TIME, AZDHAT*1000, ELDHAT*1000, AZCORR*1000, ELCORR*1000
1000 PRINT #2, TIME, AZDHAT*1000
1010 PRINT #3, TIME, ELDHAT*1000
1020 PRINT #4, TIME, AZCORR*1000
1030 PRINT #5, TIME, ELCORR*1000
1040 RETURN
1050 ELDHAT = TERM1*(1! - 2!*RHATD*DTR/RHAT)*ELDHAT + KPE*GYHAT/1000
1060 AZDHAT = TERM2*(1! - 2!*RHATD*DTR/RHAT)*AZDHAT + KPA*GXHAT/1000
1070 RETURN
1080 END


COMPUTER PROGRAM: FILTLAS.BAS


PURPOSE: Computes the Azimuth and Elevation Aiming Corrections
         from Noisy Sensor Data Measured along Laser Beam Axis.
         Uses Data from LASNOPR.BAS

RESULTS: Figures of Azimuth/Elevation Rates and
         Azimuth/Elevation Aiming Corrections - Laser Axis

10 OPEN "I", #1, "B:NOISE1.LAS"
20 OPEN "O", #2, "B:AZDOT1.LAS"
30 OPEN "O", #3, "B:ELDOT1.LAS"
40 OPEN "O", #4, "B:AZCORR1N.LAS"
50 OPEN "O", #5, "B:ELCORR1N.LAS"
70 KPA = .1
80 KPE = .1
90 KPR = .1
100 KVR = .1
110 C1 = .9
120 TERM1 = .9
130 TERM2 = .9
140 DTR = .22
150 C2 = DTR*C1
160 C3 = 1! - KVR*DTR
170 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
180 R(1) = SENRNG
190 RHAT = SENRNG
200 AZDHAT = GXHAT/1000
210 ELDHAT = GYHAT/1000
220 GOSUB 870
230 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
240 R(2) = SENRNG
250 RHAT = .5*(R(2) + RHAT)
260 RHATD = .5*(SENRNG - RHAT)/DTR
270 AZDHAT = (AZDHAT + GXHAT/1000)/2
280 ELDHAT = (ELDHAT + GYHAT/1000)/2
290 GOSUB 870
300 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
310 R(3) = SENRNG
320 RHAT = (2*RHAT + SENRNG)/3
330 RHATD = .5*(SENRNG - R(2))/DTR
340 AZDHAT = (2*AZDHAT + GXHAT/1000)/3
350 ELDHAT = (2*ELDHAT + GYHAT/1000)/3
360 GOSUB 870
370 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
380 R(4) = SENRNG
390 RHAT = (3*RHAT + SENRNG)/4
400 RHATD = .5*(SENRNG - R(3))/DTR
410 AZDHAT = (3*AZDHAT + GXHAT/1000)/4
420 ELDHAT = (3*ELDHAT + GYHAT/1000)/4
430 GOSUB 870
440 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
450 R(5) = SENRNG
460 RHAT = (4*RHAT + SENRNG)/5
470 RHATD = .5*(SENRNG - R(4))/DTR
480 AZDHAT = (4*AZDHAT + GXHAT/1000)/5
490 ELDHAT = (4*ELDHAT + GYHAT/1000)/5
500 GOSUB 870
510 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
520 R(6) = SENRNG
530 RHAT = (5*RHAT + SENRNG)/6
540 RHATD = .5*(SENRNG - R(5))/DTR
550 AZDHAT = (5*AZDHAT + GXHAT/1000)/6
560 ELDHAT = (5*ELDHAT + GYHAT/1000)/6
570 GOSUB 870
580 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
590 R(7) = SENRNG
600 RHAT = (6*RHAT + SENRNG)/7
610 RHATD = .5*(SENRNG - R(6))/DTR
620 AZDHAT = (6*AZDHAT + GXHAT/1000)/7
630 ELDHAT = (6*ELDHAT + GYHAT/1000)/7
640 GOSUB 870
650 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
660 R(8) = SENRNG
670 RHAT = (7*RHAT + SENRNG)/8
680 RHATD = .5*(SENRNG - R(7))/DTR
690 AZDHAT = (7*AZDHAT + GXHAT/1000)/8
700 ELDHAT = (7*ELDHAT + GYHAT/1000)/8
710 GOSUB 870
720 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
730 R(9) = SENRNG
740 RHAT = (8*RHAT + SENRNG)/9
750 RHATD = .5*(SENRNG - R(8))/DTR
760 AZDHAT = (8*AZDHAT + GXHAT/1000)/9
770 ELDHAT = (8*ELDHAT + GYHAT/1000)/9
780 GOSUB 870
790 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
800 RHAT = RHAT*C1 + RHATD*C2 + SENRNG*KPR
810 TERM3 = (ELDHAT^2 + AZDHAT^2)*DTR
820 RHATD = RHAT*(TERM3 - KVR) + RHATD*C3 + SENRNG*KVR
830 GOSUB 1050
840 GOSUB 870
850 GOTO 790
860 END
870 ELCORR = RHAT*ELDHAT/2840
880 ELCORR = -ATN(ELCORR/SQR(1 - ELCORR^2))
890 AZCORR = RHAT*AZDHAT/2840
900 AZCORR = -ATN(AZCORR/(1 - AZCORR^2))
920 VP = RHATD + 2840*COS(ELCORR)*COS(AZCORR)
930 WCDA = .000005*RHAT^2 - .036938*RHAT + 700.429
940 BETA = 3E-09*RHAT^2 - 4.706E-05*RHAT + 1.0216
950 RHO = .076479
960 C = 1.5*RHO/WCDA
970 TOF = RHAT/VP/(1 - C*RHAT/4)^2
980 ELGD = 16.085*(BETA*TOF)^2/RHAT
981 ELCORR = ELCORR + ELGD
990 PRINT TIME, AZDHAT*1000, ELDHAT*1000, AZCORR*1000, ELCORR*1000
1000 PRINT #2, TIME, AZDHAT*1000
1010 PRINT #3, TIME, ELDHAT*1000
1020 PRINT #4, TIME, AZCORR*1000
1030 PRINT #5, TIME, ELCORR*1000
1040 RETURN
1050 ELDHAT = TERM1*(1! - 2!*RHATD*DTR/RHAT)*ELDHAT + KPE*GYHAT/1000
1060 AZDHAT = TERM2*(1! - 2!*RHATD*DTR/RHAT)*AZDHAT + KPA*GXHAT/1000
1070 RETURN
1080 END


COMPUTER PROGRAM: FILTGUN.BAS

PURPOSE: Computes the Azimuth and Elevation Aiming Corrections
         from Noisy Sensor Data Measured along Gun Boresight.
         Uses Data from GUNNOPR.BAS

RESULTS: Figures of Azimuth/Elevation Rates and
         Azimuth/Elevation Aiming Corrections - Boresight Axis

10 OPEN "I", #1, "B:NOISE1.GUN"
20 OPEN "O", #2, "B:AZDOT1.GUN"
30 OPEN "O", #3, "B:ELDOT1.GUN"
40 OPEN "O", #4, "B:AZCORR1N.GUN"
50 OPEN "O", #5, "B:ELCORR1N.GUN"
70 KPA = .03
80 KPE = .03
90 KPR = .1
100 KVR = .1
110 C1 = .9
120 TERM1 = .97
130 TERM2 = .97
140 DTR = .22
150 C2 = DTR*C1
160 C3 = 1! - KVR*DTR
170 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
180 R(1) = SENRNG


CHAPTER VI

GUN-BASED VERSUS LASER-BASED MEASUREMENTS
Investigation of Initial Conditions

In the previous chapters, it was found that aiming
accuracy was essentially the same whether measurements were
made along the gun boresight or along the laser axis. The gun
was unlocked from the laser beam for the entire time the gunner
was aiming at the target during this analysis.

Next investigated was the effect of "locking" the gun to
the laser beam during the first two seconds, while the initial
solution is computed. This follows the technique used in the
current BSTING system. After the initial solution was
computed, the gun was "unlocked" so that measurements could be
used to provide a continuously updated solution, with the
computed aiming corrections serving as initial conditions.
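
The two-phase behavior described above can be sketched in a few lines. This is only an illustration, assuming that the "initial solution" corresponds to the nine-sample running average at the start of the FILTLAS.BAS and FILTGUN.BAS listings (nine samples at 0.22 s is roughly two seconds) and that the continuous update is the first-order recursion that follows it. The function name and arguments are hypothetical, and the range-rate coupling factor used in the listings is left out for brevity.

```python
def filter_rates(samples_mrad, term=0.9, kp=0.1, n_init=9):
    """Return filtered angular-rate estimates in rad/s.

    samples_mrad: noisy rate measurements in mrad/s, one per time step.
    The first n_init samples are combined as a running mean (assumed
    "initial solution" phase); later samples use the recursive update
    est = term*est + kp*measurement (range-rate factor omitted).
    """
    estimates = []
    est = 0.0
    for k, meas in enumerate(samples_mrad):
        x = meas / 1000.0  # mrad/s -> rad/s, as in the listings
        if k < n_init:
            est = (k * est + x) / (k + 1)  # running mean of k+1 samples
        else:
            est = term * est + kp * x      # continuous update phase
        estimates.append(est)
    return estimates
```

With term + kp = 1 the recursion has unity steady-state gain, so a constant input passes through unchanged in both phases.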


The results are shown in figures VI-1 through VI-5. In these
cases, the same five scenarios were used as reported in the
previous chapter. There was no significant change in
the aiming errors, and in most cases the system could meet the
+/- 4 milliradian criterion.

Figure VI-1

Altitude = 250 ft, Velocity = 150 kts, Minimum Range = 1000 mtr

Figure VI-2

Altitude = 250 ft, Velocity = 175 kts, Minimum Range = 2000 mtr

Figure VI-3

Altitude = 250 ft, Velocity = 100 kts, Minimum Range = 1500 mtr

Figure VI-4

Altitude = 250 ft, Velocity = 75 kts, Minimum Range = 500 mtr

Figure VI-5

Altitude = 250 ft, Velocity = 200 kts, Minimum Range = 1000 mtr

(Plot axes: milliradians versus TIME - seconds.)


CHAPTER VII


GUN BORESIGHT VERSUS LASER BASED MEASUREMENTS
Theoretical Considerations

A brief analysis was conducted to determine the magnitude
of the theoretical error that results when the elevation and
azimuth rates are measured along the gun boresight axis versus
the laser LOS axis.

AZIMUTH CORRECTION

Referring to the following figure:

[Figure: azimuth-plane geometry showing the TARGET and the angle AZ_corr]

If azimuth rate is measured along the laser beam axis

    AZ_{corr,los} = -\frac{RSL}{V_{muz}\cos(EL_{corr})}\,\dot{AZ}_{los}    (VII-1)

However, as in the BSTING system, when the measurements are
made along the gunsight axis

    AZ_{corr,gun} = -\frac{RSL}{V_{muz}\cos(EL_{corr})}\,\dot{AZ}_{gun}    (VII-2)

The gravity drop correction is the most significant part of the
total elevation correction and much larger than the elevation rate
correction, at least for straight and level flight. Thus
it can be assumed that the cosine of the elevation correction
is essentially the same for laser-based and gun-based
measurements. Therefore,

    AZ_{corr,error} = -\frac{RSL}{V_{muz}\cos(EL_{corr})}\,(\dot{AZ}_{los} - \dot{AZ}_{gun})    (VII-3)

Now,

    AZ_{los} = AZ_{gun} + AZ_{corr}

so

    \dot{AZ}_{corr} = \dot{AZ}_{los} - \dot{AZ}_{gun}

Taking the differential of equation VII-1

    \dot{AZ}_{corr} = -\frac{1}{V_{muz}\cos(EL_{corr})}\,(\dot{RSL}\,\dot{AZ}_{los} + RSL\,\ddot{AZ}_{los})    (VII-4)

Where

    RSL = \sqrt{H^2 + R_{min}^2 + (R_{min} - V_{a/c}\,t)^2}

    \dot{RSL} = -V_{a/c}\,(R_{min} - V_{a/c}\,t)/RSL

    V_{a/c} = Platform Velocity;  R_{min} = Range at Closest Approach

    \dot{AZ}_{los} = V_{a/c}\,R_{min}/RSL^2

    \ddot{AZ}_{los} = -2\,V_{a/c}\,R_{min}\,\dot{RSL}/RSL^3


ELEVATION CORRECTION

Referring to the following figure

[Figure: elevation-plane geometry]

    EL_{los} = \arcsin(H/RSL)

    EL_{gd} = f(TOF)

    EL_{gun} = EL_{los} + EL_{gd}    (VII-5)

If measurement of elevation is made along laser axis

    EL_{corr,los} = -RSL\,\dot{EL}_{los}/V_{muz}    (VII-6)

However, in the BSTING system

    EL_{corr,gun} = -RSL\,\dot{EL}_{gun}/V_{muz}    (VII-6)

Thus the error in the elevation correction becomes

    EL_{corr,error} = -RSL\,(\dot{EL}_{los} - \dot{EL}_{gun})/V_{muz}    (VII-7)

Substituting equation VII-5 into VII-7

    EL_{corr,error} = RSL\,\dot{EL}_{gd}/V_{muz}    (VII-8)

The results of these theoretical approximations are shown
in figures VII-1 through VII-5. Again the scenarios used are
those summarized in Table V-2. The magnitudes of these errors
are roughly in agreement with the aiming errors resulting from
noisy data representing actual measurements made by gunners.
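
As a rough check on the derivation, the error expressions of this chapter can be evaluated numerically. The sketch below is an assumption-laden illustration rather than the report's program: the function name and arguments are hypothetical, lengths are in feet and angles in radians, the sign convention for the azimuth rate follows the ERRORPR.BAS listing as best it can be read, and the gravity-drop rate is passed in by the caller instead of being differenced from a time-of-flight model.

```python
import math

V_MUZ = 2840.0  # muzzle velocity in ft/s, the constant used in the listings

def theoretical_errors(h_ft, v_ac_fps, r_min_ft, t, el_corr=0.0, el_gd_dot=0.0):
    """Return (azimuth error, elevation error) in radians at time t."""
    r_base = r_min_ft - v_ac_fps * t
    rsl = math.sqrt(h_ft**2 + r_min_ft**2 + r_base**2)
    rsl_dot = -v_ac_fps * r_base / rsl
    az_dot = v_ac_fps * r_min_ft / rsl**2
    az_ddot = -2.0 * v_ac_fps * r_min_ft * rsl_dot / rsl**3
    k = 1.0 / (V_MUZ * math.cos(el_corr))
    # Rate of change of the azimuth correction (differential of VII-1)
    az_corr_dot = -k * (rsl_dot * az_dot + rsl * az_ddot)
    # Azimuth error: the rate difference in VII-3 equals az_corr_dot
    az_err = -rsl * k * az_corr_dot
    # Elevation error from the gravity-drop rate (VII-8)
    el_err = rsl * el_gd_dot / V_MUZ
    return az_err, el_err
```

At the point of closest approach the range rate vanishes, so the azimuth error computed this way passes through zero there.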


Figure VII-1

Altitude = 250 ft, Velocity = 150 kts, Minimum Range = 1000 mtr

Figure VII-5

Altitude = 250 ft, Velocity = 200 kts, Minimum Range = 1000 mtr

(Curves in each figure: AZIMUTH and ELEVATION; ordinate in milliradians.)


COMPUTER PROGRAM: ERRORPR.BAS


PURPOSE: Computes the Theoretical Azimuth and Elevation Aiming
         Correction Error when Azimuth/Elevation Rate
         Measurements are Made along Gun Boresight Axis versus
         Laser Beam Axis.

RESULTS: Figures VII-1 to VII-5

10 OPEN "O", #1, "B:AZCORRERR.CS4"
20 OPEN "O", #2, "B:ELCORRER.CS4"
30 PI = 3.1416
40 G = 32.17
45 VMUZ = 2840
50 INPUT "Aircraft Velocity (knots) = ?", VAC
60 VAC = VAC*88/60/.8684
70 INPUT "Minimum Range (meters) = ?", RMIN
80 RMIN = RMIN*3.28
90 INPUT "Altitude (feet) = ?", H
100 RBASE = RMIN - VAC*T
110 RSL = SQR(H^2 + RMIN^2 + RBASE^2)
120 AZ = ATN(RBASE/RMIN)
130 AZ1 = AZ*180/PI
140 EL = ATN(H/SQR(RMIN^2 + RBASE^2))
150 EL1 = EL*180/PI
160 RSLDOT = -VAC*RBASE/RSL
170 AZDOT = VAC*RMIN/RSL/RSL
180 AZDOT2 = -2*VAC*RMIN*RSLDOT/RSL^3
190 ELDOT = -H*RSLDOT/RSL^2
200 ELCORR = -RSL*ELDOT/2840
210 VP = RSLDOT + 2840*COS(ELCORRT)*COS(AZCORR)
220 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
230 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
235 RHO = .076479
240 C = 1.5*RHO/WCDA
250 TOF = RSL/VP/(1 - C*RSL/4)^2
260 ELGD = G*(BETA*TOF)^2/RSL/2
270 ELGDDOT = (ELGD - ELGD1)/.2
280 ELGD1 = ELGD
290 ELCORRT = ELCORR + ELGD
300 AZCORRDOT = -RSLDOT*AZDOT/COS(ELCORRT)/VMUZ - RSL*AZDOT2/VMUZ/COS(ELCORRT)
310 AZCORRERR = -RSL*AZCORRDOT/VMUZ/COS(ELCORRT)
320 ELCORRERR = RSL*ELGDDOT/VMUZ
330 PRINT T, AZ1, EL1, AZCORRERR*1000, ELCORRERR*1000
340 PRINT #1, T, AZCORRERR*1000
350 PRINT #2, T, ELCORRERR*1000
360 IF AZ1 < -43 THEN GOTO 400
370 T = T + .2
380 GOTO 100
400 END


CHAPTER VIII

EFFECT  OF  APPROXIMATIONS  IN  GRAVITY  DROP  CORRECTION 

The simple BSTING algorithms do not use the elevation angle
in gravity correction. Referring to this figure, the gravity drop
correction is:

    h_{gd} = g\,t_f^2/2

Where:

    t_f is the time-of-flight

    EL_{corr,gd} = \arctan(h_{gd}\cos\theta / RSL)

and \theta is the elevation angle.

Two approximations are made in the BSTING algorithms

    (1)  \tan EL_{corr,gd} \approx EL_{corr,gd}

    (2)  \cos\theta \approx 1

Figure VIII-1 shows the elevation angle versus ground range
and altitude of the gun platform above the target. The
resulting error introduced by these two approximations
is shown in figure VIII-2. For altitudes of less than 1000
feet, the error is below 1 milliradian, and in most cases
significantly less. If the system is used at higher altitudes,
then the elevation angle should be accounted for in the
correction. This could be done by measuring either the
elevation angle or the actual altitude.
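
The size of the error introduced by the two approximations can be illustrated with a short calculation. This is a minimal sketch, assuming the time-of-flight is supplied directly (the ELERRPR.BAS listing instead derives an effective time from its ballistic fit); the function name and arguments are hypothetical, with lengths in feet and angles in radians.

```python
import math

G = 32.17  # gravitational acceleration in ft/s^2, as in the listings

def gravity_drop_error(t_f, rsl_ft, el_rad):
    """Return (approximate - exact) gravity-drop correction in radians."""
    h_gd = 0.5 * G * t_f**2                       # gravity drop in feet
    exact = math.atan(h_gd * math.cos(el_rad) / rsl_ft)
    approx = h_gd / rsl_ft                        # tan(x) ~ x and cos(theta) ~ 1
    return approx - exact
```

For a level line of sight only the small-angle term contributes and the difference is tiny; the difference grows as the elevation angle increases, which matches the trend the chapter describes for higher altitudes.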
Figure VIII-1

Elevation Angle as a Function of Altitude and Ground Range
(Ordinate: ELEVATION ANGLE - degrees.)


Figure VIII-2

Error in Elevation Aiming Correction Resulting from
Neglecting the Actual Elevation Angle
(Also the Small Angle Approximation)
(Ordinate: milliradians; abscissa: GROUND RANGE - meters;
curves for 250 ft, 500 ft, 1000 ft, 1500 ft, and 2000 ft.)


COMPUTER PROGRAM: ELERRPR.BAS

PURPOSE: Computes the Elevation Angles as a Function of
         Altitude and Ground Range. Also Calculates the Error
         in the Elevation Aiming Correction Resulting from
         Neglecting the Elevation Angle (\cos\theta = 1) and by
         Using the Small Angle Approximation
         (\tan EL_{corr,gd} = EL_{corr,gd})

RESULTS: Fig. VIII-1 and Fig. VIII-2

10 OPEN "O", #1, "C:ELCORRER.100"
20 OPEN "O", #2, "C:EL.100"
30 G = 32.17
40 PI = 3.1416
50 VMUZ = 2840
60 INPUT "Minimum Ground Range (meters) = ?", RMIN
70 INPUT "Altitude (feet) = ?", H
80 RBASE = RMIN
90 RSL = SQR(H^2 + (RBASE*3.28)^2)
100 EL = ATN(H/RBASE/3.28)
110 VP = VMUZ
120 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
130 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
135 RHO = .076479
140 C = 1.5*RHO/WCDA
150 TOF = RSL/VP/(1 - C*RSL/4)^2
160 ELGD = G*(BETA*TOF)^2/RSL/2
170 ELGD1 = ATN(ELGD*COS(EL))
180 DIFF = ELGD - ELGD1
190 PRINT RBASE, TOF, ELGD*1000, ELGD1*1000, DIFF*1000, EL*180/PI
200 PRINT #1, RBASE, DIFF*1000
205 PRINT #2, RBASE, EL*180/PI
210 IF RBASE > 3750 THEN GOTO 240
220 RBASE = RBASE + 500
230 GOTO 90
240 END


CHAPTER IX

FIRE  CONTROL  SYSTEM  USING  EXTERNAL  MEASUREMENTS 
Theoretical  Design 

An  alternate  design  of  a  fire  control  system  would  use 
external  measurement  of  parameters  in  lieu  of  (or  in  addition 
to)  sensors  mounted  on  the  gun  itself  as  is  the  case  with 
the  current  BSTING  system.  The  goal  is  a  more  responsive  fire 
control system that can provide "point and shoot" capability.

SYSTEM  CONCEPT 


This  initial  system  design  uses  the  following  sensors  for 
measuring  the  parameters  needed  to  make  the  required  azimuth 
and  elevation  aiming  corrections. 

a.  The  slant  range  would  be  measured  by  a  laser  rangefinder. 

b.  Angular  resolvers  mounted  on  the  gun  mount  would  sense  the 
azimuth  and  elevation  of  the  gun  with  respect  to  the 
platform.  As  in  the  BSTING  system,  angular  measurements  are 
made  along  the  gun  boresight  axis  rather  than  the  more 
desirable  laser  beam  axis. 

c.  An inertial platform would also be needed to reference
the platform with respect to inertial space in the vertical and
horizontal directions.

d.  Platform velocity would be obtained from the aircraft
navigation system or externally from the Global Positioning
System (GPS).

Again the key source of error in this system, as with the
BSTING system, is that the azimuth and elevation data have to be
measured with respect to the gun boresight rather than the
laser beam axis. An analysis was conducted to determine the
magnitude of this error.

LOS Range. In this FCS the laser rangefinder would be
measuring the slant range to the target. For straight and level
flight, the range measured would be based on the
relationship:

    RSL^2 = H^2 + RMIN^2 + (RMIN - VAC \times t)^2    (IX-1)

Where

    H = Altitude above target
    RMIN = Ground range at point of closest approach
    VAC = Aircraft velocity
    t = Time

LOS Elevation and Azimuth Angles. The elevation angle is
determined from (refer to figure IX-1a)

    EL_{los} = \arcsin(H/RSL)    (IX-2)

and the azimuth angle is (refer to figure IX-1b)

    AZ_{los} = \arctan((RMIN - VAC \times t)/RMIN)    (IX-3)

Incidentally, unlike the BSTING system where the azimuth plane
is defined along the gun boresight azimuth, in this system both
the elevation and azimuth planes are defined in inertial space.
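
The line-of-sight geometry of equations IX-1 through IX-3 can be written as a small helper. This is a sketch only: the function name is hypothetical, all lengths are assumed to be in the same unit, velocity is in that unit per second, and angles are returned in radians.

```python
import math

def los_geometry(h, r_min, v_ac, t):
    """Return (slant range, LOS elevation, LOS azimuth) at time t."""
    r_base = r_min - v_ac * t                        # along-track distance
    rsl = math.sqrt(h**2 + r_min**2 + r_base**2)     # IX-1
    el_los = math.asin(h / rsl)                      # IX-2
    az_los = math.atan(r_base / r_min)               # IX-3
    return rsl, el_los, az_los
```

At t = 0 the azimuth is 45 degrees, and it falls to zero at the point of closest approach, where the slant range is smallest.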

Figure IX-1

Geometry For Theoretical Analysis
(Panels: a. ELEVATION PLANE; b. AZIMUTH PLANE)

Elevation Aiming. In the elevation plane, the correction was
determined from:

    EL_{corr} = 0.5\,G \cos(EL_{los})\,(\beta\,T_f)^2 / RSL    (IX-4)

Where:

    T_f = RSL / [V_p\,(1 - c \times RSL/4)^2]

    c = 1.5\,Density/WCDA

    WCDA = 0.000005\,RSL^2 - 0.036938\,RSL + 700.4299

    \beta = 3E-09\,RSL^2 - 4.706E-05\,RSL + 1.0216

    Density = 0.07479

Azimuth Aiming. Referring to figure IX-1b, the
required corrections in the azimuth plane are determined by:

    \sin(AZ_{gun}) = \frac{V_p \sin(AZ_{las}) \cos(EL_{las}) - V_{a/c}}{V_{muz} \cos(EL_{las} + EL_{gd})}    (IX-5)

    V_p^2 = V_{muz}^2 + 2\,V_{muz}\,V_{a/c} \sin(AZ_{las}) \cos(EL_{las}) + V_{a/c}^2
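
Equations IX-4 and IX-5 can be combined into one routine, in the same order the LASTHEPR.BAS listing applies them. The sketch below is illustrative, not the report's implementation: the function name is hypothetical, lengths are in feet, velocities in ft/s, angles in radians, and `math.asin` stands in for the listing's ATN(x/SQR(1 - x^2)) idiom.

```python
import math

G = 32.17          # ft/s^2
V_MUZ = 2840.0     # muzzle velocity, ft/s
DENSITY = 0.07479  # air density constant printed in this chapter

def aiming_corrections(rsl, az_las, el_las, v_ac):
    """Return (azimuth correction, elevation correction) in radians."""
    v_p = math.sqrt(V_MUZ**2
                    + 2.0 * V_MUZ * v_ac * math.sin(az_las) * math.cos(el_las)
                    + v_ac**2)
    wcda = 0.000005 * rsl**2 - 0.036938 * rsl + 700.4299
    beta = 3e-09 * rsl**2 - 4.706e-05 * rsl + 1.0216
    c = 1.5 * DENSITY / wcda
    tof = rsl / v_p / (1.0 - c * rsl / 4.0)**2
    # Gravity-drop elevation correction (IX-4)
    el_gd = 0.5 * G * math.cos(el_las) * (beta * tof)**2 / rsl
    # Gun azimuth from the velocity triangle (IX-5)
    sin_az_gun = ((v_p * math.sin(az_las) * math.cos(el_las) - v_ac)
                  / (V_MUZ * math.cos(el_gd + el_las)))
    az_gun = math.asin(sin_az_gun)
    return az_las - az_gun, el_gd
```

For a stationary platform looking straight ahead the azimuth correction vanishes, and at a slant range of 3000 ft the gravity-drop term comes out at several milliradians.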


In this initial investigation, the inertial platform was
not included, as it was assumed that the aircraft was flying
parallel to the earth. The scenarios shown in Table IX-I were
used to test this system.


TABLE IX-I
SCENARIOS

CASE    ALTITUDE    VELOCITY    MINIMUM RANGE
        (feet)      (knots)     (meters)

1       250         150         1000
2       250         175         2000
3       250         100         1500
4       250          75          500
5       250         200         1000

These  scenarios  are  the  same  ones  used  in  evaluating 
the  BSTING  system,  as  reported  in  the  previous  chapters. 
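
The programs convert these scenario inputs to feet and feet per second before use (VAC*88/60/.8684 and RMIN*3.28). A small sketch of those conversions follows; the helper names and the `SCENARIOS` list are hypothetical conveniences, while the numeric factors are the ones printed in the listings.

```python
def knots_to_fps(knots):
    """Knots -> ft/s: divide by 0.8684 to get mph, then mph * 88/60 = ft/s."""
    return knots * 88.0 / 60.0 / 0.8684

def meters_to_feet(meters):
    """Meters -> feet using the listings' factor of 3.28."""
    return meters * 3.28

# (case, altitude ft, velocity kts, minimum range m) from Table IX-I
SCENARIOS = [
    (1, 250, 150, 1000),
    (2, 250, 175, 2000),
    (3, 250, 100, 1500),
    (4, 250, 75, 500),
    (5, 250, 200, 1000),
]

def scenario_in_program_units(case):
    """Return (altitude ft, velocity ft/s, minimum range ft) for a case number."""
    for number, alt_ft, vel_kts, rmin_m in SCENARIOS:
        if number == case:
            return alt_ft, knots_to_fps(vel_kts), meters_to_feet(rmin_m)
    raise ValueError("unknown scenario")
```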

The results are shown in figures IX-3 through IX-12.

The curves marked "Laser" represent the desired correction, in
other words the correction that theoretically provides
the greatest accuracy. The curves marked "GUN" represent the
actual corrections that result when azimuth and elevation
angles are measured along the boresight axis. The difference
between these two values represents the "UNCORRECTED" error in
aiming that results when the boresight-based azimuth and
elevation angle measurements are used to compute corrections.
Reviewing these figures, the maximum aiming errors are shown in
Table IX-II.

TABLE IX-II

MAXIMUM AIMING ERRORS (UNCORRECTED)

SCENARIO    AZIMUTH    ELEVATION
               (milliradians)

1           -3.5       -0.2
2           -4.5       -0.8
3           -1.4       -0.15
4           -1.0        0.0
5           +/-6.0     -0.35

It  can  be  seen  from  this  data  that  while  errors  in 
elevation  are  minimal,  they  are  significant  in  azimuth. 
Therefore,  a  scheme  is  required  to  correct  the  azimuth  aiming 
corrections.  As  an  initial  technique,  azimuth/elevation 
measurements  along  the  gun  boresight  axis  were  corrected  by  the 
value  of  the  previous  correction  before  they  were  used  to 
compute the next correction, i.e.

    AZGUN_{k+1} = AZGUN_k + AZCORR_k

and

    ELGUN_{k+1} = ELGUN_k + ELCORR_k

Figure IX-2 is a block schematic diagram showing how this
technique might be implemented.
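
One way to read this technique in code is a loop that adds the previous step's corrections to each new boresight measurement before the correction routine is called. The sketch below is an interpretation under stated assumptions: `compute_correction` is a hypothetical callable standing in for the correction equations, the first step uses zero for the previous correction, and the sign convention follows the relations given in the text.

```python
def run_corrected(measurements, compute_correction):
    """Apply the previous correction to each boresight measurement.

    measurements: iterable of (az_gun, el_gun, rsl) tuples.
    compute_correction: callable (az, el, rsl) -> (az_corr, el_corr).
    Returns the list of (az_corr, el_corr) produced at each step.
    """
    prev_az_corr = 0.0
    prev_el_corr = 0.0
    history = []
    for az_gun, el_gun, rsl in measurements:
        az_in = az_gun + prev_az_corr   # AZGUN corrected by previous AZCORR
        el_in = el_gun + prev_el_corr   # ELGUN corrected by previous ELCORR
        prev_az_corr, prev_el_corr = compute_correction(az_in, el_in, rsl)
        history.append((prev_az_corr, prev_el_corr))
    return history
```

Any function with the same signature can be passed in, which keeps the feedback loop separate from the ballistic model.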


[Figure IX-2: block schematic diagram]

The results of implementing this correction technique are
shown in figures IX-3 through IX-12 and identified as
"CORRECTED" corrections and aiming error. As can be seen, this
technique corrects the error, at least for the theoretical case
without noisy data. Next, the technique will be investigated in
a simulated gunner-operated system where the measurements are
noisy.

Figure IX-4

Altitude = 250 ft; Aircraft Velocity = 150 kts;
Minimum Range = 1000 mtr

Figure IX-6

Altitude = 250 ft; Aircraft Velocity = 175 kts;
Minimum Range = 2000 mtr

Figure IX-7

Altitude = 250 ft; Aircraft Velocity = 100 kts;
Minimum Range = 1500 mtr

Altitude = 250 ft; Aircraft Velocity = 100 kts;
Minimum Range = 1500 mtr

Figure IX-9

Altitude = 250 ft; Aircraft Velocity = 75 kts;
Minimum Range = 500 mtr

Figure IX-10

Figure IX-11

Altitude = 250 ft; Aircraft Velocity = 200 kts;
Minimum Range = 1000 mtr


COMPUTER PROGRAM: LOSPR.BAS

PURPOSE: Computes the Line-of-Sight Azimuth/Elevation Angle
         and Range of the Target with Respect to Platform.
         These Parameters are "Measured" along Laser Axis.

RESULTS: Used in LASTHEPR.BAS and GUNDATPR.BAS

10 OPEN "O", #1, "B:LOSDATA.CS5"
20 PI = 3.1416
30 DT = .2
40 G = 32.17
50 INPUT "Altitude (feet) = ?", H
60 INPUT "Aircraft Velocity (knots) = ?", VAC
70 VAC = VAC*88/60/.8684
80 INPUT "Minimum Ground Range to Target = ?", RMIN
90 RMIN = RMIN*3.28
100 RBASE = RMIN - VAC*T
110 RSL = SQR(H^2 + RMIN^2 + RBASE^2)
120 AZLOS = ATN(RBASE/RMIN)
130 ELLOS = ATN(H/SQR(RBASE^2 + RMIN^2))
140 PRINT #1, T, AZLOS, ELLOS, RSL
142 PRINT T, AZLOS*180/PI, ELLOS*180/PI, RSL
150 IF AZLOS*180/PI < -42 THEN GOTO 180
160 T = T + DT
170 GOTO 100
180 END


COMPUTER PROGRAM: LASTHEPR.BAS


PURPOSE: Computes the Azimuth and Elevation Corrections Based
         on Measurements Made along Laser Beam Axis.

RESULTS: Used in Figures IX-2 through IX-11

10 OPEN "I", #1, "B:LOSDATA.CS5"
20 OPEN "O", #2, "B:LASAZCOR.CS5"
30 OPEN "O", #3, "B:LASELCOR.CS5"
40 OPEN "O", #4, "B:OUTLAS.CS5"
50 PI = 3.1417
60 G = 32.17
70 VMUZ = 2840
80 INPUT "Aircraft Velocity = ?", VAC
90 VAC = VAC*88/60/.8684
100 INPUT #1, T, AZLAS, ELLAS, RSL
110 VP = SQR(VMUZ^2 + 2*VMUZ*VAC*SIN(AZLAS)*COS(ELLAS) + VAC^2)
120 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
130 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
140 RHO = .07479
150 C = 1.5*RHO/WCDA
160 TOF = RSL/VP/(1 - C*RSL/4)^2
170 ELGD = 16.085*COS(ELLAS)*(BETA*TOF)^2/RSL
180 ELCORR = ELGD
190 AZGUN = VP*SIN(AZLAS)*COS(ELLAS) - VAC
200 AZGUN = AZGUN/VMUZ/COS(ELGD + ELLAS)
210 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
220 AZCORR = AZLAS - AZGUN
230 PRINT T, AZLAS*180/PI, AZCORR*1000, ELCORR*1000, TOF, VP
240 PRINT #2, T, AZCORR*1000
250 PRINT #3, T, ELCORR*1000
260 PRINT #4, T, AZCORR*1000, ELCORR*1000
280 GOTO 100


COMPUTER PROGRAM: GUNDATPR.BAS


PURPOSE: Computes Azimuth and Elevation Angles along the Gun
         Boresight Axis. Uses Equation IX-4 and IX-5.

RESULTS: Used in GUNTHEPR.BAS and GUNCORPR.BAS

10 OPEN "I", #1, "B:LOSDATA.CS5"
20 OPEN "O", #2, "B:GUNDATA.CS5"
30 PI = 3.1417
40 G = 32.17
50 VMUZ = 2840
60 INPUT "Aircraft Velocity = ?", VAC
70 VAC = VAC*88/60/.8684
80 INPUT #1, T, AZLAS, ELLAS, RSL
90 VP = SQR(VMUZ^2 + 2*VMUZ*VAC*SIN(AZLAS)*COS(ELLAS) + VAC^2)
100 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
110 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
120 RHO = .07479
130 C = 1.5*RHO/WCDA
140 TOF = RSL/VP/(1 - C*RSL/4)^2
150 ELGD = 16.085*COS(ELLAS)*(BETA*TOF)^2/RSL
160 ELCORR = ELGD
170 AZGUN = VP*SIN(AZLAS)*COS(ELLAS) - VAC
180 AZGUN = AZGUN/VMUZ/COS(ELGD + ELLAS)
190 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
200 ELGUN = ELLAS + ELGD
210 PRINT T, AZGUN*180/PI, ELGUN*180/PI, RSL
220 PRINT #2, T, AZGUN, ELGUN, RSL
230 GOTO 80


COMPUTER PROGRAM: GUNTHEPR.BAS


PURPOSE:  Computes  the  Azimuth  and  Elevation  Corrections  Based 

on  Azimuth  and  Elevation  Angle  Measurements  Made 
along  Gun  Boresight  Axis 

RESULTS:  Figures  IX-3  through  IX-12 


010  OPEN  "I",  #1,  "B: GUNDATA.CS 5" 

20  OPEN  "0%  #2,  M B : GUNAZTHE .  CS  5  " 

30  OPEN  "0%  13,  " B: GUNELTHE.CS 5 " 

40  OPEN  "0",  #4,  WB: OUTTHE.CS 5" 

50  PI  =  3.1417 
60  G  =  32.17 
70  VMUZ  =  2840 

80  INPUT  "Aircraft  Velocity  =  7",  VAC 

90  VAC  »  VAC* 8 8/60/ .8684 

100  INPUT  81,  T,  AZLAS,  ELLAS,  RSL 

110  VP  »  SQR(VMUZ~2  +  2*VMUZ*VAC*SIN( AZLAS ) *COS ( ELLAS )  ♦  VAC~2) 

120  WCDA  =  . 000005*RSL~2  -  .036938*RSL  +  700.4299 

130  BETA  »  3E-09*RSL~2  -  4 . 706E-05*RSL  +  1.0216 

140  RHO  a  .07479 

150  C  =  1 . 5*RHO/WCDA 

160  TOP  =  RSL/VP/ ( 1  -  C*RSL/4 ) ~2 

170  ELGD  »  16. 085*COS( ELLAS ) * ( BETA*TOF ) ~2/RSL 

180  ELCORR  a  ELGD 

190  AZGUN  a  VP*S I N ( AZLAS ) *COS ( ELLAS )  -  VAC 
200  AZGUN  »  AZGUN/VMUZ/COS(ELGD  +  ELLAS) 

210  AZGUN  »  ATN(AZGUN/SQR( 1  -  AZGUN~2)) 

220  AZCORR  =  AZLAS  -  AZGUN 

230  PRINT  T,  AZLAS* 180/PI ,  AZCORR*1000,  ELCORR*1000,  TOF,  VP 

240  PRINT  82,  T,  AZCORR*1000 

250  PRINT  83,  T,  ELCORR*1000 

260  PRINT  84,  T,  AZCORR*1000,  ELCORR*1000 

270  GOTO  100 
COMPUTER PROGRAM: GUNCORPR.BAS

PURPOSE: Computes the Azimuth and Elevation Corrections Based
on Azimuth and Elevation Angle Measurements Made
along Gun Boresight Axis, but Corrects these
Measurements using the Previous Correction before
Making the Computation.

RESULTS: Figures IX-3 through IX-12

10 OPEN "I", #1, "B:GUNDATA.CS5"
20 OPEN "O", #2, "B:GUNAZCOR.CS5"
30 OPEN "O", #3, "B:GUNELCOR.CS5"
40 OPEN "O", #4, "B:OUTCOR.CS5"
50 PI = 3.1417
60 G = 32.17
70 VMUZ = 2840
80 INPUT "Aircraft Velocity = ?", VAC
90 VAC = VAC*88/60/.8684
100 INPUT #1, T, AZLAS, ELLAS, RSL
110 AZLAS = AZLAS + AZCORR
120 ELLAS = ELLAS - ELCORR
130 VP = SQR(VMUZ^2 + 2*VMUZ*VAC*SIN(AZLAS)*COS(ELLAS) + VAC^2)
140 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
150 BETA = 3E-09*RSL^2 - 4.706E-05*RSL + 1.0216
160 RHO = .07479
170 C = 1.5*RHO/WCDA
180 TOF = RSL/VP/(1 - C*RSL/4)^2
190 ELGD = 16.085*COS(ELLAS)*(BETA*TOF)^2/RSL
200 ELCORR = ELGD
210 AZGUN = VP*SIN(AZLAS)*COS(ELLAS) - VAC
220 AZGUN = AZGUN/VMUZ/COS(ELGD + ELLAS)
230 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
240 AZCORR = AZLAS - AZGUN
250 PRINT T, AZLAS*180/PI, AZCORR*1000, ELCORR*1000, TOF, VP
260 PRINT #2, T, AZCORR*1000
270 PRINT #3, T, ELCORR*1000
280 PRINT #4, T, AZCORR*1000, ELCORR*1000
290 GOTO 100
COMPUTER PROGRAM: CORRECTP.BAS

PURPOSE: (a) Computes Nominal Slant Range and Azimuth/Elevation
Angles for the LOS; (b) Adds Noise to Nominal Slant
Range and Azimuth/Elevation Angles; (c) Computes
Noisy Gun Boresight Azimuth/Elevation Angles; (d)
Filters Range and Azimuth/Elevation Measurements;
(e) Computes Azimuth/Elevation Aiming Corrections;
(f) Computes Error in the Aiming Correction.

RESULTS: Error in Azimuth and Elevation Aiming Corrections,
as well as Individual Contributions to these Errors.

10 OPEN "I", #1, "C:NOISEDAT.DAT"
20 OPEN "O", #2, "B:AZCORRN.C01"
30 OPEN "O", #3, "B:AZTOTDIF.C01"
40 OPEN "O", #4, "B:ELTOTDIF.C01"
50 OPEN "O", #5, "B:ELCORRDI.C01"
60 RHO = .07479
65 G = 32.17
70 DT = .017
80 VMUZ = 2840
90 PI = 3.1416
100 KPR = .5
110 KVR = .5
120 KPA = .05
130 KVA = .05
140 KPE = .05
150 KVE = .05
160 INPUT "Aircraft Altitude (feet) = ?", H
170 INPUT "Aircraft Nominal Velocity (knots) = ?", VEL
180 VAC = VEL*88/60/.8684
190 INPUT "Range at Closest Approach (meters) = ?", RMIN
200 RMIN = RMIN*3.28
210 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
220 GOSUB 940
240 GOSUB 1150
250 PRINT #2, T, AZCORR*1000
260 PRINT #4, T, ELGD*1000
270 RHAT = RSLN
280 RHAT1 = RHAT
290 VACHAT = VAC
300 AZHAT = AZLOSN
310 AZHAT1 = AZHAT
320 ELHAT = ELLOSN
330 ELHAT1 = ELLOSN
350 GOSUB 1250
360 AZDIFF = AZCORR - AZCORRN
370 ELDIFF = ELGD - ELGDN
371 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
380 PRINT #3, T, AZDIFF*1000
390 PRINT #5, T, ELDIFF*1000
400 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
420 GOSUB 940
430 GOSUB 1150
440 PRINT #2, T, AZCORR*1000
450 PRINT #4, T, ELGD*1000
460 RHAT = RHAT + (RSLN - RHAT)/2
470 RDHAT = (RHAT - RHAT1)/DT
480 RHAT1 = RHAT
490 VACHAT = VACHAT + (VACN - VACHAT)/2
500 AZHAT = AZHAT + (AZLOSN - AZHAT)/2
510 AZDHAT = (AZHAT - AZHAT1)/DT
520 AZHAT1 = AZHAT
530 ELHAT = ELHAT + (ELLOSN - ELHAT)/2
540 ELDHAT = (ELHAT - ELHAT1)/DT
550 ELHAT1 = ELHAT
560 GOSUB 1250
565 AZDIFF = AZCORR - AZCORRN
566 ELDIFF = ELGD - ELGDN
570 PRINT #3, T, AZDIFF*1000
580 PRINT #5, T, ELDIFF*1000
581 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
600 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
610 GOSUB 940
620 GOSUB 1150
630 PRINT #2, T, AZCORR*1000
640 PRINT #4, T, ELGD*1000
650 RHAT = RHAT + (RSLN - RHAT)/3
660 RDHAT = (RHAT - RHAT1)/DT
670 RHAT1 = RHAT
680 VACHAT = VACHAT + (VACN - VACHAT)/3
690 AZHAT = AZHAT + (AZLOSN - AZHAT)/3
700 AZDHAT = (AZHAT - AZHAT1)/DT
710 AZHAT1 = AZHAT
720 ELHAT = ELHAT + (ELLOSN - ELHAT)/3
730 ELDHAT = (ELHAT - ELHAT1)/DT
740 ELHAT1 = ELHAT
750 GOSUB 1250
755 AZDIFF = AZCORR - AZCORRN
756 ELDIFF = ELGD - ELGDN
760 PRINT #3, T, AZDIFF*1000
770 PRINT #5, T, ELDIFF*1000
771 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
790 K = 3
800 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
810 GOSUB 940
820 GOSUB 1150
830 PRINT #2, T, AZCORR*1000
840 PRINT #4, T, ELGD*1000
850 GOSUB 1070
860 VACHAT = VACHAT + (VACN - VACHAT)/(K + 1)
870 GOSUB 1250
875 AZDIFF = AZCORR - AZCORRN
876 ELDIFF = ELGD - ELGDN
880 PRINT #3, T, AZDIFF*1000
890 PRINT #5, T, ELDIFF*1000
891 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
900 IF AZLOS < 0! THEN GOTO 1350
910 T = T + .017
920 K = K + 1
930 GOTO 800
940 RSL = SQR(H^2 + RMIN^2 + (RMIN - VAC*T)^2)
950 RSLN = RSL + RGNOISE
960 ELLOS = H/RSL
970 ELLOS = ATN(ELLOS/SQR(1 - ELLOS^2))
980 ELLOSN = ELLOS + ELNOISE/RSL
990 AZLOS = ATN((RMIN - VAC*T)/RMIN)
1000 AZLOSN = AZLOS + AZNOISE/RSL
1010 VP = VMUZ^2 + 2*VMUZ*VAC*SIN(AZLOS)*COS(ELLOS) + VAC^2
1020 VP = SQR(VP)
1030 VACN = VAC - RGNOISE*VAC/460
1040 VPN = VMUZ^2 + 2*VMUZ*VACN*SIN(AZLOSN)*COS(ELLOSN) + VACN^2
1050 VPN = SQR(VPN)
1060 RETURN
1070 OMEGA = SQR(AZDHAT^2 + ELDHAT^2)
1080 RHAT = (1 - KPR)*RHAT + DT*(1 - KPR)*RDHAT + KPR*RSLN
1090 RDHAT = (DT*OMEGA^2 - KVR)*RHAT + (1 - DT*KVR)*RDHAT + KVR*RSLN
1100 AZHAT = (1 - KPA)*AZHAT + KPA*AZLOSN + DT*(1 - KPA)*AZDHAT
1110 AZDHAT = -KVA*AZHAT + (1 - DT*KVA - 2*RDHAT*DT/RHAT)*AZDHAT + KVA*AZLOSN
1120 ELHAT = (1 - KPE)*ELHAT + KPE*ELLOSN + DT*(1 - KPE)*ELDHAT
1130 ELDHAT = -KVE*ELHAT + (1 - DT*KVE - 2*RDHAT*DT/RHAT)*ELDHAT + KVE*ELLOSN
1140 RETURN
1150 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
1160 BETA = 3E-09*RSL^2 - 4.0706E-05*RSL + 1.0216
1170 C = 1.5*RHO/WCDA
1180 TOF = RSL/VP/(1 - C*RSL/4)^2
1190 ELGD = .5*G*COS(ELLOS)*(BETA*TOF)^2/RSL
1200 AZGUN = VP*SIN(AZLOS)*COS(ELLOS) - VAC
1210 AZGUN = AZGUN/VMUZ/COS(ELLOS + ELGD)
1220 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
1230 AZCORR = AZLOS - AZGUN
1240 RETURN
1250 WCDA = .000005*RHAT^2 - .036938*RHAT + 700.4299
1260 BETA = 3E-09*RHAT^2 - 4.0706E-05*RHAT + 1.0216
1270 C = 1.5*RHO/WCDA
1280 TOF = RHAT/VPN/(1 - C*RHAT/4)^2
1290 ELGDN = .5*G*COS(ELHAT)*(BETA*TOF)^2/RHAT
1300 AZGUNN = VPN*SIN(AZHAT)*COS(ELHAT) - VACN
1310 AZGUNN = AZGUNN/VMUZ/COS(ELHAT + ELGDN)
1320 AZGUNN = ATN(AZGUNN/SQR(1 - AZGUNN^2))
1330 AZCORRN = AZHAT - AZGUNN
1340 RETURN
1350 END
CHAPTER X

FIRE CONTROL SYSTEM USING EXTERNAL MEASUREMENTS
Analysis with Simulated Noisy Data

A fire control system that uses an external measurement of
aircraft velocity, a laser rangefinder, angular resolvers
aligned with the gun boresight for azimuth/elevation LOS, and
an inertial platform was detailed in the previous chapter. That
analysis showed that by adding back the previous azimuth and
elevation correction to the measurements made along the gun
boresight, the system performs essentially as if the
measurements were made along the laser beam axis, as required.

However, the previous analysis of this system did not use the
"noisy" measurements that result when a gunner is aiming the
laser beam via the gun mounted in a moving helicopter.

Thus, in this analysis, "noisy" range and angular measurements
were included to provide a more realistic operating environment.

The noisy range data was based on laser range measurements
recorded during the BSTING flight test program conducted during
August and September of 1990.

The noisy range data was the same as that used in the previous
investigations to evaluate improvements to the BSTING system.
Figure X-1 shows the zero-mean noise added to the nominal range
data.


Figure X-1

Zero-Mean Range Noise
During this same test program, the location of the laser
spot on the target was measured by a low light level TV camera.
This data was used here to determine a representative noise
profile for measurements made by the azimuth and elevation
angular resolvers mounted on the gun. The noise extracted from
the flight test data for a typical pass is shown in figures X-2
and X-3. It was assumed that the gunner kept the spot within
the approximately 40 ft x 40 ft area of the target, and thus the
noise data was "faired" in during the periods when the laser
beam was off and its position was not recorded. Like the
range noise, the azimuth/elevation noise was normalized to have
a zero mean value. These azimuth/elevation noise measurements
were then divided by the instantaneous nominal slant range to
obtain angular noise values, which were then added to the
nominal azimuth and elevation angles determined in the
previous chapter.
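
The geometry and noise injection described above can be sketched in Python. This is an illustrative sketch only, patterned on subroutine 940 of the CORRECTP.BAS listing; the function and argument names are hypothetical, and the azimuth/elevation noise inputs are assumed to be linear offsets in feet.

```python
import math

def los_geometry(t, h_ft, rmin_ft, vac_fps):
    """Nominal slant range (ft) and LOS azimuth/elevation (rad) at time t."""
    # Along-track distance remaining to the point of closest approach.
    along = rmin_ft - vac_fps * t
    # Slant-range expression as printed in the listing (line 940).
    rsl = math.sqrt(h_ft**2 + rmin_ft**2 + along**2)
    # asin(x) is what the BASIC idiom ATN(x/SQR(1 - x^2)) computes.
    el = math.asin(h_ft / rsl)
    az = math.atan(along / rmin_ft)
    return rsl, az, el

def add_noise(rsl, az, el, az_noise_ft, el_noise_ft, rg_noise_ft):
    """Range noise adds directly; linear az/el noise is divided by slant range."""
    return rsl + rg_noise_ft, az + az_noise_ft / rsl, el + el_noise_ft / rsl
```

With the scenario used in this chapter, `los_geometry(0.0, 250.0, 3280.0, vac)` starts the pass at an azimuth of 45 degrees, and the azimuth reaches zero when `vac * t` equals the range at closest approach.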

In this method, the aircraft velocity is supplied from an
external source. It was assumed that this velocity contains a
certain amount of noise. The noise was modeled by assuming
that it was 5 percent of the noise in the range measurements
and of opposite sign, i.e.

VELN = VEL*(1 - 0.05*NOISE_range)
Since azimuth and elevation angles rather than azimuth
and elevation rates are being measured, a new set of filters
was needed to smooth the noisy range and azimuth/elevation
data. These are described here:

Range Filter

RHAT(k+1) = (1 - KPR)*RHAT(k) + (1 - KPR)*RDHAT(k+1)*DT
            + KPR*RSLN

RDHAT(k+1) = (OMEGA^2*DT - KVR)*RHAT(k) + (1 - KVR*DT)*RDHAT(k)
             + KVR*RSLN

Azimuth Angle Filter

AZHAT(k+1) = (1 - KPA)*AZHAT(k) + (1 - KPA)*AZDHAT(k+1)*DT
             + KPA*AZGUN

AZDHAT(k+1) = -KVA*AZHAT(k)
              + [1 - KVA*DT - 2*DT*RDHAT(k)/RHAT(k)]*AZDHAT(k)
              + KVA*AZLOSN

Elevation Angle Filter

ELHAT(k+1) = (1 - KPE)*ELHAT(k) + (1 - KPE)*ELDHAT(k+1)*DT
             + KPE*ELGUN

ELDHAT(k+1) = -KVE*ELHAT(k)
              + [1 - KVE*DT - 2*DT*RDHAT(k)/RHAT(k)]*ELDHAT(k)
              + KVE*ELLOSN

WHERE: OMEGA^2 = AZDHAT(k)^2 + ELDHAT(k)^2

RSLN = Noisy Range Data

AZLOSN = Noisy Azimuth Data (Measured along Gun
Boresight then Corrected Using Previous
Azimuth Correction Value.)

ELLOSN = Noisy Elevation Data (Measured along Gun
Boresight then Corrected Using Previous
Elevation Correction Value.)

DT = Time increment between noisy data samples.


Aircraft Velocity Filter


VACHAT(k+1) = VACHAT(k) + [VACN - VACHAT(k)]/(k + 1)
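
As a rough illustration of the fixed-gain range, azimuth and elevation filters above, the following Python sketch performs one update. It mirrors the statement order of subroutine 1070 in the CORRECTP.BAS listing, so each line sees the values already updated above it. The dictionary layout and names (`filter_step`, `GAINS`) are assumptions made for the example, not part of the original program.

```python
def filter_step(s, rsln, azlosn, ellosn, g, dt):
    """One fixed-gain update of the range/azimuth/elevation estimates.

    s holds the current estimates: r, rd (range and rate), az, azd, el, eld.
    rsln, azlosn, ellosn are the noisy range and corrected angle measurements.
    """
    omega2 = s["azd"]**2 + s["eld"]**2          # OMEGA^2 from the old angular rates
    r = (1 - g["kpr"])*s["r"] + dt*(1 - g["kpr"])*s["rd"] + g["kpr"]*rsln
    # As in the listing, the rate updates use the freshly updated positions.
    rd = (dt*omega2 - g["kvr"])*r + (1 - dt*g["kvr"])*s["rd"] + g["kvr"]*rsln
    az = (1 - g["kpa"])*s["az"] + g["kpa"]*azlosn + dt*(1 - g["kpa"])*s["azd"]
    azd = (-g["kva"]*az + (1 - dt*g["kva"] - 2*rd*dt/r)*s["azd"]
           + g["kva"]*azlosn)
    el = (1 - g["kpe"])*s["el"] + g["kpe"]*ellosn + dt*(1 - g["kpe"])*s["eld"]
    eld = (-g["kve"]*el + (1 - dt*g["kve"] - 2*rd*dt/r)*s["eld"]
           + g["kve"]*ellosn)
    return {"r": r, "rd": rd, "az": az, "azd": azd, "el": el, "eld": eld}

# Gains and sample interval as they appear in the listing.
GAINS = {"kpr": 0.5, "kvr": 0.5, "kpa": 0.05, "kva": 0.05,
         "kpe": 0.05, "kve": 0.05}
DT = 0.017
```

A quick sanity check on this form: if the rates are zero and the measurements equal the current estimates, the update leaves every estimate unchanged.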


The  filtered  data  was  then  used  in  the  following  algorithms 
to  determine  the  azimuth  and  elevation  corrections. 


Elevation Angle Correction


ELGD = 0.5*G*COS(ELHAT)*(BETA*TOF)^2/RHAT

TOF = RHAT/[VPN*(1 - c*RHAT/4)^2]

c = 1.5*density/WCDA

WCDA = .000005*RHAT^2 - 0.036938*RHAT + 700.4299

Beta = 3E-09*RHAT^2 - 4.706E-05*RHAT + 1.0216

density = 0.07479


ELCORR = ELGD


Azimuth Angle Correction


Sin(AZGUN) = [VPN*Sin(AZHAT)*Cos(ELHAT) - V_a/c]
             / [V_muz*Cos(ELLOSN + ELGD)]

VPN^2 = V_muz^2 + 2*V_muz*V_a/c*SIN(AZLOSN)*COS(ELLOSN) + V_a/c^2

AZCORR = AZLOSN - AZGUN
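
The correction algorithm above can be sketched as a single Python function. This is a hedged illustration rather than the original code: the function name and argument order are hypothetical, the constants are taken from the equations and listings in this chapter (G = 32.17, muzzle velocity 2840 ft/sec, density 0.07479), and one set of range and angles is used throughout instead of separate filtered and noisy values.

```python
import math

G = 32.17        # ft/sec^2
RHO = 0.07479    # air density used in the listings
VMUZ = 2840.0    # muzzle velocity, ft/sec

def aiming_corrections(rsl, az, el, vac):
    """Return (azimuth correction, elevation correction, time of flight).

    rsl is slant range in feet, az and el are LOS angles in radians,
    vac is aircraft velocity in ft/sec.
    """
    vp = math.sqrt(VMUZ**2 + 2*VMUZ*vac*math.sin(az)*math.cos(el) + vac**2)
    wcda = 0.000005*rsl**2 - 0.036938*rsl + 700.4299
    beta = 3e-09*rsl**2 - 4.706e-05*rsl + 1.0216
    c = 1.5*RHO/wcda
    tof = rsl/vp/(1 - c*rsl/4)**2
    elgd = 0.5*G*math.cos(el)*(beta*tof)**2/rsl          # gravity drop angle
    sin_azgun = (vp*math.sin(az)*math.cos(el) - vac)/VMUZ/math.cos(el + elgd)
    azgun = math.asin(sin_azgun)
    return az - azgun, elgd, tof
```

For a stationary platform pointing straight at the target (`vac = 0`, `az = 0`) the azimuth correction vanishes and only the gravity-drop elevation correction remains.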

Starting Values

For each parameter, the first three values were computed by:

"X"HAT(k+1) = "X"HAT(k) + ["X"N - "X"HAT(k)]/(k + 1)

where k = 0, 1, 2.

Starting values for rate parameters were determined by:

"X"DHAT(k+1) = ["X"HAT(k+1) - "X"HAT(k)]/dt
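
The start-up rule amounts to a running mean of the first few noisy samples, with the rate taken as a finite difference of successive means. A minimal Python sketch, with a hypothetical function name and the samples passed in as a list:

```python
def startup_estimates(samples, dt):
    """Running-mean start-up for one parameter.

    samples: the first noisy measurements "X"N (three in this chapter).
    Returns the last position estimate and the last rate estimate.
    """
    xhat = samples[0]        # k = 0: the estimate is the first sample
    prev = xhat
    xdhat = 0.0
    for k in range(1, len(samples)):
        xhat = xhat + (samples[k] - xhat)/(k + 1)   # running mean
        xdhat = (xhat - prev)/dt                    # finite-difference rate
        prev = xhat
    return xhat, xdhat
```

For example, `startup_estimates([3.0, 6.0, 9.0], 0.5)` gives a position estimate of 6.0 (the mean of the three samples) and a rate estimate of 3.0.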

Only one scenario was used in this investigation:

Va/c = 150 knots (helicopter velocity)

Rmin = 1000 meters (range at closest approach)

H = 250 feet (altitude above target)
A brief investigation was done to determine the "best"
values for the filtering coefficients KPR, KVR, KPA, KPE,
KVA and KVE. The results are shown in figures X-4 through X-9
in terms of the difference between the nominal and filtered
noisy data. From this analysis, the following coefficient
values were used in the subsequent analysis.


KVR = 0.5

KPR = 0.5

KPA = 0.05

KVA = 0.05

KVE = 0.05

Figure X-4

Unfiltered Differences between Nominal and Noisy Azimuth
and Elevation Angle Measurements
Figure X-5

Filtered Differences between Nominal and Noisy Azimuth and
Elevation Angle Measurements. (KPA = KPR = KPE = KVE = 0.1)

Figure X-6

Filtered Differences between Nominal and Noisy Azimuth and
Elevation Angle Measurements. (KPA = KPR = KPE = KVE = 0.05)

Figure X-7

Filtered Differences between Nominal and Noisy Azimuth and
Elevation Angle Measurements. (KPA = KPR = KPE = KVE = 0.01)

Figure X-8

Filtered Aircraft Velocity

Figure X-9

Filtered Slant Range (KVR = KPR = 0.5)

The results of this investigation are shown in figures X-10
and X-11 in the form of the computed azimuth and elevation
corrections based on the noisy data versus the required
corrections for accurate aiming. Because of the large sampling
rate, the azimuth range used started at 45 degrees off target
and ended at 0 degrees with respect to the target.

For this fire control system, there are four measurements
that contribute to the total error. These are the errors in
slant range, aircraft velocity, LOS azimuth and LOS elevation
angles. This system also requires an inertial platform, but the
error contribution of the inertial platform was not included
in this initial investigation.
Figure X-11

Total Correction in Elevation Plane - Computed versus Required
These individual contributions, as well as the total error,
are shown in figures X-12 and X-13 for the azimuth and
elevation aiming corrections respectively. In both cases,
the graphs show the difference between the computed corrections
and the required corrections. By far, the largest error comes
from errors in the aircraft velocity. This can also be discerned
from the algorithm used to determine the azimuth correction.
Incidentally, in this investigation no error in vertical velocity
was assumed. In a real situation, there would be some error in
vertical velocity and consequently an increase in elevation
aiming error.


Figure X-12

Error Contributions - Azimuth Plane
Figure X-13

Error Contributions - Elevation Plane

Since aircraft velocity is the largest contributor to the
total azimuth aiming error, an investigation was made into the
sensitivity of the total aiming errors to errors in the aircraft
velocity measurement. Again the same scenario was used, with two
azimuth pointing directions: 45 and 0 degrees off of
RMIN (range of closest approach). The results are shown in
figures X-14 and X-15.
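
The sensitivity calculation can be sketched in Python along the lines of the VELSENSI.BAS listing: the azimuth correction is computed once with the true velocity and once with the velocity inflated by a fixed percentage, and the difference is reported in milliradians. The function names are hypothetical and, as in that listing, the gravity-drop term is left out of the azimuth correction.

```python
import math

VMUZ = 2840.0                      # muzzle velocity, ft/sec
KNOTS_TO_FPS = 88/60/0.8684        # conversion factor used in the listings

def az_correction(vac, az, el):
    """Azimuth correction (rad) for aircraft velocity vac in ft/sec."""
    vp = math.sqrt(VMUZ**2 + 2*VMUZ*vac*math.sin(az)*math.cos(el) + vac**2)
    sin_azgun = (vp*math.sin(az)*math.cos(el) - vac)/VMUZ/math.cos(el)
    return az - math.asin(sin_azgun)

def az_error_mrad(vel_knots, percent, az_deg, el_deg):
    """Azimuth aiming error (mrad) caused by a fractional velocity error."""
    az = math.radians(az_deg)
    el = math.radians(el_deg)
    vac = vel_knots*KNOTS_TO_FPS
    true_corr = az_correction(vac, az, el)
    bad_corr = az_correction(vac*(1 + percent), az, el)
    return 1000*(bad_corr - true_corr)

# Sweep 0 to 250 knots in 10-knot steps for a 10 percent velocity error.
sweep = [(v, az_error_mrad(v, 0.1, 0.0, 0.0)) for v in range(0, 251, 10)]
```

Under these assumptions a 10 percent velocity error at 150 knots, looking along the range of closest approach, works out to roughly 9 milliradians of azimuth error.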
Figure X-14

Sensitivity of the Total Azimuth Aiming Error
to Errors in Aircraft Velocity Measurements

Figure X-15

Sensitivity of the Total Elevation Aiming Error
to Errors in Aircraft Velocity Measurements
CONCLUSIONS:


While this fire control system does reduce the sensitivity
to azimuth and elevation errors due to gunner pointing
difficulties, the system requires very accurate
measurement of the aircraft velocity. From figure X-15, it can
be concluded that aircraft velocity would have to be measured
to an accuracy better than 2 percent.

Also not investigated in this study, but required in an
actual system, are the complexity and inaccuracies of the
inertial platform and altitude measurements that would also
have to be used with this system.
COMPUTER PROGRAM: CORRECTP.BAS

PURPOSE: (a) Computes Nominal Slant Range and Azimuth/Elevation
Angles for the LOS; (b) Adds Noise to Nominal Slant
Range and Azimuth/Elevation Angles; (c) Computes
Noisy Gun Boresight Azimuth/Elevation Angles; (d)
Filters Range and Azimuth/Elevation Measurements;
(e) Computes Azimuth/Elevation Aiming Corrections;
(f) Computes Error in the Aiming Correction.

RESULTS: Error in Azimuth and Elevation Aiming Corrections,
as well as Individual Contributions to these Errors.

10 OPEN "I", #1, "C:NOISEDAT.DAT"
20 OPEN "O", #2, "B:AZCORRN.C01"
30 OPEN "O", #3, "B:AZTOTDIF.C01"
40 OPEN "O", #4, "B:ELTOTDIF.C01"
50 OPEN "O", #5, "B:ELCORRDI.C01"
60 RHO = .07479
65 G = 32.17
70 DT = .017
80 VMUZ = 2840
90 PI = 3.1416
100 KPR = .5
110 KVR = .5
120 KPA = .05
130 KVA = .05
140 KPE = .05
150 KVE = .05
160 INPUT "Aircraft Altitude (feet) = ?", H
170 INPUT "Aircraft Nominal Velocity (knots) = ?", VEL
180 VAC = VEL*88/60/.8684
190 INPUT "Range at Closest Approach (meters) = ?", RMIN
200 RMIN = RMIN*3.28
210 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
220 GOSUB 940
240 GOSUB 1150
250 PRINT #2, T, AZCORR*1000
260 PRINT #4, T, ELGD*1000
270 RHAT = RSLN
280 RHAT1 = RHAT
290 VACHAT = VAC
300 AZHAT = AZLOSN
310 AZHAT1 = AZHAT
320 ELHAT = ELLOSN
COMPUTER PROGRAM: VELSENSI.BAS

PURPOSE: Computes the Azimuth Aiming Error that Results from
Errors in Measuring Aircraft Velocity. Allows
Different Percentages of Error and Aircraft
Velocities from 0 to 250 knots.

RESULTS: Azimuth Aiming Errors as a Function of Aircraft
Velocity and Percentage Error in Velocity.

10 OPEN "O", #1, "B:VEL10%.C45"
20 PERCENT = .1
30 PI = 3.1417
40 G = 32.17
50 VMUZ = 2840
60 INPUT "LOS Azimuth wrt Platform (deg) = ?", AZ
70 INPUT "LOS Elevation wrt Platform (deg) = ?", EL
80 AZ1 = AZ*PI/180
90 EL1 = EL*PI/180
100 VAC = VEL*88/60/.8684
110 GOSUB 210
120 AZCORRTH = AZCORR
130 VAC = VAC*(1 + PERCENT)
140 GOSUB 210
150 AZCORRERR = AZCORR
160 AZDIFF = -(AZCORRTH - AZCORRERR)
170 PRINT VEL, AZCORRTH*1000, AZCORRERR*1000, AZDIFF*1000
175 PRINT #1, VEL, AZDIFF*1000
180 IF VEL > 240 THEN GOTO 280
190 VEL = VEL + 10
200 GOTO 100
210 VP = VMUZ^2 + 2*VMUZ*VAC*SIN(AZ1)*COS(EL1) + VAC^2
220 VP = SQR(VP)
230 AZGUN = VP*SIN(AZ1)*COS(EL1) - VAC
240 AZGUN = AZGUN/VMUZ/COS(EL1)
250 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
260 AZCORR = AZ1 - AZGUN
270 RETURN
280 END
330 ELHAT1 = ELLOSN
350 GOSUB 1250
360 AZDIFF = AZCORR - AZCORRN
370 ELDIFF = ELGD - ELGDN
371 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
380 PRINT #3, T, AZDIFF*1000
390 PRINT #5, T, ELDIFF*1000
400 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
420 GOSUB 940
430 GOSUB 1150
440 PRINT #2, T, AZCORR*1000
450 PRINT #4, T, ELGD*1000
460 RHAT = RHAT + (RSLN - RHAT)/2
470 RDHAT = (RHAT - RHAT1)/DT
480 RHAT1 = RHAT
490 VACHAT = VACHAT + (VACN - VACHAT)/2
500 AZHAT = AZHAT + (AZLOSN - AZHAT)/2
510 AZDHAT = (AZHAT - AZHAT1)/DT
520 AZHAT1 = AZHAT
530 ELHAT = ELHAT + (ELLOSN - ELHAT)/2
540 ELDHAT = (ELHAT - ELHAT1)/DT
550 ELHAT1 = ELHAT
560 GOSUB 1250
565 AZDIFF = AZCORR - AZCORRN
566 ELDIFF = ELGD - ELGDN
570 PRINT #3, T, AZDIFF*1000
580 PRINT #5, T, ELDIFF*1000
581 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
600 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
610 GOSUB 940
620 GOSUB 1150
630 PRINT #2, T, AZCORR*1000
640 PRINT #4, T, ELGD*1000
650 RHAT = RHAT + (RSLN - RHAT)/3
660 RDHAT = (RHAT - RHAT1)/DT
670 RHAT1 = RHAT
680 VACHAT = VACHAT + (VACN - VACHAT)/3
690 AZHAT = AZHAT + (AZLOSN - AZHAT)/3
700 AZDHAT = (AZHAT - AZHAT1)/DT
710 AZHAT1 = AZHAT
720 ELHAT = ELHAT + (ELLOSN - ELHAT)/3
730 ELDHAT = (ELHAT - ELHAT1)/DT
740 ELHAT1 = ELHAT
750 GOSUB 1250
755 AZDIFF = AZCORR - AZCORRN
756 ELDIFF = ELGD - ELGDN
760 PRINT #3, T, AZDIFF*1000
770 PRINT #5, T, ELDIFF*1000
771 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
790 K = 3
800 INPUT #1, T, AZNOISE, ELNOISE, RGNOISE
810 GOSUB 940
820 GOSUB 1150
830 PRINT #2, T, AZCORR*1000
840 PRINT #4, T, ELGD*1000
850 GOSUB 1070
860 VACHAT = VACHAT + (VACN - VACHAT)/(K + 1)
870 GOSUB 1250
875 AZDIFF = AZCORR - AZCORRN
876 ELDIFF = ELGD - ELGDN
880 PRINT #3, T, AZDIFF*1000
890 PRINT #5, T, ELDIFF*1000
891 PRINT T, AZLOS*180/PI, AZDIFF*1000, VAC
900 IF AZLOS < 0! THEN GOTO 1350
910 T = T + .017
920 K = K + 1
930 GOTO 800
940 RSL = SQR(H^2 + RMIN^2 + (RMIN - VAC*T)^2)
950 RSLN = RSL + RGNOISE
960 ELLOS = H/RSL
970 ELLOS = ATN(ELLOS/SQR(1 - ELLOS^2))
980 ELLOSN = ELLOS + ELNOISE/RSL
990 AZLOS = ATN((RMIN - VAC*T)/RMIN)
1000 AZLOSN = AZLOS + AZNOISE/RSL
1010 VP = VMUZ^2 + 2*VMUZ*VAC*SIN(AZLOS)*COS(ELLOS) + VAC^2
1020 VP = SQR(VP)
1030 VACN = VAC - RGNOISE*VAC/460
1040 VPN = VMUZ^2 + 2*VMUZ*VACN*SIN(AZLOSN)*COS(ELLOSN) + VACN^2
1050 VPN = SQR(VPN)
1060 RETURN
1070 OMEGA = SQR(AZDHAT^2 + ELDHAT^2)
1080 RHAT = (1 - KPR)*RHAT + DT*(1 - KPR)*RDHAT + KPR*RSLN
1090 RDHAT = (DT*OMEGA^2 - KVR)*RHAT + (1 - DT*KVR)*RDHAT + KVR*RSLN
1100 AZHAT = (1 - KPA)*AZHAT + KPA*AZLOSN + DT*(1 - KPA)*AZDHAT
1110 AZDHAT = -KVA*AZHAT + (1 - DT*KVA - 2*RDHAT*DT/RHAT)*AZDHAT + KVA*AZLOSN
1120 ELHAT = (1 - KPE)*ELHAT + KPE*ELLOSN + DT*(1 - KPE)*ELDHAT
1130 ELDHAT = -KVE*ELHAT + (1 - DT*KVE - 2*RDHAT*DT/RHAT)*ELDHAT + KVE*ELLOSN
1140 RETURN
1150 WCDA = .000005*RSL^2 - .036938*RSL + 700.4299
1160 BETA = 3E-09*RSL^2 - 4.0706E-05*RSL + 1.0216
1170 C = 1.5*RHO/WCDA
1180 TOF = RSL/VP/(1 - C*RSL/4)^2
1190 ELGD = .5*G*COS(ELLOS)*(BETA*TOF)^2/RSL
1200 AZGUN = VP*SIN(AZLOS)*COS(ELLOS) - VAC
1210 AZGUN = AZGUN/VMUZ/COS(ELLOS + ELGD)
1220 AZGUN = ATN(AZGUN/SQR(1 - AZGUN^2))
1230 AZCORR = AZLOS - AZGUN
1240 RETURN
1250 WCDA = .000005*RHAT^2 - .036938*RHAT + 700.4299
1260 BETA = 3E-09*RHAT^2 - 4.0706E-05*RHAT + 1.0216
1270 C = 1.5*RHO/WCDA
1280 TOF = RHAT/VPN/(1 - C*RHAT/4)^2
1290 ELGDN = .5*G*COS(ELHAT)*(BETA*TOF)^2/RHAT
1300 AZGUNN = VPN*SIN(AZHAT)*COS(ELHAT) - VACN
1310 AZGUNN = AZGUNN/VMUZ/COS(ELHAT + ELGDN)
1320 AZGUNN = ATN(AZGUNN/SQR(1 - AZGUNN^2))
1330 AZCORRN = AZHAT - AZGUNN
1340 RETURN
1350 END
CHAPTER XI

FURTHER IMPROVEMENTS FOR CONTINUOUS TRACKING AND AIMING
Adaptive Filters and Increased Sampling Rates

Three  promising  techniques  for  improving  the  BSTING-type 
fire  control  system  so  that  it  provides  more  accurate, 
continuous  tracking  and  aiming  as  well  as  rapid  response 
approaching  a  "point  and  shoot"  capability  have  been  discussed 
previously.  These  are  (a)  the  gun  unlocked  from  the  laser  beam, 
(b)  optimization  of  the  filter  gains  and  (c)  increased  data 
sampling  rates.  A  more  rigorous  investigation  was  done  on  the 
latter  two  techniques. 

SIMULATION  MODEL 

Using the vectorial representation of the helicopter-target
kinematics shown in figure XI-1, the equations of motion with
respect to the X-Y-Z axes can be derived. Note: The rotation of
the x-y-z reference plane attached to the helicopter can be
described by

[Equation XI-1 is not legible in the source.]
It was assumed for this analysis that the target was
moving while the platform remained stationary. This gives the
same results as if the platform were moving with respect to the
target, but makes the analysis a bit simpler. The position,
velocity and acceleration of the target with respect to the
platform are:

\[
\begin{aligned}
\mathbf{r} &= r\,\mathbf{i} \\
\mathbf{v} &= \dot{r}\,\mathbf{i} + r\omega_a\,\mathbf{j} - r\omega_e\,\mathbf{k} \\
\mathbf{a} &= [\ddot{r} - r\omega^2]\,\mathbf{i}
            + [r\dot{\omega}_a + 2\dot{r}\omega_a]\,\mathbf{j}
            - [r\dot{\omega}_e + 2\dot{r}\omega_e]\,\mathbf{k}
\end{aligned}
\tag{XI-2}
\]

Where:

\[
\omega^2 = \omega_a^2 + \omega_e^2
\]

The left hand side of the kinematic acceleration equation
above is assumed to be the maneuver noise. The acceleration
vector is then used for the state space model of the system
kinematics.


There are two stochastic processes at work in the model with
respect to system noise. These are (a) noise associated with
model uncertainties, represented by the maneuver noise, and (b)
measurement noise. It is assumed that both processes are
Gaussian, have zero mean and are uncorrelated with each other.
The units of the maneuver noise are feet/sec^2, and the units of
the measurement noise for range and angular rates are feet and
rad^2/sec^2 respectively.
The kinematic model can be decoupled into three separate
sub-models that describe (a) range and range rate, (b) azimuth
rate and (c) elevation rate.


Range and Range Rate


The range state-space model is:

\[
\frac{d}{dt}\begin{bmatrix} r \\ \dot{r} \end{bmatrix}
= \begin{bmatrix} 0 & 1 \\ \omega^2 & 0 \end{bmatrix}
  \begin{bmatrix} r \\ \dot{r} \end{bmatrix}
+ \begin{bmatrix} 0 \\ 1 \end{bmatrix} v_r
\tag{XI-3}
\]

and

\[
y_r = \begin{bmatrix} 1 & 0 \end{bmatrix}
      \begin{bmatrix} r \\ \dot{r} \end{bmatrix} + \nu_r
\]

The terms $v_r$ and $\nu_r$ represent the process and measurement
noise respectively. Note that the same nomenclature is used for the
azimuth and elevation rate models; however, the subscripts are
changed to denote the model.

A discrete state-space model, as required in a system with
discrete measurements, can be developed by discretizing the
differential equations of motion using the forward difference
equation:

\[
\dot{x} \approx \frac{x_{k+1} - x_k}{\Delta t}
\tag{XI-4}
\]
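
As a small illustration of forward-difference discretization, the Python sketch below advances a range/range-rate state by one sample. It assumes the continuous model has range acceleration equal to omega^2 times range plus maneuver noise, which is the form implied by the range-rate filter of the previous chapter; the function name and arguments are hypothetical.

```python
def range_step(r, rdot, omega, dt, accel_noise=0.0):
    """Forward-difference (explicit Euler) step of the range kinematics.

    Replaces each derivative by (x[k+1] - x[k])/dt, so that
    x[k+1] = x[k] + dt * (right-hand side evaluated at step k).
    """
    r_next = r + dt*rdot
    rdot_next = rdot + dt*(omega**2*r + accel_noise)
    return r_next, rdot_next
```

Because every right-hand side is evaluated at step k, the discrete model stays linear in the state, which is what allows it to be written in matrix form for the filter design that follows.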
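Thus, the stochastic discrete state-space model for the range
and range rate kinematics becomes:

\[
\begin{bmatrix} r \\ \dot{r} \end{bmatrix}_{k+1}
= \begin{bmatrix} 1 & \Delta t \\ \omega_k^2\,\Delta t & 1 \end{bmatrix}
  \begin{bmatrix} r \\ \dot{r} \end{bmatrix}_k
+ \begin{bmatrix} 0 \\ \Delta t \end{bmatrix} v_{r,k}
\tag{XI-5}
\]

and

\[
y_{r,k} = \begin{bmatrix} 1 & 0 \end{bmatrix}
          \begin{bmatrix} r \\ \dot{r} \end{bmatrix}_k + \nu_{r,k}
\tag{XI-6}
\]

where $v_{r,k}$ is the process noise and $\nu_{r,k}$ is the
measurement noise.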

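Azimuth and Elevation Rates


The stochastic continuous model for the azimuth rate is:

\[
\dot{\omega}_a = -\frac{2\dot{r}}{r}\,\omega_a + \frac{1}{r}\,v_a
\tag{XI-7}
\]

and

\[
y_a = \omega_a + \nu_a
\tag{XI-8}
\]

After discretization, the stochastic azimuth rate state-space
model becomes:

\[
\omega_{a,k+1} = \left[1 - \frac{2\dot{r}_k}{r_k}\,\Delta t\right]\omega_{a,k}
               + \frac{\Delta t}{r_k}\,v_{a,k}
\tag{XI-9}
\]

and

\[
y_{a,k} = \omega_{a,k} + \nu_{a,k}
\tag{XI-10}
\]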
Likewise for the elevation rate, the stochastic continuous
state space model is:

[Equation XI-11 is not legible in the source.]

and

[Equation XI-12 is not legible in the source.]

Discretizing the differential equation for the elevation rate:

[Equations XI-13 and XI-14 are not legible in the source.]
Kalman Filter


Discrete Kalman filters can now be developed using the
above stochastic discrete state-space models. The general
form of the Kalman filter is:

[Equation XI-15 is not legible in the source.]
Note: the "hatted" terms denote an estimate, a single overbar
represents a vector and a double overbar signifies a matrix.

For the range and range rate estimators the A_k, H_k and K_k
matrices are:

[Equation XI-16: only H_k = [1 0] is legible in the source.]

The resulting Kalman filters or optimal estimators are:

[Equation XI-17 is not legible in the source.]

The A_k, H_k and K_k matrices for azimuth and elevation rate
estimators are:

[Equation XI-19 is not legible in the source.]

Using these matrices, the Kalman filters for azimuth and
elevation rates become:

[Equations XI-20 and XI-21 are not legible in the source.]

The only unknowns in the Kalman filter equations are the
filter gains, k_pr, k_vr, k_pa and k_pe. There are two common
approaches to assigning values for these gains. The simplest
way is to assign constant values, as is the case with the
current BSTING algorithms. The other technique is to use a
priori and a posteriori state and error covariance equations.
These equations update the values of the filter gains at each
time increment.

Using constant filter gains works well only if a steady
state response is desired. However, if a reliable estimate is
desired for the transient response of a filter, then the state
and error covariance equations should be solved. This discrete
Kalman filter design is often called a predictor-corrector
method because the a priori state and error covariance
equations are used to predict how the means and variances of
the state evolve through time under the influence of the system
dynamics. The a posteriori equations correct the states, plus
the mean and error covariances, based on the updated measurement.
The a priori state equation is:

\[
\hat{\bar{x}}^{-}_{k+1} = \bar{\bar{A}}_k\, \hat{\bar{x}}_k
\tag{XI-22}
\]

The a priori error covariance equation is:

\[
\bar{\bar{P}}^{-}_{k+1} = \bar{\bar{A}}_k\, \bar{\bar{P}}_k\, \bar{\bar{A}}_k^{T} + \bar{\bar{Q}}_k
\tag{XI-23}
\]


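Once the a priori state and error covariance are computed, the
filter gains and a posteriori states and error covariances
can be determined.

\[
\hat{\bar{x}}_{k+1} = \hat{\bar{x}}^{-}_{k+1}
+ \bar{\bar{K}}_k \left(\bar{z}_{k+1} - \bar{\bar{H}}_k\, \hat{\bar{x}}^{-}_{k+1}\right)
\tag{XI-24}
\]

\[
\bar{\bar{P}}_{k+1} = \left(\bar{\bar{I}} - \bar{\bar{K}}_k \bar{\bar{H}}_k\right) \bar{\bar{P}}^{-}_{k+1}
\tag{XI-25}
\]

and

\[
\bar{\bar{K}}_k = \bar{\bar{P}}^{-}_{k+1} \bar{\bar{H}}_k^{T}
\left(\bar{\bar{H}}_k \bar{\bar{P}}^{-}_{k+1} \bar{\bar{H}}_k^{T} + \bar{\bar{R}}_k\right)^{-1}
\tag{XI-26}
\]

are the a posteriori state and error covariance and filter gain
equations, respectively. The matrices Q_k and R_k are the process
noise covariance and measurement noise covariance matrices,
respectively. If equations XI-23 and XI-25 are substituted into
equation XI-24 and then simplified, the result reduces to
equation XI-15 which is the general form of the Kalman filter.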
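The reduction to the general form can be written out in a few lines. The following is a sketch with the vector and matrix bars dropped for brevity. It assumes the a priori state is the transition matrix applied to the previous estimate, and that the a posteriori state is the a priori state plus the gain times the measurement residual; the superscript minus marking a priori quantities is a notational assumption.

```latex
\begin{aligned}
\hat{x}_{k+1} &= \hat{x}^{-}_{k+1} + K_k \left( z_{k+1} - H_k \hat{x}^{-}_{k+1} \right) \\
              &= \left( I - K_k H_k \right) \hat{x}^{-}_{k+1} + K_k z_{k+1} \\
              &= \left( I - K_k H_k \right) A_k \hat{x}_k + K_k z_{k+1}
\end{aligned}
```

The covariance equations enter only through the gain K_k, which is why the state update keeps the same form whether the gains are constant or recomputed at each step.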
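The range a priori error covariance takes the form of a
2 by 2 matrix, the elements of which are given by:

\[
\begin{aligned}
P^{-}_{11_{k+1}} &= P_{11_k} + \Delta T_r \left(P_{12_k} + P_{21_k} + P_{22_k}\,\Delta T_r\right) \\
P^{-}_{12_{k+1}} &= P_{12_k} + \Delta T_r \left[P_{22_k} + \omega_k^2 \left(P_{11_k} + P_{21_k}\,\Delta T_r\right)\right] \\
P^{-}_{21_{k+1}} &= P_{21_k} + \Delta T_r \left[P_{22_k} + \omega_k^2 \left(P_{11_k} + P_{12_k}\,\Delta T_r\right)\right] \\
P^{-}_{22_{k+1}} &= P_{22_k} + q_{r}\,\Delta T_r^2
 + \omega_k^2\,\Delta T_r \left(P_{21_k} + P_{12_k} + \omega_k^2\,\Delta T_r\, P_{11_k}\right)
\end{aligned}
\]

where

\[
\bar{\bar{P}}_k = \begin{bmatrix} P_{11_k} & P_{12_k} \\ P_{21_k} & P_{22_k} \end{bmatrix}
\]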

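The a posteriori error covariance matrix for the range
kinematics is:

\[
\bar{\bar{P}}_{k+1} = \frac{1}{P^{-}_{11_{k+1}} + r_r}
\begin{bmatrix}
r_r\, P^{-}_{11_{k+1}} & r_r\, P^{-}_{12_{k+1}} \\
r_r\, P^{-}_{21_{k+1}} &
\left(P^{-}_{11_{k+1}} + r_r\right) P^{-}_{22_{k+1}} - P^{-}_{21_{k+1}} P^{-}_{12_{k+1}}
\end{bmatrix}
\tag{XI-28}
\]

The filter gains are:

\[
k_{pr} = \frac{P^{-}_{11_{k+1}}}{P^{-}_{11_{k+1}} + r_r}
\tag{XI-29}
\]

\[
k_{vr} = \frac{P^{-}_{21_{k+1}}}{P^{-}_{11_{k+1}} + r_r}
\tag{XI-30}
\]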


The  expression  for  the  range  and  rate  state  estimators  is 
given  in  equation  XI-16. 
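One complete predictor-corrector cycle of the range filter can be sketched in Python as shown below. The covariance and gain expressions follow the arithmetic in the Kalman.m listing; the state update is written in the standard predicted-state-plus-gain-times-residual form, which is an assumption about the intended estimator. All names are illustrative.

```python
def range_filter_step(r_hat, rd_hat, P, z, omega_sq, dt, q_r, r_r):
    """One predict/correct cycle of the two-state range filter.

    P is the a posteriori covariance (p11, p12, p21, p22) from the
    previous step; z is the new range measurement.
    Returns (r_hat, rd_hat, P, (k_pr, k_vr)).
    """
    p11, p12, p21, p22 = P

    # A priori (predicted) error covariance elements.
    pb11 = p11 + dt * (p12 + p21 + p22 * dt)
    pb12 = p12 + dt * (p22 + (p11 + dt * p21) * omega_sq)
    pb21 = p21 + dt * (p22 + (p11 + dt * p12) * omega_sq)
    pb22 = (p22 + q_r * dt ** 2
            + omega_sq * dt * (p21 + p12 + dt * omega_sq * p11))

    # Filter gains for range and range rate.
    denom = pb11 + r_r
    k_pr = pb11 / denom
    k_vr = pb21 / denom

    # A posteriori (corrected) error covariance.
    P_new = (pb11 * r_r / denom, pb12 * r_r / denom,
             pb21 * r_r / denom, pb22 - pb21 * pb12 / denom)

    # Predict the state, then correct it with the measurement residual.
    r_bar = r_hat + dt * rd_hat
    rd_bar = rd_hat + dt * omega_sq * r_hat
    residual = z - r_bar
    return (r_bar + k_pr * residual, rd_bar + k_vr * residual,
            P_new, (k_pr, k_vr))


# Example using initial covariances of 5000 and 50 with q_r = 0.7
# and r_r = 10, and a measurement equal to the predicted range.
r_new, rd_new, P_new, gains = range_filter_step(
    3280.0, 0.0, (5000.0, 0.0, 0.0, 50.0), 3280.0, 0.0, 0.22, 0.7, 10.0)
```

With a large initial range covariance and a small measurement covariance, the first range gain is close to one, so the filter initially trusts the rangefinder almost completely.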
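The a priori error covariance for the azimuth rate is:

\[
P^{-}_{a_{k+1}} = \left(1 - \frac{2\hat{\dot{r}}_k}{\hat{r}_k}\,\Delta T_{ae}\right)^{2} P_{a_k}
+ \Delta T_{ae}^{2}\, q_a
\tag{XI-31}
\]

The a posteriori error covariance and filter gain equations
are:

\[
P_{a_{k+1}} = (1 - k_{pa})\, P^{-}_{a_{k+1}}
\tag{XI-32}
\]

and

\[
k_{pa} = \frac{P^{-}_{a_{k+1}}}{P^{-}_{a_{k+1}} + r_a}
\tag{XI-33}
\]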
The a priori and a posteriori error covariance and filter gain
equations for the elevation are:

\[
P^{-}_{e_{k+1}} = \left(1 - \frac{2\hat{\dot{r}}_k}{\hat{r}_k}\,\Delta T_{ae}\right)^{2} P_{e_k}
+ \Delta T_{ae}^{2}\, q_e
\tag{XI-34}
\]

\[
P_{e_{k+1}} = (1 - k_{pe})\, P^{-}_{e_{k+1}}
\tag{XI-35}
\]

and

\[
k_{pe} = \frac{P^{-}_{e_{k+1}}}{P^{-}_{e_{k+1}} + r_e}
\tag{XI-36}
\]
COMPUTER SIMULATIONS


Computer programs were written to implement the Kalman
filters, error covariances, and filter gains described in the
previous section. These were then used to test the validity of
the model, analyze the performance of the filter, and
optimize the filter gain coefficients. Two computer
programs, Laserpr.m and Kalman.m, were written for this
simulation. These programs, written in Mathematica, are
included at the end of this chapter.

Noise Model and Scenario

In order to simulate the actual measuring and aiming
problem, it was necessary to model the "noisy" measurements
obtained from the laser rangefinder and azimuth/elevation rate
sensors. Again, actual flight test data from the Summer 1990
tests were used as the basis for the noise models. First, a
quadratic least-squares expression was fitted to this noisy
data. This quadratic expression was then subtracted from the
noisy data to obtain a nearly zero-mean generic noise profile.
Separate noise profiles were determined for range as well as
azimuth and elevation rate measurements. These profiles were
then added to the theoretical range and azimuth/elevation rates
to obtain the noisy measurement models. Figures XI-2, XI-3 and
XI-4 show the generic noise profiles that were used in
subsequent analyses.
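The detrending step described above can be sketched in Python: fit a quadratic by least squares, then subtract it to leave a noise profile with a mean of essentially zero. This is a minimal illustration using the normal equations and only the standard library; the function names and the synthetic data are assumptions, not the report's flight test data or its actual processing code.

```python
def fit_quadratic(t, y):
    """Least-squares coefficients (c0, c1, c2) of y ~ c0 + c1*t + c2*t**2."""
    # Build the 3x3 normal equations as an augmented 3x4 matrix.
    s = [sum(ti ** p for ti in t) for p in range(5)]
    b = [sum(yi * ti ** p for ti, yi in zip(t, y)) for p in range(3)]
    m = [[s[i + j] for j in range(3)] + [b[i]] for i in range(3)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda row: abs(m[row][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(3):
            if row != col:
                f = m[row][col] / m[col][col]
                m[row] = [a - f * c for a, c in zip(m[row], m[col])]
    return [m[i][3] / m[i][i] for i in range(3)]


def detrend(t, y):
    """Subtract the fitted quadratic, leaving the noise profile."""
    c0, c1, c2 = fit_quadratic(t, y)
    return [yi - (c0 + c1 * ti + c2 * ti ** 2) for ti, yi in zip(t, y)]


# Synthetic example: a smooth range history sampled every 0.22 seconds
# with an alternating +/- 0.5 ft disturbance added.
times = [0.22 * k for k in range(20)]
smooth = [3280.0 - 50.0 * ti + 4.0 * ti ** 2 for ti in times]
noisy = [v + 0.5 * (-1) ** k for k, v in enumerate(smooth)]
profile = detrend(times, noisy)
```

Because the fit includes a constant term, the residuals sum to zero up to rounding, which is why the resulting profile is nearly zero-mean.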


The following flight scenario was used in this simulation:

AIRCRAFT VELOCITY          =  150 knots
ALTITUDE ABOVE TARGET      =  250 feet
RANGE AT CLOSEST APPROACH  =  1000 meters
AZIMUTH RANGE              =  +45 to -45 degrees


Figure XI-2
Zero-Mean Range Noise Profile


Figure XI-3
Zero-Mean Azimuth Rate Noise Profile


Figure XI-5
Zero-Mean Elevation Rate Noise Profile


Kalman  Filter  Statistical  Parameters 


The first step to improve the Kalman filtering was to
choose values for the statistical parameters in the respective
equations. This was done using a simple sensitivity analysis
where the statistical parameters were varied individually over
a range of values starting from a baseline set of design
parameters. This was a time consuming task because ten parameters
had to be evaluated and about seven different values were used
for each parameter. Once the simulations were completed the
data was reduced and plotted so that the effects of changing one
variable over the range of values could be analyzed. This
sensitivity analysis produced a large matrix of data that was
used to choose the optimum parameter values. Optimization was
based on the best approximation to the theoretical (unnoisy)
azimuth and elevation corrections. The calculated corrections
(noisy) had to fall within a +/- 4 milliradian band around the
theoretical corrections. The covariance values that gave the
best results are shown in Table XI-I.
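The one-at-a-time sweep described above can be sketched as follows. The baseline values are the ten covariances listed in Table XI-I, used here only as an example starting point; the seven multipliers are hypothetical, since the report does not list the values that were actually tried.

```python
def one_at_a_time_cases(baseline, multipliers):
    """Build parameter sets that vary a single parameter per case.

    Returns a list of (name, multiplier, parameters) tuples where
    parameters equals the baseline except for the named entry.
    """
    cases = []
    for name in baseline:
        for m in multipliers:
            trial = dict(baseline)
            trial[name] = baseline[name] * m
            cases.append((name, m, trial))
    return cases


# Ten statistical parameters (values as listed in Table XI-I).
baseline = {
    "p11": 5000.0, "p22": 50.0, "pa": 0.03, "pe": 0.001,
    "rr": 10.0, "ra": 0.008, "re": 0.01,
    "qr": 0.7, "qa": 0.0006, "qe": 0.0001,
}
# Hypothetical spread of seven values around each baseline entry.
multipliers = (0.1, 0.3, 0.5, 1.0, 2.0, 3.0, 10.0)
cases = one_at_a_time_cases(baseline, multipliers)
```

Ten parameters at seven values each gives seventy simulation runs, which is consistent with the description of the task as time consuming.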

Figure XI-6 shows a comparison between the theoretical
(unnoisy) and calculated (noisy) azimuth aiming corrections.

The corresponding difference, or error, is shown in Figure XI-7.
The results show that the error is outside the +/- 4
milliradian criterion until after 8.5 seconds. After that, the
TABLE XI-I


OPTIMUM COVARIANCE VALUES


p11 = 5000 ft^2              p22 = 50 ft^2/sec^2

pa  = 0.03 rad^2/sec^2       pe  = 0.001 rad^2/sec^2

rr  = 10 ft^2                ra  = 0.008 rad^2/sec^2

re  = 0.01 rad^2/sec^2       qr  = 0.7 ft^2/sec^2

qa  = 0.0006 rad^2/sec^4     qe  = 0.0001 rad^2/sec^4

correction stays within the error criterion. The corresponding
corrections in elevation and the elevation aiming errors are shown in
Figures XI-8 and XI-9, respectively. The elevation correction stays
within the +/- 4 milliradian criterion for the entire time.
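The response measure used here, the time after which the aiming error remains inside the +/- 4 milliradian band, can be computed from an error time history with a small helper. This is an illustrative Python sketch; the function name and the sample data are hypothetical.

```python
def settling_time(times, errors_mrad, band=4.0):
    """Earliest time after which |error| stays within the band.

    Returns None if the error is outside the band at the final sample.
    """
    settled = None
    for t, e in zip(times, errors_mrad):
        if abs(e) > band:
            settled = None      # left the band, so start over
        elif settled is None:
            settled = t         # first sample of the current in-band run
    return settled


# Hypothetical error history: inside the band at t = 1 but outside
# again at t = 2, so the settling time is 3.
example = settling_time([0, 1, 2, 3, 4], [10.0, 3.0, 5.0, 2.0, 1.0])
```

Counting only the final uninterrupted in-band run avoids reporting an early, momentary crossing of the band as the response time.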

Figure XI-10 shows how the "adaptive" filter gains varied
during the simulation until they reached the steady state
values shown in Table XI-II.

TABLE XI-II


STEADY STATE FILTER GAINS


k_pr = 0.15          k_vr = 0.06

k_pa = 0.06          k_pe = 0.03
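The approach of an adaptive gain to its steady state value can be reproduced with a scalar covariance recursion. The Python sketch below iterates the a priori covariance, gain, and a posteriori covariance updates for the azimuth rate filter using the Table XI-I azimuth values and the 0.22 second sampling interval. It assumes a transition factor of one (zero range rate), which is a simplification of the simulated fly-by.

```python
def steady_state_gain(p0, q, r, dt, a=1.0, steps=500):
    """Iterate the scalar covariance recursion and return the final gain.

    p0  initial error covariance
    q   process noise covariance
    r   measurement noise covariance
    dt  sampling interval (sec)
    a   state transition factor (assumed constant here)
    """
    p = p0
    k = 0.0
    for _ in range(steps):
        p_bar = a * a * p + dt * dt * q   # a priori covariance
        k = p_bar / (p_bar + r)           # filter gain
        p = (1.0 - k) * p_bar             # a posteriori covariance
    return k


# Azimuth rate values from Table XI-I with 0.22 second sampling.
k_pa = steady_state_gain(p0=0.03, q=0.0006, r=0.008, dt=0.22)
```

Under these assumptions the gain settles near 0.06, in line with the azimuth rate entry in Table XI-II. The early gains are much larger because the initial covariance is large compared with the measurement covariance.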
Figure XI-6

Theoretical versus Calculated Azimuth Corrections


Figure XI-7

Error in Azimuth Aiming Correction


Figure XI-8

Theoretical versus Calculated Elevation Corrections


Figure XI-9

Error in Elevation Aiming Correction
Increased Sampling Rates

Next, an investigation was made to determine whether system
response could be improved (i.e., the error brought within the
+/- 4 milliradian band sooner) by increasing the rate sensor
sampling rate by factors of two, five or ten over the current
sampling interval of 0.22 seconds. Data acquired at an increased
rate was simulated by appropriately compressing the nominal
zero-mean noise data previously used (e.g., Figure XI-3). Sampling
intervals of 0.11, 0.044 and 0.022 seconds were investigated.
In order to obtain sufficiently long noise files
for the shorter sampling intervals, the noise files were lengthened
by appending duplicate files end-to-end.
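The compression and lengthening of the noise files can be sketched in Python. The idea is to keep the recorded noise samples but space them at the shorter interval, repeating the file end-to-end until the simulated duration is covered. The function name, arguments, and sample values are illustrative assumptions.

```python
def compress_and_tile(noise, dt_nominal, factor, duration):
    """Reuse a noise record at a faster simulated sampling rate.

    noise       noise samples recorded at the nominal interval
    dt_nominal  nominal sampling interval (sec), e.g. 0.22
    factor      sampling rate increase, e.g. 2, 5, 8 or 10
    duration    length of simulated time to cover (sec)
    Returns (times, samples) at the compressed interval.
    """
    dt = dt_nominal / factor
    n = int(round(duration / dt)) + 1
    # Index modulo the record length appends duplicates end-to-end.
    samples = [noise[i % len(noise)] for i in range(n)]
    times = [i * dt for i in range(n)]
    return times, samples


# Three recorded samples reused at an eightfold rate for 0.22 seconds.
t_fast, noise_fast = compress_and_tile([1.0, -1.0, 2.0], 0.22, 8, 0.22)
```

This keeps the amplitude distribution of the recorded noise but changes its time scale, so it does not represent how a faster sensor would actually behave.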

The results of this investigation showed a substantial
improvement in the response for the azimuth aiming correction
and a minor improvement for the elevation aiming correction,
which was not a problem in the first place. With a tenfold
increase in sampling rate, the +/- 4 milliradian criterion was
reached in less than one second compared to about 3.5 seconds
with the 0.22 second sampling interval.

The increased sampling rate data was then used with the
adaptive filtering. Here a sampling interval of 0.0275 seconds
(an eightfold increase in rate) was used for azimuth rate data
and 0.22 seconds for the elevation rate sampling. Figure XI-11
shows the compressed noise data (0.0275 second sampling interval)
added to the theoretical (unnoisy) azimuth rate.

The Kalman filter statistical parameters were again
optimized. Again the optimization criterion was agreement with
the theoretical (unnoisy) aiming corrections. The results of this
reoptimization are summarized in Table XI-III.
Figure XI-11


Azimuth Rate Sensor Data - 0.0275 second Sampling Rate


TABLE XI-III


REVISED OPTIMUM COVARIANCE VALUES


p11 = 10000 ft^2             p22 = 300 ft^2/sec^2

pa  = 0.02 rad^2/sec^2       pe  = 0.003 rad^2/sec^2

rr  = 10 ft^2                ra  = 0.006 rad^2/sec^2

re  = 0.01 rad^2/sec^2       qr  = 0.7 ft^2/sec^2

qa  = 0.006 rad^2/sec^4      qe  = 0.0005 rad^2/sec^4


Using an increased sampling rate results in drastic
improvements in system response. Figure XI-10 shows a
comparison between the required (theoretical) azimuth aiming
correction and the corrections obtained from the noisy rate
sensor measurements (0.0275 second sampling interval). The
corresponding difference between these corrections is shown in
Figure XI-11. The difference falls within the +/- 4 milliradian
error band within a mere 0.25 seconds. Figures XI-12 and XI-13
show the corresponding corrections and difference in
correction in the elevation plane. Here the error stayed within
the required error band for the entire time. Finally, as seen
in Figure XI-14, while the adaptive filter gains reach steady
state values that are similar to those found previously (i.e.
Figure XI-10 and Table XI-II), the gains are significantly
different during the initial few seconds.
Figure XI-10

Theoretical versus Calculated Azimuth Corrections
(0.0275 second Sampling Rate)


Figure XI-11

Error in Azimuth Aiming Correction (0.0275 second Sampling Rate)


Figure XI-12

Theoretical versus Calculated Elevation Corrections
(0.0275 second Sampling Rate)


Figure XI-13

Error in Elevation Aiming Correction (0.0275 second Sampling Rate)


CHAPTER  XII 

CONCLUSIONS  AND  RECOMMENDATIONS 

CONCLUSIONS 

The current BSTING algorithms can be modified to provide
a continuous solution so that the gun is always appropriately
aimed with respect to the target. This would eliminate the
current situation where the aiming correction (difference
between gun and laser line-of-sight in azimuth and elevation)
is fixed and thus the aiming accuracy degrades as the platform
moves with respect to the target.

A correction technique was developed that compensates for
the fact that azimuth and elevation rates are measured along
the gun boresight rather than the required laser beam axis.
Subsequent analysis with "noisy" data simulating an actual
gunner aiming from a moving platform showed that measurements
made along the gun boresight and laser beam axis provide aiming
corrections of essentially equal accuracy.
Furthermore, there was essentially no change in accuracy
if the gun and laser were locked together for an initial
computation period (i.e. 2 seconds) and then released to allow
a continuous tracking and aiming capability.

The current BSTING algorithm for elevation correction
due to projectile gravity drop does not account for the
elevation angle and uses the "small angle" approximation.
An analysis showed that if the platform is over 1000 feet above
the target (and a 1 milliradian error is tolerable from gravity
drop) then the elevation angle should be measured and included
in the computation of the gravity drop correction. The small
angle approximation should also not be used.

An alternate fire control system was designed that used
external measurement of parameters in lieu of (or in addition
to) sensors mounted on the gun only, as used in the current
BSTING with its gun mounted rate and range sensors. In this
system, the actual azimuth and elevation angles between the gun
and platform were measured by resolvers. These angles were
corrected by the previous aiming correction before being used
to calculate the next aiming correction. This technique
very closely approximates the use of the more accurate LOS
azimuth and elevation angles in calculating aiming corrections.
While this fire control system greatly reduces the sensitivity
to azimuth and elevation errors due to gunner pointing
difficulties, the system requires accurate measurement of the
aircraft velocity. Also, the system is more complex than the
BSTING system and requires the use of an inertial platform that
brings added cost and its own error contributions.

Using constantly updated Kalman filter gains (adaptive
Kalman filtering) resulted in a modest improvement in system
response (i.e. time before aiming correction errors are within
the +/- 4 milliradian error band). System response can be
substantially improved by increasing the data sampling rate for
the sensor measurements, especially in the azimuth plane. For
example, when the azimuth rate measurements were simulated at
0.0275 second intervals rather than the 0.22 second sampling
interval used in the current BSTING system, the aiming error was
within the tolerance band in a mere 0.25 seconds.

RECOMMENDATIONS 

While the BSTING system can theoretically be modified to
provide continuous solutions, an investigation should be done to
see the "hardware" effects of actually implementing a continuous
solution capability. For example, the dynamics of the
servos used to move the laser beam with respect to the gun may
result in an additional aiming error or a decreased response.
The increased sampling rate investigation used noisy data
simulated simply by taking the BSTING data taken at 0.22 second
intervals and reducing the intervals by factors of two,
four, eight and ten. A more rigorous analysis should use data
from actual sensors that make measurements at the faster rates.

Another idea that might be investigated is to "subtract"
out gunner induced noise by using both angular resolvers and
rate sensors. This answer would be used to determine platform
velocity that could be used in the correction algorithms. For a
platform moving with respect to a stationary target:

\[
\dot{AZ}_{platform} = \frac{dAZ}{dt}
\]

If we know \dot{AZ}_{platform} we can easily compute the platform
velocity V_{a/c}, i.e.

\[
V_{a/c} = \dot{AZ}_{platform} \times RSL \times \cos(\text{Elevation Angle})
\]

Now

\[
\dot{AZ}_{total} = \dot{AZ}_{platform} + \dot{AZ}_{gun/platform}
\]

\[
\dot{AZ}_{platform} = \dot{AZ}_{total} - \dot{AZ}_{gun/platform}
\]

where

\dot{AZ}_{total} is the total azimuth rate as measured by rate
sensors mounted on the gun (e.g. current BSTING type rate
sensors)

\dot{AZ}_{gun/platform} is the azimuth rate of the gun with
respect to the platform. This could be measured by an angular
resolver (either as angle divided by the time increment or
directly as a rate).

Further investigation is needed to see if such a technique
will give more accurate results using measurements from actual
sensors that could be used in a low cost, gun mounted fire
control system.
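The proposed velocity computation can be sketched in Python. The sketch subtracts the resolver-derived rate of the gun relative to the platform from the total rate measured on the gun, then scales by the range and the cosine of the elevation angle. The function and argument names are illustrative, and the numbers in the example are hypothetical rather than taken from the flight tests.

```python
import math


def platform_velocity(az_rate_total, az_rate_gun, range_ft, elevation_rad):
    """Estimate platform velocity from azimuth rates.

    az_rate_total  total azimuth rate from gun mounted rate sensor (rad/sec)
    az_rate_gun    gun rate relative to the platform, from resolver (rad/sec)
    range_ft       range to the target (ft)
    elevation_rad  elevation angle (rad)
    Returns velocity in ft/sec.
    """
    az_rate_platform = az_rate_total - az_rate_gun
    return az_rate_platform * range_ft * math.cos(elevation_rad)


# Hypothetical example: 0.080 rad/sec total, of which 0.003 rad/sec is
# gunner motion, at a 3280 ft range and zero elevation angle.
v_ac = platform_velocity(0.080, 0.003, 3280.0, 0.0)
```

The example gives roughly 253 ft/sec, which is close to the 150 knot fly-by speed used in the simulations.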
COMPUTER PROGRAM: Laserpr.m


PURPOSE:  Generates theoretical range and azimuth/elevation
          rate data. Adds generic noise profiles to
          theoretical data. Computes the theoretical azimuth
          and elevation aiming corrections that will be used
          for comparisons of similar corrections based on
          simulated noisy measurement to establish the
          performance of the Kalman filter design.

RESULTS:  Simulated "noisy" range and azimuth/elevation rate
          measurements used in Kalman.m. Also provides
          required theoretical aiming corrections used to
          evaluate system performance.
kalman[ ] := Block[

(* objective: This program simulates the discrete kalman filter
designed for the BSTING system. The scenario is a constant velocity
fly by at 150 knots at an altitude of 250 ft., with a minimum range
of 1000 mtr. (3280 ft.). The simulated range, azimuth rate, and
elevation rate data are contained in scen5.dat. *)

{ eltheo, aztheo, p11, p12, p21, p22, pa, pe, pb11, pb12,
pb21, pb22, pba, pbe, kpr, kvr, kpa, kpe, dtr, dtae, rr,
ra, re, qr, qa, qe, c1, c2, c3, term1, term2, term3,
rhat, rhatd, azdhat, eldhat, omega2, denom}, {

azelupdate[ ] := Block[

(* This procedure calculates the azimuth and elevation
corrections, due to platform velocity and gravity. The
calculations also update arrays containing the azimuth and
elevation rates and azimuth and elevation corrections.
*)

{ elcorr, azcorr, vp, wcda, beta, rho, c, tof, gravdrop
}, {

elcorr = -ArcTan[(rhat*eldhat/2840)/Sqrt[1 -
(rhat*eldhat/2840)^2]];
azc = rhat*azdhat/2840;
azcorr = -ArcTan[(rhat*azdhat/2840)/Sqrt[1 -
(rhat*azdhat/2840)^2]];
vp = rhatd + 2840*Cos[azcorr]*Cos[elcorr];
wcda = .000005*rhat^2 - .036938*rhat + 700.429;
beta = 3.*10^-09*rhat^2 - 4.7*10^-05*rhat + 1.0216;
rho = .076479;
c = 1.5*rho/wcda;
tof = rhat/vp/(1 - (c*rhat/4))^2;
gravdrop = 16.085*(beta*tof)^2/rhat;
elcorr = elcorr + gravdrop;

azcorrdata = AppendTo[azcorrdata, {time, azcorr*1000}];
elcorrdata = AppendTo[elcorrdata, {time, elcorr*1000}]

}];

(* The next seven procedures generate plots of various parameters
of interest *)

plottera[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* This procedure plots the azimuth rate estimate data and
the theoretical azimuth rate data together on one plot. *)

{style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

grapha = Show[{plot1, plot2,
Graphics[ {Text["azdhat", {l1[[350,1]],
l1[[350,2]]}], Text["theo. azd", {l2[[70,1]],
l2[[70,2]]}] } ]}, PlotLabel -> pl,
AxesLabel -> {xl,yl}, PlotRange -> pr ]

}];

plottere[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* This procedure plots the elevation rate estimate data and
the theoretical elevation rate data together on one plot.
*)

{style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

graphe = Show[{plot1, plot2,
Graphics[ { Text["eldhat", {l1[[350,1]],
l1[[350,2]]}], Text["theo. eld", {l2[[70,1]],
l2[[70,2]]}] } ]}, PlotLabel -> pl,
AxesLabel -> {xl,yl}, PlotRange -> pr ]

}];

plotterr[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* This procedure plots the range estimate data and the
theoretical range data together on one plot. *)

{style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

graphr = Show[{plot1, plot2,
Graphics[ { Text["rhat", {l1[[40,1]],
l1[[40,2]]}], Text["theo. rng", {l2[[70,1]],
l2[[70,2]]}] } ]}, PlotLabel -> pl,
AxesLabel -> {xl,yl}, PlotRange -> pr ]

}];
plotterrd[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* This procedure plots the range rate estimate data and the
theoretical range rate data together on one plot. *)

{style1, style2, plot1, plot2 }, {
style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

graphrd = Show[{plot1, plot2,
Graphics[ { Text["rhatd", {l1[[40,1]],
l1[[40,2]]}], Text["theo. rd", {l2[[70,1]],
l2[[70,2]]}] } ]}, PlotLabel -> pl,
AxesLabel -> {xl,yl}, PlotRange -> pr ]

}];

plotterk[ l1_, l2_, l3_, l4_, pl_, xl_, yl_, pr_] := Block[

(* This procedure overlays the range (kpr), range rate (kvr),
azimuth rate (kpa), and elevation rate (kpe) filter gain data
*)

{style1, style2, style3, style4, plot1, plot2, plot3,
plot4}, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};
style3 = {Dashing[{.05, .05}], Thickness[0.002],
PointSize[0.002]};
style4 = {Dashing[{.005, .01, .01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];
plot3 := ListPlot[ l3, PlotJoined -> True, PlotStyle ->
style3];
plot4 := ListPlot[ l4, PlotJoined -> True, PlotStyle ->
style4];

graphk = Show[{plot1, plot2, plot3, plot4,
Graphics[ { Text["kpr", {l1[[40,1]],
l1[[40,2]]}], Text["kvr", {l2[[60,1]],
l2[[60,2]]}], Text["kpa", {l3[[600,1]],
l3[[600,2]]}], Text["kpe", {l4[[800,1]],
l4[[800,2]]}] } ]}, PlotLabel -> pl,
AxesLabel -> {xl,yl}, PlotRange -> pr ]

}];
plotterca[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* The azimuth correction based on estimated data and the
theoretical azimuth correction are plotted on the same
graph. *)

{style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

graphca = Show[{plot1, plot2,
Graphics[ { Text["azcorr", {l1[[350,1]],
l1[[350,2]]}], Text["theo. azcorr",
{l2[[560,1]], l2[[560,2]]}] } ]},
PlotLabel -> pl, AxesLabel -> {xl,yl},
PlotRange -> pr ]

}];

plotterce[ l1_, l2_, pl_, xl_, yl_, pr_] := Block[

(* The elevation correction based on estimated data and the
theoretical elevation correction are plotted on the same
graph. *)

{style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
style1];
plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
style2];

graphce = Show[{plot1, plot2,
Graphics[ { Text["elcorr", {l1[[350,1]],
l1[[350,2]]}], Text["theo. elcorr",
{l2[[560,1]], l2[[560,2]]}] } ]},
PlotLabel -> pl, AxesLabel -> {xl,yl},
PlotRange -> pr ]

}];


(* The main body of the program begins here.
Define the necessary data arrays. *)

rhatdata = {};      (* range estimate *)
rhatddata = {};     (* range rate estimate *)
azdhatdata = {};    (* azimuth rate estimate *)
eldhatdata = {};    (* elevation rate estimate *)
kprdata = {};       (* range filter gain *)
kvrdata = {};       (* range rate filter gain *)
kpadata = {};       (* azimuth rate filter gain *)
kpedata = {};       (* elevation rate filter gain *)
azcorrdata = {};    (* azimuth correction *)
elcorrdata = {};    (* elevation correction *)

(* Open and read into an array the simulated noisy range, azimuth
rate, and elevation rate measurements. *)

data = <<scen5.dat;

(* initialize state covariance matrices. The noise is assumed to
be gaussian so the range states noises are uncorrelated. Thus
p12 and p21 are zero. *)

p12 = 0;
p21 = 0;

Print["Enter the initial state covariances."];
p11 = Input["p11 = ? "];
p22 = Input["p22 = ? "];
pa = Input["pa = ? "];
pe = Input["pe = ? "];

Print[];

(* user initializes measurement error covariances. *)

Print["Enter the measurement error covariances."];
rr = Input["rr = ? "];
ra = Input["ra = ? "];
re = Input["re = ? "];

Print[];

(* user initializes plant error covariances. *)

Print["Enter the plant error covariances."];
qr = Input["qr = ? "];
qa = Input["qa = ? "];
qe = Input["qe = ? "];

(* initialize remaining variables *)

time = 0;
dtr = .22;      (* dtr is the range sampling interval in seconds. *)
dtae = .0275;   (* dtae is the sampling interval of the az. and el.
                   rate sensors. *)
rhat = data[[1,4]];
rhatd = 0;
azdhat = 0;
eldhat = 0;

(* the simulation evolves at two different rates. The range
filter has a sampling interval of .22s, while the azimuth rate
and elevation rate filters have sampling intervals of .0275s.
*)
*) 
Do[ {

time = data[[i,1]];

(* update az. and el. rate a priori error covariances *)
pba = (1 - 2*dtae*rhatd/rhat)^2*pa + dtae^2*qa;
pbe = (1 - 2*dtae*rhatd/rhat)^2*pe + dtae^2*qe;

(* update az. and el. rate filter gains *)
kpa = pba/(pba + ra);
kpe = pbe/(pbe + re);

(* update az. and el. rate a posteriori error covariances *)
pa = (1 - kpa)*pba;
pe = (1 - kpe)*pbe;

(* calculate current az. and el. rate estimates *)
term1 = 1 - kpe;
term2 = 1 - kpa;

eldhat = term1*(1 - 2*rhatd*dtae/rhat)*eldhat +
kpe*data[[i,3]]/1000;
azdhat = term2*(1 - 2*rhatd*dtae/rhat)*azdhat +
kpa*data[[i,2]]/1000;

(* calculate the range estimate every eighth time step *)
If[ Mod[(i-1), 8] == 0,

omega2 = azdhat^2 + eldhat^2;

(* update range a priori error covariances *)
pb11 = p11 + dtr*(p12 + p21 + p22*dtr);
pb12 = p12 + dtr*(p22 + (p11 + dtr*p21)*omega2);
pb21 = p21 + dtr*(p22 + (p11 + dtr*p12)*omega2);
pb22 = p22 + qr*dtr^2 + omega2*dtr*(p21 + p12 +
dtr*omega2*p11);

(* update range filter gains *)
denom = pb11 + rr;
kpr = pb11/denom;
kvr = pb21/denom;

(* update range a posteriori error covariances *)
p11 = pb11*rr/denom;
p12 = pb12*rr/denom;
p21 = pb21*rr/denom;
p22 = pb22 - pb21*pb12/denom;

(* calculate current range and range rate estimates *)
c1 = 1 - kpr;
c2 = dtr*c1;
c3 = 1 - kvr*dtr;

rhat = c1*rhat + c2*rhatd + kpr*data[[i,4]];
term3 = omega2*dtr;
rhatd = rhat*(term3 - kvr) + rhatd*c3 +
data[[i,4]]*kvr;

(* add current range estimates, range rate estimates, and filter
gains to time history arrays *)

kprdata = AppendTo[ kprdata, {time,kpr}];
kvrdata = AppendTo[ kvrdata, {time,kvr}];
rhatdata = AppendTo[ rhatdata, {time,rhat}];
rhatddata = AppendTo[ rhatddata, {time,rhatd}] ];

(* add current az. and el. rate estimates and filter gains to
time history arrays *)

azdhatdata = AppendTo[ azdhatdata, {time,azdhat*1000}];
eldhatdata = AppendTo[ eldhatdata, {time,eldhat*1000}];
kpadata = AppendTo[ kpadata, {time,kpa}];
kpedata = AppendTo[ kpedata, {time,kpe}];

(* calculate corrected azimuth and elevation angles *)
azelupdate[] },

{i, 1, Length[data]}];

(* generate plots of time histories

These arrays were generated by the program laserpr.m. They
contain theoretical values. *)

aztheo = <<aztheo.dat;      (* theo. azimuth rate *)
eltheo = <<eltheo.dat;      (* theo. elevation rate *)
rngtheo = <<rngtheo.dat;    (* theo. range *)
rngdtheo = <<rngdtheo.dat;  (* theo. range rate *)

azcrtheo = <<azc5.dat;      (* theo. azimuth correction *)
elcrtheo = <<elc5.dat;      (* theo. elevation correction *)

azcorrdata = Rest[azcorrdata];
elcorrdata = Rest[elcorrdata];

azcrdiff = Table[ {azcorrdata[[i, 1]], azcorrdata[[i, 2]] -
           azcrtheo[[i, 2]]}, {i, 1, Length[azcorrdata]} ];
elcrdiff = Table[ {elcorrdata[[i, 1]], elcorrdata[[i, 2]] -
           elcrtheo[[i, 2]]}, {i, 1, Length[elcorrdata]} ];

plotterca[azcorrdata, azcrtheo, "azcorr & theo. azcorr for scenario:",
          "t(sec)", "mrad", {50, 100}];
plotterda = ListPlot[azcrdiff, PlotJoined -> True,
            PlotRange -> {-5, 5}, AxesLabel -> {"t (sec)", "mrad"},
            PlotLabel -> "difference between azcorr and theo. azcorr"],
plotterce[elcorrdata, elcrtheo, "elcorr & theo elcorr for scenario:",
          "t(sec)", "mrad/s", {10, 20}];
plotterde = ListPlot[elcrdiff, PlotJoined -> True,
            PlotRange -> {-5, 5}, AxesLabel -> {"t (sec)", "mrad"},
            PlotLabel -> "difference between elcorr and theo. elcorr"],
plottera[azdhatdata, aztheo, "azdhat & theo. az. rate for scenario:",
         "t(sec)", "mrad/s", {-30, -20}];
plottere[eldhatdata, eltheo, "eldhat & theo el. rate for scenario:",
         "t(sec)", "mrad/s", {-12, 4}];
plotterr[rhatdata, rngtheo, "rdhat & theo range for scenario:",
         "t(sec)", "feet", {3000, 5000}];
plotterd[rhatddata, rngdtheo, "rhatd & theo range rate for scenario:",
         "t(sec)", "ft/s", {-200, 200}];
plotterk[kprdata, kvrdata, kpadata, kpedata,
         "filter gains for scenario:", "t(sec)", " ", {0, 1}];

}]

laserpr[ ] := Block[ {

vac, rmin, h, rsl, elevation, rsldot, azdot,
eldot, elcorr, velproj, wcda, beta, rho, c,
tof, azcorr, rngnoise, elnoise, aznoise

}, {

vac = 1.6889*Input["Aircraft Velocity (knots) = "];
rmin = 3.28*Input["Minimum Range (meters) = "];
h = Input["Altitude (feet) = "];
azimuth = 0;
grav = 32.17;
time = 0.0;
i = 1;
el1 = elevation;
eldotdata = {};
azdotdata = {};
rngdata = {};
rngddata = {};
elcordata = {};
azcordata = {};

While[azimuth > -.75049,

{rbase = rmin - vac*time;
rsl = Sqrt[h^2 + rmin^2 + rbase^2];
azimuth = ArcTan[rbase/rmin];
elevation = ArcTan[h/Sqrt[rmin^2 + rbase^2]];
rsldot = -vac*rbase/rsl;
azdot = -vac*rmin/rsl^2;
eldot = (elevation - el1)/.22;
el1 = elevation;
elcorr = rsl*eldot/2840;
elcorr = -ArcTan[elcorr/Sqrt[1 - elcorr^2]];
azcorr = rsl*azdot/(2840*Cos[elcorr]);
azcorr = -ArcTan[azcorr/Sqrt[1 - azcorr^2]];
velproj = rsldot + 2840*Cos[elcorr]*Cos[azcorr];
wcda = .000005*rsl^2 - .036938*rsl + 700.4299;
beta = 3.*10^-09*rsl^2 - 4.7*10^-05*rsl + 1.0216;
rho = .076475;
c = 1.5*rho/wcda;
tof = rsl/velproj/(1 - c*rsl/4)^2;
elgravdrop = (grav/2)*((beta*tof)^2)/rsl;
elcorr = elcorr + elgravdrop;

If[ i == 1 || Mod[(i - 1), 8] == 0, range = rsl];
azdotdata = AppendTo[azdotdata, {time, azdot*1000}];
eldotdata = AppendTo[eldotdata, {time, eldot*1000}];
rngdata = AppendTo[rngdata, {time, range}];
rngddata = AppendTo[rngddata, {time, rsldot}];
azcordata = AppendTo[azcordata, {time, azcorr*1000}];
elcordata = AppendTo[elcordata, {time, elcorr*1000}];
Print[ i, " ", time, " ", rsl, " ", range];
i += 1;
time += .0275 } ];

Rest[azcordata];
Rest[elcordata];

elnoise = <<nosel4.dat;
aznoise = <<nosaz4.dat;
rngnoise = <<nosrng4.dat;

nazdotdata = Table[ {azdotdata[[i, 1]], azdotdata[[i, 2]] +
             aznoise[[i, 2]]}, {i, Length[azdotdata]} ];
neldotdata = Table[ {eldotdata[[i, 1]], eldotdata[[i, 2]] +
             elnoise[[i, 2]]}, {i, Length[eldotdata]} ];
nrngdata = Table[ {rngdata[[i, 1]], rngdata[[i, 2]] +
           rngnoise[[i, 2]]}, {i, Length[rngdata]} ];

Print["Theoretical data for azimuth rate, elevation rate, azimuth"];
Print["correction, elevation correction, and range are in the arrays"];
Print["azdotdata, eldotdata, azcordata, elcordata, and rngdata, having"];
Print["the dimensions milliradians, milliradians per second, and feet."];
Print[" "];
Print["Noise corrupted data for azimuth rate, elevation rate, and range"];
Print["are in the arrays nazdotdata, neldotdata, nrngdata."];

} ]
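
The fly-by geometry evaluated inside the While loop of laserpr can be restated compactly. The Python sketch below is not part of the original program: the function name and argument names are illustrative assumptions, and it only mirrors the expressions for rbase, rsl, azimuth, elevation, rsldot and azdot as they are coded in the listing (units of feet and seconds assumed).

```python
import math


def fly_by_geometry(vac, rmin, h, t):
    """Geometry of a straight, level fly-by at time t (hypothetical helper).

    vac  : aircraft speed (ft/s)
    rmin : minimum horizontal range to the gun (ft)
    h    : altitude (ft)
    Returns (rsl, azimuth, elevation, rsldot, azdot).
    """
    rbase = rmin - vac * t                       # along-track distance still to go
    rsl = math.sqrt(h**2 + rmin**2 + rbase**2)   # slant range
    azimuth = math.atan(rbase / rmin)
    elevation = math.atan(h / math.sqrt(rmin**2 + rbase**2))
    rsldot = -vac * rbase / rsl                  # slant range rate
    azdot = -vac * rmin / rsl**2                 # azimuth rate, as written in the listing
    return rsl, azimuth, elevation, rsldot, azdot


if __name__ == "__main__":
    # Illustrative numbers only; they are not taken from the report.
    print(fly_by_geometry(100.0, 300.0, 400.0, 1.0))
```

The range rate follows from differentiating rsl with respect to time, which is why a finite difference of rsl is a convenient sanity check on rsldot.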
azelupdate[ ] := Block[ { }, {

(* this procedure calculates the azimuth and elevation
corrections. It also updates arrays containing the
azimuth and elevation rates and azimuth and elevation
corrections. *)

elc = rhat*eldhat/2840;
elcorr = -ArcTan[ elc/Sqrt[ 1 - elc^2 ] ];
azc = rhat*azdhat/2840;
azcorr = -ArcTan[ azc/Sqrt[ 1 - azc^2 ] ];
vp = rhatd + 2840*Cos[azcorr]*Cos[elcorr];
wcda = .000005*rhat^2 - .036938*rhat + 700.429;
beta = 3.*10^-09*rhat^2 - 4.7*10^-05*rhat + 1.0216;
rho = .076479;
c = 1.5*rho/wcda;
tof = rhat/vp/(1 - (c*rhat/4))^2;
gravdrop = 16.085*(beta*tof)^2/rhat;
elcorr = elcorr + gravdrop;

azcorrdata = AppendTo[azcorrdata, {time, azcorr*1000}];
elcorrdata = AppendTo[elcorrdata, {time, elcorr*1000}]

}];
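
The correction computed by azelupdate can be sketched as a pure function. The Python below is an illustrative restatement, not the author's code: the function name, the returned tuple, and the constant name MUZZLE_VELOCITY are assumptions, while the numerical coefficients are copied from the listing. Note that ArcTan[x/Sqrt[1 - x^2]] is simply the arcsine of x.

```python
import math

MUZZLE_VELOCITY = 2840.0  # ft/s; the constant 2840 used in the listings


def lead_corrections(rhat, rhatd, azdhat, eldhat):
    """Azimuth and elevation corrections in radians, plus time of flight.

    rhat, rhatd    : range (ft) and range rate (ft/s) estimates
    azdhat, eldhat : azimuth and elevation rate estimates (rad/s)
    """
    elc = rhat * eldhat / MUZZLE_VELOCITY
    elcorr = -math.atan(elc / math.sqrt(1.0 - elc**2))   # same as -asin(elc)
    azc = rhat * azdhat / MUZZLE_VELOCITY
    azcorr = -math.atan(azc / math.sqrt(1.0 - azc**2))   # same as -asin(azc)
    # projectile speed along the line of sight, relative to the target
    vp = rhatd + MUZZLE_VELOCITY * math.cos(azcorr) * math.cos(elcorr)
    # empirical drag and gravity-drop fits, coefficients as in the listing
    wcda = 0.000005 * rhat**2 - 0.036938 * rhat + 700.429
    beta = 3.0e-09 * rhat**2 - 4.7e-05 * rhat + 1.0216
    rho = 0.076479
    c = 1.5 * rho / wcda
    tof = rhat / vp / (1.0 - c * rhat / 4.0) ** 2
    gravdrop = 16.085 * (beta * tof) ** 2 / rhat
    return azcorr, elcorr + gravdrop, tof
```

With zero angular rates the azimuth correction vanishes and the elevation correction reduces to the gravity-drop term alone.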


plottera[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

grapha = Show[ {plot1, plot2, Graphics[ { Text["azdhat",
         {l1[[50, 1]], l1[[50, 2]]}], Text["theo. azd",
         {l2[[60, 1]], l2[[60, 2]]}] } ]}, PlotLabel -> pl,
         AxesLabel -> {xl, yl}, PlotRange -> pr ]

}];

plottere[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

graphe = Show[ {plot1, plot2, Graphics[ { Text["eldhat",
         {l1[[50, 1]], l1[[50, 2]]}], Text["theo. eld",
         {l2[[60, 1]], l2[[60, 2]]}] } ]},
         PlotLabel -> pl, AxesLabel -> {xl, yl},
         PlotRange -> pr ]

}];

plotterr[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

graphr = Show[ {plot1, plot2, Graphics[ { Text["rhat",
         {l1[[50, 1]], l1[[50, 2]]}], Text["theo. rng",
         {l2[[60, 1]], l2[[60, 2]]}] } ]},
         PlotLabel -> pl, AxesLabel -> {xl, yl},
         PlotRange -> pr ]

}];

plotterd[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

graphrd = Show[ {plot1, plot2, Graphics[ { Text["rhatd",
          {l1[[50, 1]], l1[[50, 2]]}], Text["theo. rd",
          {l2[[60, 1]], l2[[60, 2]]}] } ]},
          PlotLabel -> pl, AxesLabel -> {xl, yl},
          PlotRange -> pr ]

}];


plotterca[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

graphca = Show[ {plot1, plot2, Graphics[ { Text["azcorr",
          {l1[[50, 1]], l1[[50, 2]]}], Text["theo. azcorr",
          {l2[[60, 1]], l2[[60, 2]]}] } ]},
          PlotLabel -> pl, AxesLabel -> {xl, yl},
          PlotRange -> pr ]

}];


plotterce[ l1_, l2_, pl_, xl_, yl_, pr_ ] := Block[ {
style1, style2, plot1, plot2 }, {

style1 = {Thickness[0.002], PointSize[0.002]};
style2 = {Dashing[{.01, .01}], Thickness[0.002],
         PointSize[0.002]};

plot1 := ListPlot[ l1, PlotJoined -> True, PlotStyle ->
         style1];

plot2 := ListPlot[ l2, PlotJoined -> True, PlotStyle ->
         style2];

graphce = Show[ {plot1, plot2,
          Graphics[ { Text["elcorr", {l1[[50, 1]],
          l1[[50, 2]]}], Text["theo. elcorr",
          {l2[[60, 1]], l2[[60, 2]]}] } ]},
          PlotLabel -> pl, AxesLabel -> {xl, yl},
          PlotRange -> pr ]

}];

rhatdata = {};
rhatddata = {};
azdhatdata = {};
eldhatdata = {};
azcorrdata = {};
elcorrdata = {};
rngnoise = <<rngnoise.dat;
aznoise = <<aznoise.dat;
elnoise = <<elnoise.dat;

data = Table[ {rngnoise[[i, 1]], aznoise[[i, 2]],
       elnoise[[i, 2]], rngnoise[[i, 2]]},
       {i, Length[rngnoise]} ];
rngnoise =.;
aznoise =.;
elnoise =.;
kpa = Input["KPA = ?"];
kpe = Input["KPE = ?"];
kpr = Input["KPR = ?"];
kvr = Input["KVR = ?"];
dtr = .22;

(* first sample

time = data[[i, 1]]
gxhat = data[[i, 2]]
gyhat = data[[i, 3]]
senrng = data[[i, 4]] *)

time = data[[1, 1]];
rhat = data[[1, 4]];
azdhat = data[[1, 2]]/1000;
eldhat = data[[1, 3]]/1000;

azdhatdata = AppendTo[ azdhatdata, {time, azdhat*1000} ];
eldhatdata = AppendTo[ eldhatdata, {time, eldhat*1000} ];
rhatdata = AppendTo[ rhatdata, {time, rhat} ];
rhatddata = AppendTo[ rhatddata, {time, rhatd} ];

azelupdate[ ];

(* second sample *)

time = data[[2, 1]];
rhat = .5*(data[[2, 4]] + rhat);
rhatd = (data[[2, 4]] - rhat)/dtr;
azdhat = (azdhat + data[[2, 2]]/1000)/2;
eldhat = (eldhat + data[[2, 3]]/1000)/2;

azdhatdata = AppendTo[ azdhatdata, {time, azdhat*1000} ];
eldhatdata = AppendTo[ eldhatdata, {time, eldhat*1000} ];
rhatdata = AppendTo[ rhatdata, {time, rhat} ];
rhatddata = AppendTo[ rhatddata, {time, rhatd} ];

azelupdate[ ];

(* third through fourth samples *)

Do[{

time = data[[i, 1]];
rhat = ((i - 1)*rhat + data[[i, 4]])/i;
rhatd = (data[[i, 4]] - data[[i - 1, 4]])/dtr;
azdhat = ((i - 1)*azdhat + data[[i, 2]]/1000)/i;
eldhat = ((i - 1)*eldhat + data[[i, 3]]/1000)/i;
azdhatdata = AppendTo[ azdhatdata, {time, azdhat*1000} ];
eldhatdata = AppendTo[ eldhatdata, {time, eldhat*1000} ];
rhatdata = AppendTo[ rhatdata, {time, rhat} ];
rhatddata = AppendTo[ rhatddata, {time, rhatd} ];
azelupdate[ ] },

{i, 3, 4}];


Do[{

time = data[[i, 1]];
c1 = 1 - kpr;
c2 = dtr*c1;
c3 = 1 - kvr*dtr;
term1 = 1 - kpe;
term2 = 1 - kpa;

rhat = c1*rhat + c2*rhatd + kpr*data[[i, 4]];
term3 = (eldhat^2 + azdhat^2)*dtr;

rhatd = rhat*(term3 - kvr) + rhatd*c3 + data[[i, 4]]*kvr;
eldhat = term1*(1 - 2*rhatd*dtr/rhat)*eldhat +
         kpe*data[[i, 3]]/1000;

azdhat = term2*(1 - 2*rhatd*dtr/rhat)*azdhat +
         kpa*data[[i, 2]]/1000;

azdhatdata = AppendTo[ azdhatdata, {time, azdhat*1000} ];
eldhatdata = AppendTo[ eldhatdata, {time, eldhat*1000} ];
rhatdata = AppendTo[ rhatdata, {time, rhat} ];
rhatddata = AppendTo[ rhatddata, {time, rhatd} ];
azelupdate[ ] },

{i, 5, Length[data]} ];

aztheo = <<aztheo.dat;
eltheo = <<eltheo.dat;
rngtheo = <<rngtheo.dat;
rngdtheo = <<rngdtheo.dat;
azcrtheo = <<azcrth.dat;
elcrtheo = <<elcrth.dat;
azcorrdata = Rest[azcorrdata];
elcorrdata = Rest[elcorrdata];

azcrdiff = Table[ {azcorrdata[[i, 1]], azcorrdata[[i, 2]] -
           azcrtheo[[i, 2]]}, {i, 1, Length[azcorrdata]} ];

elcrdiff = Table[ {elcorrdata[[i, 1]], elcorrdata[[i, 2]] -
           elcrtheo[[i, 2]]}, {i, 1, Length[elcorrdata]} ];

plotterca[azcorrdata, azcrtheo,
          "azcorr & theo. azcorr for scenario:",
          "t(sec)",
          "mrad",
          {50, 100}];

plotterda = ListPlot[azcrdiff, PlotJoined -> True,
            PlotStyle -> {Thickness[0.002], PointSize[0.002]},
            PlotRange -> {-5, 5},
            AxesLabel -> {"t(sec)", "mrad"},
            PlotLabel -> "difference between azcorr and theo. azcorr"];

plotterce[elcorrdata, elcrtheo,
          "elcorr & theo elcorr for scenario:",
          "t(sec)",
          "mrad/s",
          {10, 20}];

plotterde = ListPlot[elcrdiff, PlotJoined -> True,
            PlotStyle -> {Thickness[0.002], PointSize[0.002]},
            PlotRange -> {-5, 5},
            AxesLabel -> {"t(sec)", "mrad"},
            PlotLabel -> "difference between elcorr and theo. elcorr"];

plottera[azdhatdata, aztheo,
         "azdhat & theo. az. rate for scenario:",
         "t(sec)",
         "mrad/s",
         {-30, -30}];

plottere[eldhatdata, eltheo,
         "eldhat & theo el. rate for scenario:",
         "t(sec)",
         "mrad/s",
         {-12, 4}];

plotterr[rhatdata, rngtheo,
         "rdhat & theo range for scenario:",
         "t(sec)",
         "feet",
         {3000, 5000}];

plotterd[rhatddata, rngdtheo,
         "rhatd & theo range rate for scenario:",
         "t(sec)",
         "ft/s",
         {-200, 200}];
COMPUTER PROGRAM: KALMAN.BAS

PURPOSE: Filters noisy azimuth/elevation rate and range data
using adaptive Kalman filtering.

RESULTS: Filtered azimuth/elevation rates, range/range rate
and azimuth/elevation corrections

10 OPEN "I", #1, "B:NOISE1.GUN"
20 OPEN "O", #2, "B:AZDOT1.GUN"
30 OPEN "O", #3, "B:ELDOT1.GUN"
40 OPEN "O", #4, "B:AZCORR1N.GUN"
50 OPEN "O", #5, "B:ELCORR1N.GUN"
60 P11 = 10000
70 P22 = 300
80 PA = .02
90 PE = .003
100 RR = 10
110 RA = .006
120 RE = .01
130 QR = .7
140 QA = .006
150 QE = .0001
160 TIME = 0
170 DTR = .22
180 DTAE = .0275
190 I = 1
200 RHAT = SENRNG
210 RHATD = 0
220 AZDHAT = 0
230 ELDHAT = 0
240 INPUT #1, TIME, GXHAT, GYHAT, SENRNG
250 PBA = (1-2*DTAE*RHATD/RHAT)^2*PA+DTAE^2*QA
260 PBE = (1-2*DTAE*RHATD/RHAT)^2*PE+DTAE^2*QE
270 KPA = PBA/(PBA+RA)
280 KPE = PBE/(PBE+RE)
290 PA = (1-KPA)*PBA
300 PE = (1-KPE)*PBE
310 TERM1 = 1-KPE
320 TERM2 = 1-KPA
330 ELDHAT = TERM1*(1-2*RHATD*DTAE/RHAT)*ELDHAT+KPE*GYHAT/1000
340 AZDHAT = TERM2*(1-2*RHATD*DTAE/RHAT)*AZDHAT+KPA*GXHAT/1000
350 IF ((I-1) MOD 8) = 0 THEN GOSUB 560
360 ELCORR = RHAT*ELDHAT/2840
370 ELCORR = -ATN(ELCORR/SQR(1-ELCORR^2))
380 AZCORR = RHAT*AZDHAT/2840
390 AZCORR = -ATN(AZCORR/SQR(1-AZCORR^2))
400 VP = RHATD+2840*COS(ELCORR)*SIN(AZCORR)
410 WCDA = .000005*RHAT^2-.036938*RHAT+700.429
420 BETA = 3E-09*RHAT^2-4.706E-05*RHAT+1.0216
430 RHO = .076479
440 C = 1.5*RHO/WCDA
450 TOF = RHAT/VP/(1-C*RHAT/4)^2
460 ELGD = 16.085*(BETA*TOF)^2/RHAT
470 ELCORR = ELCORR+ELGD
480 PRINT TIME, AZDHAT*1000, ELDHAT*1000, AZCORR*1000, ELCORR*1000
490 PRINT #2, TIME, AZDHAT*1000
500 PRINT #3, TIME, ELDHAT*1000
510 PRINT #4, TIME, AZCORR*1000
520 PRINT #5, TIME, ELCORR*1000
530 I = I+1
540 GOTO 240
550 END
560 PB11 = P11+DTR*(P12+P21+P22*DTR)
570 PB12 = P12+DTR*(P22+(P11+DTR*P21)*OMEGA2)
580 PB21 = P21+DTR*(P22+(P11+DTR*P12)*OMEGA2)
590 PB22 = P22+QR*DTR^2+OMEGA2*DTR*(P21+P12+DTR*OMEGA2*P11)
600 DENOM = PB11+RR
610 KPR = PB11/DENOM
620 KVR = PB21/DENOM
630 P11 = PB11*RR/DENOM
640 P12 = PB12*RR/DENOM
650 P21 = PB21*RR/DENOM
660 P22 = PB22-PB21*PB12/DENOM
670 C1 = 1-KPR
680 C2 = DTR*C1
690 C3 = 1-KVR*DTR
700 TERM3 = OMEGA2*DTR
710 RHAT = C1*RHAT+C2*RHATD+KPR*SENRNG
720 RHATD = RHAT*(TERM3-KVR)+RHATD*C3+KVR*SENRNG
730 RETURN
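
The range-filter subroutine (lines 560 to 730) can be read as one function call per range sample. The Python sketch below restates those lines under stated assumptions: the function name, the tuple layout of the state and covariance, and the keyword defaults (taken from lines 100, 130 and 170) are illustrative, and omega2 is passed in by the caller as the sum of the squared azimuth and elevation rate estimates.

```python
def range_filter_step(state, cov, z, omega2, dtr=0.22, qr=0.7, rr=10.0):
    """One update of the range / range-rate filter (hypothetical helper).

    state  : (rhat, rhatd) current range and range-rate estimates
    cov    : (p11, p12, p21, p22) a posteriori error covariances
    z      : measured range (SENRNG in the BASIC listing)
    omega2 : squared line-of-sight rate, azdhat**2 + eldhat**2
    Returns (new_state, new_cov, (kpr, kvr)).
    """
    rhat, rhatd = state
    p11, p12, p21, p22 = cov

    # a priori error covariances (lines 560-590)
    pb11 = p11 + dtr * (p12 + p21 + p22 * dtr)
    pb12 = p12 + dtr * (p22 + (p11 + dtr * p21) * omega2)
    pb21 = p21 + dtr * (p22 + (p11 + dtr * p12) * omega2)
    pb22 = p22 + qr * dtr**2 + omega2 * dtr * (p21 + p12 + dtr * omega2 * p11)

    # filter gains (lines 600-620)
    denom = pb11 + rr
    kpr = pb11 / denom
    kvr = pb21 / denom

    # a posteriori error covariances (lines 630-660)
    new_cov = (pb11 * rr / denom,
               pb12 * rr / denom,
               pb21 * rr / denom,
               pb22 - pb21 * pb12 / denom)

    # state update (lines 670-720); the rate update uses the new range
    c1 = 1.0 - kpr
    c2 = dtr * c1
    c3 = 1.0 - kvr * dtr
    rhat_new = c1 * rhat + c2 * rhatd + kpr * z
    rhatd_new = rhat_new * (omega2 * dtr - kvr) + rhatd * c3 + kvr * z
    return (rhat_new, rhatd_new), new_cov, (kpr, kvr)
```

If the measurement equals the current estimate and the line-of-sight rate is zero, the estimate is left unchanged while the covariance shrinks, which is a quick consistency check on the transcription.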
ABSTRACT


A  new  sufficient  condition  for  the  robust  stability  of  a  linear  time 
invariant  continuous  time  system  subject  to  linear  time  varying 
structured  state  space  uncertainty  is  presented.  A  robust 
eigenstructure  assignment  design  is  computed  for  the  Extended  Medium 
Range  Air  to  Air  Missile.  The  new  design  method  uses  the  MATLAB™ 
Optimization  Toolbox  to  minimize  the  integral  of  the  roll  rate  with 
constraints on the real part of the dutch roll and roll mode
eigenvalues,  the  damping  ratios  of  the  dutch  roll  and  roll  mode,  the 
aileron  and  rudder  deflection  rates,  and  the  sufficient  condition  for 
robust  stability. 

Next,  we  extend  eigenstructure  assignment  to  linear  time  invariant 
plants  which  are  represented  by  the  so-called  unified  delta  model  which 
is  valid  both  for  continuous  time  and  sampled  data  operation  of  the 
plant.  We  propose  a  sufficient  condition  for  the  robust  stability  of  a 
linear  time  invariant  unified  delta  plant  subject  to  linear  time 
invariant  structured  state  space  uncertainty.  A  robust  sampled  data 
design  is  computed  for  the  Extended  Medium  Range  Air  to  Air  Missile  by 
minimizing  the  integral  of  the  roll  rate  with  constraints  on  selected 
eigenvalues,  actuator  deflection  rates,  and  the  sufficient  condition 
for  robust  stability. 

Finally, we extend the robust stability result proposed by Sobel
et al. (1989) and the new continuous time result described in this
report to continuous time systems with a time delay.
1.  INTRODUCTION


Recently,  Sobel  and  Cloutier  (1991)  applied  eigenstructure 
assignment  to  the  design  of  an  autopilot  for  the  Extended  Medium  Range 
Air  to  Air  Technology  (EMRAAT)  missile.  An  important  difference  between 
this  application  and  other  eigenstructure  assignment  applications  which 
have appeared in the literature is that the lateral dynamics of the
EMRAAT missile do not have a well defined dutch roll mode. Therefore,
eigenstructure assignment is utilized not only for mode decoupling, but
also to create distinctly separate dutch roll and roll modes. Sobel and
Cloutier (1991) use the approach suggested by Andry et al. (1983) in
which the i-th desired eigenvector $v_i^d$ is chosen for mode decoupling.
Then, the i-th achievable eigenvector $v_i^a$ is chosen as the projection of
$v_i^d$ onto the so-called achievability subspace. Sobel and Cloutier (1991)
show  that  their  design  achieves  improved  decoupling  between  an  initial 
sideslip  angle  and  the  integrated  roll  rate  (which  is  approximately 
equal  to  the  bank  angle)  when  compared  to  a  linear  quadratic  regulator 
design  proposed  by  Bossi  and  Langehough  (1988).  However,  the  design  of 
Sobel  and  Cloutier  (1991)  does  not  consider  that  the  missile’s 
aerodynamic  parameters  are  uncertain. 

A  sufficient  condition  for  the  robust  stability  of  a  linear  time 
invariant  system  subject  to  linear  time  varying  structured  state  space 
uncertainty has been proposed by Sobel et al. (1989). This result,
which  is  based  upon  the  Gronwall  lemma,  ensures  robust  stability  if  the 
nominal  eigenvalues  lie  to  the  left  of  a  vertical  line  in  the  complex 
plane.  This  line  is  determined  by  the  maximum  eigenvalue  of  a  matrix 
which involves the product of the uncertainty structure, the nominal
closed  loop  modal  matrix,  and  the  inverse  of  the  nominal  closed  loop 
modal  matrix. 

In  this  report,  we  present  a  new  sufficient  condition  for  the  robust 
stability  of  a  linear  time  invariant  system  subject  to  linear  time 
varying  structured  state  space  uncertainty.  This  new  robustness 
condition  is  a  sum  of  terms  each  of  which  involves  the  i-th  right 
eigenvector,  the  i-th  left  eigenvector,  and  the  real  part  of  the  i-th 
eigenvalue.  We  also  show  that  this  new  robustness  condition  is  less 
conservative than the earlier result of Sobel et al. (1989).

We  design  a  robust  controller  for  the  lateral  dynamics  of  the  EMRAAT 
missile  and  this  new  design  is  compared  to  the  earlier  orthogonal 
projection  eigenstructure  assignment  design.  The  earlier  design,  which 
was  proposed  by  Sobel  and  Cloutier  (1991),  does  not  satisfy  either 
sufficient  condition  for  robust  stability.  The  new  design  method 
proposed  in  this  paper  uses  the  MATLAB™  Optimization  Toolbox  (Grace 
1990)  to  minimize  the  integrated  roll  rate  with  constraints  on  the  real 
part  of  the  dutch  roll  and  roll  modes,  the  damping  ratios  of  the  dutch 
roll  and  roll  modes,  the  aileron  and  rudder  deflection  rates,  and  the 
new  sufficient  condition  for  robust  stability.  This  design  satisfies 
the  new  robustness  condition  while  also  yielding  an  improved  transient 
response  as  compared  to  the  design  of  Sobel  and  Cloutier  (1991). 

We  extend  eigenstructure  assignment  to  linear  time  invariant  plants 
which  are  represented  by  the  so-called  unified  delta  model  which  is 
valid  both  for  continuous  time  and  sampled  data  operation  of  the  plant. 
We show that the eigenvectors of the delta model are identical to the
eigenvectors  of  the  continuous  time  plant  and  an  expression  is  derived 
for  the  eigenstructure  assignment  feedback  gain  matrix  for  the  delta 
model.  We  show  that  in  the  limit  as  the  sampling  period  delta  goes  to 
zero,  the  delta  feedback  gain  approaches  the  continuous  time  feedback 
gain.  We  propose  a  sufficient  condition  for  the  robust  stability  of  a 
linear  time  invariant  unified  delta  plant  subject  to  linear  time 
invariant structured state space uncertainty. This yields a new unified
robustness  condition  which  is  applicable  to  both  continuous  time  and 
sampled  data  operation.  A  robust  sampled  data  design  is  computed  for 
the  Extended  Medium  Range  Air  to  Air  Missile  by  minimizing  the  integral 
of  the  roll  rate  with  constraints  on  selected  eigenvalues,  actuator 
deflection  rates,  and  the  sufficient  condition  for  robust  stability. 
The  robust  design  is  compared  with  an  orthogonal  projection 
eigenstructure  assignment  design. 

Finally, we extend the robust stability results proposed by Sobel
et al. (1989) and the new results described in this report to
continuous time systems with a time delay. We prove new sufficient
conditions for the robust stability of a linear time invariant
continuous time system with a time delay which is subject to linear
time varying structured state space uncertainty.
2.  ROBUST EIGENSTRUCTURE ASSIGNMENT FOR CONTINUOUS TIME SYSTEMS


2.1  Problem Formulation

Consider  a  nominal  linear  time-invariant  multi-input  multi-output 
system  described  by 


\[ \dot{x}(t) = A x(t) + B u(t) \tag{1a} \]

\[ y(t) = C x(t) \tag{1b} \]

where $x \in \mathbb{R}^n$ is the state vector, $u \in \mathbb{R}^m$ is the input vector,
$y \in \mathbb{R}^r$ is the output vector, and A, B, C are constant matrices.

Suppose that the nominal system is subject to linear time-varying
uncertainties in the entries of A, B described by $\Delta A(t)$ and $\Delta B(t)$,
respectively. We shall assume that the entries of $\Delta A(t)$ and $\Delta B(t)$ are
uniformly continuous for $t \in (-\infty, \infty)$. Then, the system with uncertainty
is given by

\[ \dot{x}(t) = A x(t) + B u(t) + \Delta A(t) x(t) + \Delta B(t) u(t) \tag{2a} \]

\[ y(t) = C x(t) \tag{2b} \]

Further, suppose that bounds are available on the absolute values of
the maximum variations in the elements of $\Delta A(t)$ and $\Delta B(t)$. That is,

\[ |\Delta a_{ij}(t)| \le (a_{ij})_{\max}; \quad i = 1, \ldots, n; \; j = 1, \ldots, n \tag{3a} \]

\[ |\Delta b_{ij}(t)| \le (b_{ij})_{\max}; \quad i = 1, \ldots, n; \; j = 1, \ldots, m \tag{3b} \]

Define $\Delta A^+(t)$ and $\Delta B^+(t)$ as the matrices obtained by replacing the
entries of $\Delta A(t)$ and $\Delta B(t)$ by their absolute values. Also, define
$A_{\max}$ and $B_{\max}$ as the matrices with entries $(a_{ij})_{\max}$ and
$(b_{ij})_{\max}$, respectively. Then,

\[ \{ \Delta A(t) : \Delta A^+(t) \le A_{\max} \} \tag{4a} \]

and

\[ \{ \Delta B(t) : \Delta B^+(t) \le B_{\max} \} \tag{4b} \]

where "$\le$" is applied element by element to matrices and
$A_{\max} \in \mathbb{R}_+^{n \times n}$, $B_{\max} \in \mathbb{R}_+^{n \times m}$, where $\mathbb{R}_+$ is the set of
non-negative numbers.

Consider the constant gain output feedback control law described
by

\[ u(t) = F y(t) \tag{5} \]

Then, the nominal closed loop system is given by

\[ \dot{x}(t) = (A + BFC) x(t) \tag{6} \]

and the uncertain closed loop system is given by

\[ \dot{x}(t) = (A + BFC) x(t) + [\Delta A(t) + \Delta B(t) FC] x(t) \tag{7} \]

Finally, the stability robustness problem can be stated as follows:
Given a feedback gain matrix $F \in \mathbb{R}^{m \times r}$ such that the nominal closed loop
system exhibits desirable dynamic performance, determine if the
uncertain closed loop system is asymptotically stable for all $\Delta A(t)$ and
$\Delta B(t)$ described by Eq.(4).

2.2  Robustness Results

The solution proposed by Sobel et al. (1989) for the stability
robustness problem is described by the following theorem.

Theorem 1: Suppose that F is such that the nominal closed loop system
described by Eq.(6) is asymptotically stable with a non-defective modal
matrix. Then, the uncertain closed loop system given by Eq.(7) is
asymptotically stable for all $\Delta A(t)$ and $\Delta B(t)$ described by Eq.(4) if

\[ \alpha > \lambda_{\max}\left[ (M^{-1})^+ \left[ A_{\max} + B_{\max} (FC)^+ \right] M^+ \right] \tag{8} \]

where

\[ \alpha = -\max_i \operatorname{Re}[\lambda_i(A + BFC)] \]

and where M is a modal matrix of (A+BFC) and $\lambda_{\max}(\cdot)$ of a non-negative
matrix denotes the real non-negative eigenvalue $\lambda_{\max} \ge 0$ such that
$\lambda_{\max} \ge |\lambda_i|$ for all eigenvalues $\lambda_i$.

The  above  theorem  describes  a  sufficient  condition  for  the  robust 
stability  in  terms  of  the  eigenstructure  of  the  nominal  closed  loop 
system.  Robust  stability  is  ensured  provided  that  the  nominal  closed 
loop  eigenvalues  lie  to  the  left  of  a  vertical  line  in  the  complex 
plane  which  is  determined  by  a  norm  involving  the  structure  of  the 
uncertainty  and  the  nominal  closed  loop  modal  matrix. 

We remark that the sufficient condition given by Eq.(8) can be
rewritten as shown below.

\[ \frac{1}{\alpha} \, \lambda_{\max}\left[ (M^{-1})^+ \left[ A_{\max} + B_{\max} (FC)^+ \right] M^+ \right] < 1 \tag{9} \]

This  alternate  form  of  the  stability  robustness  condition  will  be 
useful  later  when  we  compare  it  with  the  new  robustness  condition  which 
is  derived  in  this  report. 
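
As a numerical illustration, the condition of Eq.(8) can be evaluated for a given design. The sketch below uses NumPy; the function name and the small two-state example are assumptions made for illustration only and are not data from this report. For a non-negative matrix the spectral radius coincides with the real non-negative eigenvalue $\lambda_{\max}$ used in the theorem.

```python
import numpy as np


def sobel_1989_margin(A, B, F, C, A_max, B_max):
    """Return (alpha, bound); Eq.(8) holds when alpha > bound."""
    Ac = A + B @ F @ C                        # nominal closed loop matrix
    eigvals, M = np.linalg.eig(Ac)            # columns of M are right eigenvectors
    alpha = -np.max(eigvals.real)             # distance of the slowest mode from the axis
    E = A_max + B_max @ np.abs(F @ C)         # A_max + B_max (FC)^+
    P = np.abs(np.linalg.inv(M)) @ E @ np.abs(M)
    bound = np.max(np.abs(np.linalg.eigvals(P)))   # spectral radius of a non-negative matrix
    return alpha, bound


if __name__ == "__main__":
    # Hypothetical example: decoupled stable modes, uncertainty only in the coupling terms.
    A = np.array([[-1.0, 0.0], [0.0, -2.0]])
    B = np.array([[0.0], [1.0]])
    C = np.eye(2)
    F = np.zeros((1, 2))
    A_max = np.array([[0.0, 0.5], [0.5, 0.0]])
    B_max = np.zeros((2, 1))
    alpha, bound = sobel_1989_margin(A, B, F, C, A_max, B_max)
    print(alpha, bound, alpha > bound)
```

In this example the modal matrix is the identity, so the bound reduces to the spectral radius of $A_{\max}$ itself.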

We now present a new sufficient condition for robust stability of a
linear time invariant system subject to linear time varying structured
state space uncertainty which is described by the following theorem.

Theorem 2: Suppose that F is such that the nominal closed loop system
described by Eq.(6) is asymptotically stable with a non-defective modal
matrix. Then, the uncertain closed loop system given by Eq.(7) is
asymptotically stable for all $\Delta A(t)$ and $\Delta B(t)$ described by Eq.(4) if

\[ \lambda_{\max}\left[ \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left[ A_{\max} + B_{\max} (FC)^+ \right] \right] < 1 \tag{10} \]

where

\[ \alpha_i = -\operatorname{Re}(\lambda_i(A + BFC)) \]

and where $\lambda_i$ is the i-th eigenvalue of (A+BFC) with $v_i$ and $w_i$ the
corresponding right and left eigenvectors, respectively; and where
$(\cdot)^*$ denotes the complex conjugate transpose.
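
The left-hand side of the new condition can also be evaluated numerically. The following NumPy sketch is an illustration under stated assumptions: the function name and the example matrices are hypothetical, and the left eigenvectors are taken as the rows of the inverse modal matrix, so that $w_i^* v_i = 1$.

```python
import numpy as np


def new_robustness_measure(A, B, F, C, A_max, B_max):
    """Spectral radius of sum_i (v_i w_i^*)^+ / alpha_i times [A_max + B_max (FC)^+]."""
    Ac = A + B @ F @ C
    eigvals, M = np.linalg.eig(Ac)        # column i of M is v_i
    W = np.linalg.inv(M)                  # row i of W is w_i^*
    E = A_max + B_max @ np.abs(F @ C)
    n = Ac.shape[0]
    S = np.zeros((n, n))
    for i in range(n):
        alpha_i = -eigvals[i].real
        if alpha_i <= 0.0:
            raise ValueError("nominal closed loop system must be asymptotically stable")
        S += np.abs(np.outer(M[:, i], W[i, :])) / alpha_i
    return np.max(np.abs(np.linalg.eigvals(S @ E)))


if __name__ == "__main__":
    # Hypothetical two-state example, not data from this report.
    A = np.array([[-1.0, 0.0], [0.0, -2.0]])
    B = np.array([[0.0], [1.0]])
    C = np.eye(2)
    F = np.zeros((1, 2))
    A_max = np.array([[0.0, 0.5], [0.5, 0.0]])
    B_max = np.zeros((2, 1))
    print(new_robustness_measure(A, B, F, C, A_max, B_max))  # about 0.354
```

For this example the measure is $\sqrt{0.125} \approx 0.354$, whereas the quantity on the left of Eq.(9) evaluates to 0.5 for the same data, which is consistent with the claim that the new condition is less conservative.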

Proof: The uncertain closed loop plant may be written as

\[ \dot{x}(t) = A_c x(t) + \Delta A_c(t) x(t) \tag{11} \]

where

\[ A_c = A + BFC \]

and

\[ \Delta A_c(t) = \Delta A(t) + \Delta B(t) FC \]

which has a solution given by

\[ x(t) = \exp(A_c t) x(0) + \int_0^t \exp[A_c (t-\tau)] \, \Delta A_c(\tau) x(\tau) \, d\tau \tag{12} \]

Next, use the real, positive, diagonal transformation

\[ x(t) = D^{-1} z(t) \tag{13} \]

and the property that

\[ \exp(D A_c D^{-1} t) = D \exp(A_c t) D^{-1} \tag{14} \]

and

\[ \exp(A_c t) = M \exp(\Lambda t) M^{-1} = \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} \tag{15} \]

to obtain

\[ z(t) = D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} D^{-1} z(0) + \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i (t-\tau)} \Delta A_c(\tau) D^{-1} z(\tau) \, d\tau \tag{16} \]

where M is a modal matrix of $A_c$; $\lambda_i$ is the i-th eigenvalue of $A_c$ with
$v_i$ and $w_i$ the corresponding right and left eigenvectors, respectively;
$\Lambda$ is a diagonal matrix with the $\lambda_i$ on the diagonal; and $(\cdot)^*$ denotes
complex conjugate transpose.


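Note that

\[ \|z(t)\| \to 0 \ \text{implies that} \ \|x(t)\| \to 0 \tag{17} \]

Next, apply the absolute value operator, denoted by $(\cdot)^+$, to both sides
of Eq.(16) where "+" and "$\le$" are applied element by element to vectors
and matrices.

\[ z^+(t) \le \left[ D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} D^{-1} z(0) \right]^+ + \left[ \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i (t-\tau)} \Delta A_c(\tau) D^{-1} z(\tau) \, d\tau \right]^+ \tag{18} \]

\[ \le D \sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i t} D^{-1} z^+(0) + \int_0^t \left[ D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i (t-\tau)} \Delta A_c(\tau) D^{-1} z(\tau) \right]^+ d\tau \tag{19} \]

\[ \le D \sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i t} D^{-1} z^+(0) + \int_0^t D \sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i (t-\tau)} A_{c\max} D^{-1} z^+(\tau) \, d\tau \tag{20} \]

where $\alpha_i = -\operatorname{Re}(\lambda_i)$, $A_{c\max} = A_{\max} + B_{\max} (FC)^+$ and where we have used the
property that $[\exp(\lambda_i t)]^+ = \exp(-\alpha_i t)$.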

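Next, integrate both sides of Eq.(20) to obtain

$$\int_0^\infty z^+(t)\,dt \le D\sum_{i=1}^{n} (v_i w_i^*)^+ \int_0^\infty e^{-\alpha_i t}\,dt\; D^{-1}z^+(0) + \int_{t=0}^{\infty}\int_{\tau=0}^{t} D\sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i(t-\tau)} A_{c\max} D^{-1}z^+(\tau)\,d\tau\,dt \tag{21}$$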
Consider the double integral in Eq.(21) which may be written as

$$\lim_{R\to\infty} \left[ \int_{t=0}^{R}\int_{\tau=0}^{t} D\sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i(t-\tau)} A_{c\max} D^{-1}z^+(\tau)\,d\tau\,dt \right];\quad 0 \le \tau < t \tag{22}$$

The order of integration in Eq.(22) may be interchanged because all of the functions inside the integrals are continuous functions of $t$ and $\tau$ (Churchill 1972). Thus, Eq.(22) is equal to

$$\lim_{R\to\infty} \left[ \int_{\tau=0}^{R}\int_{t=0}^{R} D\sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i(t-\tau)} A_{c\max} D^{-1}z^+(\tau)\,dt\,d\tau \right];\quad 0 \le \tau < t \tag{23}$$

Now use the change of variables given by $\gamma = t-\tau$, $d\gamma = dt$, $\gamma > 0$. Then, Eq.(23) is equal to

$$\lim_{R\to\infty} \left[ \int_{\tau=0}^{R}\int_{\gamma=-\tau}^{R-\tau} D\sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i\gamma} A_{c\max} D^{-1}z^+(\tau)\,d\gamma\,d\tau \right];\quad 0 \le \tau < t,\ \gamma > 0 \tag{24}$$

$$\le \lim_{R\to\infty} \left[ \int_{\tau=0}^{R}\int_{\gamma=0}^{R} D\sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i\gamma} A_{c\max} D^{-1}z^+(\tau)\,d\gamma\,d\tau \right];\quad \alpha_i > 0 \tag{25}$$

$$= \lim_{R\to\infty} \left[ D\sum_{i=1}^{n} (v_i w_i^*)^+ \int_{\gamma=0}^{R} e^{-\alpha_i\gamma}\,d\gamma\; A_{c\max} D^{-1} \int_{\tau=0}^{R} z^+(\tau)\,d\tau \right];\quad \alpha_i > 0 \tag{26}$$

$$= \lim_{R\to\infty} \left[ -D\sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left( e^{-\alpha_i R} - 1 \right) A_{c\max} D^{-1} \right] \int_0^\infty z^+(\tau)\,d\tau;\quad \alpha_i > 0 \tag{27}$$


Evaluate the limit of the term outside the integral in Eq.(27), note that $\tau$ is now a dummy variable of integration, recall that the $\alpha_i$'s are positive, and substitute the result into Eq.(21) to obtain

$$\int_0^\infty z^+(t)\,dt \le D\sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} D^{-1}z^+(0) + D\sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} A_{c\max} D^{-1} \int_0^\infty z^+(t)\,dt \tag{28}$$
Take norms in Eq.(28) and rearrange to obtain

$$\left\| \int_0^\infty z^+(t)\,dt \right\| \le \frac{\left\| D\sum_{i=1}^{n} \dfrac{(v_i w_i^*)^+}{\alpha_i} D^{-1}z^+(0) \right\|}{1 - \left\| D\sum_{i=1}^{n} \dfrac{(v_i w_i^*)^+}{\alpha_i} A_{c\max} D^{-1} \right\|} \tag{29}$$

Thus, $\left\| \int_0^\infty z^+(t)\,dt \right\| < \infty$, which implies that $\int_0^\infty \|z^+(t)\|\,dt < \infty$, if

$$\left\| D\sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left[ A_{\max} + B_{\max}(FC)^+ \right] D^{-1} \right\| < 1 \tag{30}$$


Note that $x(t)$ and $z(t)$ are uniformly continuous on $(0,\infty)$ because of the linearity of the uncertain closed loop plant, which together with Eq.(30) implies that $\|z^+(t)\| \to 0$ as $t \to \infty$ (Hsu and Meyer 1968). This implies that $\|x(t)\| \to 0$ as $t \to \infty$, which proves that the linear uncertain closed loop plant is asymptotically stable.


Finally, Perron weightings may be used for the matrix $D$ in Eq.(30) in the same manner as shown by Sobel et al. (1989) for an earlier robustness result. Thus, to reduce conservatism, Eq.(30) may be replaced by

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left[ A_{\max} + B_{\max}(FC)^+ \right] \right) < 1 \tag{31}$$

where $\lambda_{\max}(\cdot)$ of a non-negative matrix denotes the real non-negative eigenvalue $\lambda_{\max} \ge 0$ such that $\lambda_{\max} \ge |\lambda_i|$ for all eigenvalues $\lambda_i$.

Q.E.D.
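As a quick sanity check on the condition of Eq.(31), consider the scalar case $n = 1$. This is an illustration added here, not part of the original derivation; $a_{c\max}$ denotes the scalar value of $A_{c\max}$. With $v_1 = w_1 = 1$ and a nominal closed loop pole at $-\alpha_1$, $\alpha_1 > 0$, the condition reduces to

```latex
\[
\lambda_{\max}\!\left( \frac{(v_1 w_1^*)^+}{\alpha_1}\, a_{c\max} \right)
  = \frac{a_{c\max}}{\alpha_1} < 1
\quad\Longleftrightarrow\quad
a_{c\max} < \alpha_1 .
\]
```

That is, the perturbation bound must be smaller than the distance of the nominal pole from the imaginary axis, which is what one expects for $\dot{x} = (-\alpha_1 + \Delta a_c(t))\,x$.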
We remark that Golub and Van Loan (1983) define a condition number for the i-th eigenvalue in terms of the corresponding left and right eigenvectors $w_i$ and $v_i$. This condition number is a measure of the sensitivity of the i-th eigenvalue to incremental perturbations in the matrix $A+BFC$. The quantity $v_i w_i^*$, which appears in Eq.(10), is weakly related to this eigenvalue condition number. Thus, the new stability robustness condition of Eq.(10) seems to have the heuristic interpretation that robustness can be achieved by placing the sensitive eigenvalues far into the left half complex plane and by making the eigenvalues which are near the imaginary axis insensitive. Such an interpretation was not possible for the result of Sobel et al. (1989) because their stability robustness condition, which is given by Eq.(9), is in terms of the spectrum as a whole rather than the individual eigenvalues.

The new stability robustness condition of Eq.(10) is less conservative than the earlier condition of Sobel et al. (1989). This property is described by the following theorem.

Theorem 3: The left hand side of Eq.(10) is less than or equal to the left hand side of Eq.(9). Mathematically,

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left[ A_{\max} + B_{\max}(FC)^+ \right] \right) \le \left( \frac{1}{\alpha} \right) \lambda_{\max}\left( (M^{-1})^+ \left[ A_{\max} + B_{\max}(FC)^+ \right] M^+ \right) \tag{32}$$


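Proof: We bound the matrix on the left hand side of Eq.(32) element by element and recall the definition $A_{c\max} = A_{\max} + B_{\max}(FC)^+$ to obtain

$$\sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} A_{c\max} \le \sum_{i=1}^{n} \frac{v_i^+ (w_i^*)^+}{\alpha_i} A_{c\max}$$

$$\le \left( \frac{1}{\alpha} \right) \sum_{i=1}^{n} v_i^+ (w_i^*)^+ A_{c\max}$$

$$= \left( \frac{1}{\alpha} \right) \begin{bmatrix} v_1^+ & \cdots & v_n^+ \end{bmatrix} \begin{bmatrix} (w_1^*)^+ \\ \vdots \\ (w_n^*)^+ \end{bmatrix} A_{c\max}$$

$$= \left( \frac{1}{\alpha} \right) M^+ (M^{-1})^+ A_{c\max} \tag{36}$$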


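Now, use the result that if $B^+ \le A^+$, then

$$\lambda_{\max}(B^+) = \|B^+\|_{2D_{B^+}} \le \|B^+\|_{2D_{A^+}} \le \|A^+\|_{2D_{A^+}} = \lambda_{\max}(A^+) \tag{37}$$

where $\|\cdot\|_{2D_{B^+}}$ and $\|\cdot\|_{2D_{A^+}}$ are D-weighted two norms with Perron weights for $B^+$ and $A^+$, respectively.

Then, applying Eq.(37) to Eq.(36) yields

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} A_{c\max} \right) \le \lambda_{\max}\left( \left( \frac{1}{\alpha} \right) M^+ (M^{-1})^+ A_{c\max} \right) \tag{38}$$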


Finally, use the result (Ogata 1987) that if $A$ and $B$ are square matrices, then

$$\lambda_i(AB) = \lambda_i(BA) \tag{39}$$

to obtain

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} A_{c\max} \right) \le \lambda_{\max}\left( \left( \frac{1}{\alpha} \right) (M^{-1})^+ A_{c\max} M^+ \right) \tag{40}$$

Q.E.D.
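The inequality of Theorem 3 can be checked numerically on a small example. The following is a minimal sketch in pure Python, assuming a hypothetical 2x2 closed loop matrix with real eigenvalues -1 and -2 whose modal matrix is chosen by hand; the helper names (`matmul`, `mabs`, `perron_root`, `new_lhs`, `old_lhs`) and all numbers are illustrative and do not come from the report. The Perron eigenvalue is approximated by power iteration, which is adequate for this example but is not a general-purpose eigenvalue solver.

```python
# Illustrative 2x2 check of Theorem 3 (all numbers are hypothetical).

def matmul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def mabs(P):
    """Element-by-element absolute value, i.e. the (.)^+ operator."""
    return [[abs(x) for x in row] for row in P]

def perron_root(P, iters=500):
    """Power iteration estimate of the largest eigenvalue of a non-negative matrix."""
    x = [1.0] * len(P)
    lam = 0.0
    for _ in range(iters):
        y = [sum(P[i][j] * x[j] for j in range(len(x))) for i in range(len(P))]
        lam = max(y)
        if lam == 0.0:
            return 0.0
        x = [yi / lam for yi in y]
    return lam

# Assumed modal data: A_c = M diag(eigs) M^{-1}
M = [[1.0, 1.0], [0.0, 1.0]]        # columns are right eigenvectors v_i
M_inv = [[1.0, -1.0], [0.0, 1.0]]   # rows are left eigenvectors w_i^*
eigs = [-1.0, -2.0]
alphas = [-e for e in eigs]         # alpha_i = -Re(lambda_i)
A_cmax = [[0.1, 0.1], [0.1, 0.1]]   # assumed bound A_max + B_max (FC)^+

def new_lhs():
    """Left hand side of Eq.(10): lambda_max( sum_i (v_i w_i^*)^+ / alpha_i * A_cmax )."""
    n = len(M)
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for r in range(n):
            for c in range(n):
                S[r][c] += abs(M[r][i] * M_inv[i][c]) / alphas[i]
    return perron_root(matmul(S, A_cmax))

def old_lhs():
    """Right hand side of Eq.(32): (1/alpha) lambda_max( (M^-1)^+ A_cmax M^+ )."""
    alpha = min(alphas)
    P = matmul(matmul(mabs(M_inv), A_cmax), mabs(M))
    return perron_root(P) / alpha

print(new_lhs(), old_lhs())
```

For this example the eigenvalue-by-eigenvalue weighting gives the smaller value, as Theorem 3 predicts.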


2.1 Application to the EMRAAT Missile

Consider the Extended Medium Range Air to Air Technology (EMRAAT) bank-to-turn missile which is described by Bossi and Langehough (1988). A sixth order model of the yaw/roll dynamics at a 10 degree angle of attack is considered with state vector, control vector, and measurement vector given by $x = [\beta, r, p, p_I, \delta_r, \delta_a]^T$, $u = [\delta_{rc}, \delta_{ac}]^T$, and $y = [\beta, r, p, p_I]^T$, respectively. Here $\beta$ is the sideslip angle (deg), $r$ is yaw rate (deg/sec), $p$ is roll rate (deg/sec), $p_I$ is integrated roll rate (deg), $\delta_r$ is rudder deflection (deg), and $\delta_a$ is aileron deflection (deg). The state space matrices $A$, $B$, and $C$ are shown below:

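$$A = \begin{bmatrix}
-.5007 & 9945 & .1736 & 0 & .109 & .00691 \\
16.83 & 5748 & .01233 & 0 & -132.8 & 27.19 \\
-322.7 & 3208 & -2.099 & 0 & -1620.0 & -1240.0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & -179 & 0 \\
0 & 0 & 0 & 0 & 0 & -179
\end{bmatrix}$$

$$B = \begin{bmatrix}
0 & 0 \\
0 & 0 \\
0 & 0 \\
0 & 0 \\
179 & 0 \\
0 & 179
\end{bmatrix}$$

$$C = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{bmatrix}$$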
Sobel and Cloutier (1991) use eigenstructure assignment with an orthogonal projection. The desired dutch roll and roll mode eigenvalues are achieved exactly because four measurements are available for feedback. The desired dutch roll eigenvectors are chosen to yield a complex mode which is composed of sideslip angle and yaw rate with no coupling to roll rate and integrated roll rate. The desired roll mode eigenvectors are chosen to yield a complex mode which is composed of roll rate and integrated roll rate with no coupling to sideslip angle and yaw rate. Then, the achievable eigenvectors are computed by using the orthogonal projection of the i-th desired eigenvector $v_i^d$ onto the subspace which is spanned by the columns of $(\lambda_i I - A)^{-1}B$. The closed loop eigenvalues and the feedback gain matrix are shown in Table 1.
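The orthogonal projection described above can be written compactly. The following is a sketch under the assumption that $L_i = (\lambda_i I - A)^{-1}B$ has full column rank and that the standard least-squares projector is intended; the symbols $v_i^d$ (desired eigenvector) and $v_i^a$ (achievable eigenvector) are notation used here for illustration.

```latex
\[
v_i^a = L_i \left( L_i^* L_i \right)^{-1} L_i^* \, v_i^d ,
\qquad
L_i = (\lambda_i I - A)^{-1} B .
\]
```

By construction $v_i^a$ lies in the column space of $L_i$ and is the closest vector in that subspace to $v_i^d$ in the Euclidean norm.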
TABLE 1. COMPARISON OF EMRAAT DESIGNS

Eigenstructure Assignment Design of Sobel and Cloutier (1991)

  Closed Loop Eigenvalues
    $\lambda_{dr}$   = -23.98 ± j17.99
    $\lambda_{roll}$ = -10.01 ± j9.98
    $\lambda_{act}$  = -132.1
    $\lambda_{act}$  = -161.1

  Feedback Gain Matrix
                     $\beta$     $r$       $p$        $p_I$
    $\delta_{rc}$    -4.19      .233      .00374     .731
    $\delta_{ac}$     2.89     -.290      .00631    -.812

Robust Design

  Closed Loop Eigenvalues
    $\lambda_{dr}$   = -14.94 ± j16.68
    $\lambda_{roll}$ = -40.55 ± j92.90
    $\lambda_{act}$  = -150.3
    $\lambda_{act}$  = -99.3

  Feedback Gain Matrix
                     $\beta$     $r$       $p$        $p_I$
    $\delta_{rc}$    -2.82      .162      .0127      1.18
    $\delta_{ac}$     .770     -.180      .062       3.14

Eigenvalues are computed by using feedback gains which are rounded to three significant digits.


In this report, we propose a new robust design which minimizes the integrated roll rate (which is approximately the bank angle) subject to constraints on the real part of the dutch roll and roll modes, the damping ratios of the dutch roll and roll modes, the aileron and rudder deflection rates, and the new sufficient condition for robust stability. Mathematically, the objective function to be minimized is given by

$$J = \sum_{k=1}^{30} \left[ p_I(kT_1) \right]^2 \tag{41}$$

where $T_1 = 0.01$ sec. The values of $T_1$ and $k$ are chosen to include the time interval [0, 0.3] during which most of the transient response occurs. Of course, computation of Eq.(41) requires that a linear simulation be performed during each function evaluation of the optimization. The constraints are shown below where $\zeta$ is the damping ratio.

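$$\mathrm{Re}\,\lambda_{dr} \in [-50, -6] \tag{42}$$

$$\mathrm{Re}\,\lambda_{roll} \in [-50, -6] \tag{43}$$

$$\mathrm{Re}\,\lambda_{rudder} < -100 \tag{44}$$

$$\mathrm{Re}\,\lambda_{aileron} < -100 \tag{45}$$

$$\zeta_{dr} \in [0.4, 0.8] \tag{46}$$

$$\zeta_{roll} \in [0.4, 0.8] \tag{47}$$

$$|\dot{\delta}_r| < 275 \ \text{deg/sec} \tag{48}$$

$$|\dot{\delta}_a| < 275 \ \text{deg/sec} \tag{49}$$

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} \left[ A_{\max} + B_{\max}(FC)^+ \right] \right) < 0.999 \tag{50}$$

where for illustrative purposes we have chosen $A_{\max} = 0.1\,A^+$ and $B_{\max} = 0$.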
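The objective of Eq.(41) and the eigenvalue constraints of Eqs.(42), (43), (46), and (47) are simple to evaluate once a simulated response and the closed loop eigenvalues are available. The sketch below assumes that the objective is the sum of squared samples of $p_I$ at $t = kT_1$, $k = 1, \ldots, 30$; the function names (`objective`, `damping_ratio`, `mode_ok`) are hypothetical, and the response is passed in as a Python callable rather than produced by a linear simulation.

```python
T1 = 0.01   # sample period of Eq.(41), in seconds
K = 30      # number of samples, covering the interval [0, 0.3] sec

def objective(p_int):
    """Sum of squared integrated roll rate samples; p_int maps time (sec) to deg."""
    return sum(p_int(k * T1) ** 2 for k in range(1, K + 1))

def damping_ratio(lam):
    """Damping ratio of a stable complex eigenvalue: -Re(lam) / |lam|."""
    return -lam.real / abs(lam)

def mode_ok(lam, re_lo=-50.0, re_hi=-6.0, z_lo=0.4, z_hi=0.8):
    """Check the real part and damping ratio bounds of Eqs.(42)-(43) and (46)-(47)."""
    return re_lo <= lam.real <= re_hi and z_lo <= damping_ratio(lam) <= z_hi

# Roll mode eigenvalue of the robust design, taken from Table 1.
lam_roll = complex(-40.55, 92.90)
print(damping_ratio(lam_roll), mode_ok(lam_roll))
```

The damping ratio of the robust design's roll mode evaluates to approximately 0.40, which sits at the lower bound of Eq.(47).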

The  actuator  deflection  rates  are  computed  from  the  slopes  of  the  time 
responses  of  the  deflections  during  the  time  interval  [0,0.03].  This 
interval  is  chosen  because  the  slopes  of  the  deflections  are  largest 


during  this  time  interval.  Mathematically, 
(51)


max|5  (mT_) 
1  r  2 


.  maxjS  (mT  )| 

3  |  = - - - - -  (52) 

21  T 

mT2 

where $T_2 = 0.001$ sec and $m = 0, 1, \ldots, 30$. The maximum deflection rates chosen for the constraints are well within the expected 400 deg/sec limit for the advanced state of the art electromechanical actuator described by Langehough and Simons (1988).

The parameter vector contains the quantities which may be varied by the optimization. This twelve dimensional vector includes $\mathrm{Re}\,\lambda_{dr}$, $\mathrm{Im}\,\lambda_{dr}$, $\mathrm{Re}\,\lambda_{roll}$, $\mathrm{Im}\,\lambda_{roll}$, $\mathrm{Re}\,z_1(1)$, $\mathrm{Re}\,z_1(2)$, $\mathrm{Im}\,z_1(1)$, $\mathrm{Im}\,z_1(2)$, and the real and imaginary parts of the two components of the second free parameter vector. Here, the two dimensional complex vectors $z_i$ contain the free eigenvector parameters. That is, the i-th eigenvector $v_i$ may be written as

$$v_i = L_i z_i \tag{53}$$

where the columns of $L_i = (\lambda_i I - A)^{-1}B$ are a basis for the subspace in which the i-th eigenvector must reside. Thus, the free parameters are the vectors $z_i$ rather than the eigenvectors $v_i$.

The optimization is performed by using subroutine constr from the MATLAB™ Optimization Toolbox (Grace 1990) on a 486™ 25 MHz personal computer. The optimization is initialized with the design proposed by Sobel and Cloutier (1991), which yields an initial value of 5.0214 for the objective function of Eq.(41), a value of 4.0472 for the left hand side (LHS) of the earlier robustness condition of Eq.(9), and a value of 2.0686 for the LHS of the new robustness condition of Eq.(10). The optimization is complete after 2698 function evaluations, which require approximately one hour of computation, and yields an optimal objective function of 0.0284, a value of 2.4432 for the LHS of the earlier robustness condition of Eq.(9), and a value of 0.999 for the LHS of the new robustness condition of Eq.(10). We observe from Table 1 that the dutch roll mode is dominant in the robust design whereas the roll mode was chosen to be dominant in the design of Sobel and Cloutier (1991). Furthermore, the optimization has assigned the roll mode damping to be the smallest value which is allowed by the constraint of Eq.(47).

The time histories of sideslip angle, yaw rate, roll rate, integrated roll rate, rudder deflection, and aileron deflection to a one degree initial sideslip are shown in Figures 1, 2, 3, 4, 5, and 6, respectively. We observe a significant improvement in the integrated roll rate response (which is desired to be zero) when compared to the earlier design of Sobel and Cloutier (1991). The earlier design of Sobel and Cloutier (1991) has a minimum $p_I(t)$ of -.646 deg, but the new design of this paper has a minimum $p_I(t)$ of -.112 deg, which is an improvement of approximately 80%. We note that this improved response is obtained with both smaller aileron and rudder deflections. Furthermore, it is interesting that the aileron in the design of Sobel and Cloutier (1991) exhibits an initial positive deflection of approximately two degrees before becoming negative, whereas this large positive initial aileron deflection does not appear in the new robust design.
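The quoted improvement can be reproduced directly from the two minimum values of $p_I(t)$ given above:

```latex
\[
\frac{0.646 - 0.112}{0.646} = \frac{0.534}{0.646} \approx 0.83 ,
\]
```

which is consistent with the stated improvement of approximately 80%.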
Fig. 1. Sideslip angle (continuous time; $\beta(0) = 1$ deg). Axes: degrees versus time (sec).

Fig. 2. Yaw rate (continuous time; $\beta(0) = 1$ deg). Axes: versus time (sec).

Fig. 3. Roll rate (continuous time; $\beta(0) = 1$ deg). Axes: versus time (sec).

Fig. 4. Integrated roll rate (continuous time; $\beta(0) = 1$ deg). Axes: degrees versus time (sec).

Fig. 5. Rudder deflection (continuous time; $\beta(0) = 1$ deg)


Next, the response to a roll rate step of 5 deg/sec is considered. The roll rate, rudder deflection, and aileron deflection are shown in Figures 7, 8, and 9, respectively. The new robust design exhibits a roll rate response with a significantly smaller settling time, which is achieved by using larger aileron deflection rates. Nevertheless, these deflection rates of approximately 25 deg/sec are well within the allowable limits.
Fig. 7. Roll rate step response (continuous time)

Fig. 8. Rudder deflection for step response (continuous time)

Fig. 9. Aileron deflection for step response (continuous time)
3. ROBUST EIGENSTRUCTURE ASSIGNMENT FOR SAMPLED DATA SYSTEMS: A UNIFIED APPROACH

3.1 Problem Formulation

Consider a nominal linear time-invariant multi-input multi-output system described by

$$\dot{x}(t) = Ax(t) + Bu(t) \tag{54}$$

$$y(t) = Cx(t) \tag{55}$$

where $x \in \mathbb{R}^n$ is the state vector, $u \in \mathbb{R}^m$ is the input vector, $y \in \mathbb{R}^r$ is the output vector, and $A$, $B$, $C$ are constant matrices.

The corresponding sampled data system, which is obtained by using Middleton and Goodwin's (1990) delta operator, is shown below.

$$\delta x = A_\delta x + B_\delta u \tag{56}$$

$$y = Cx \tag{57}$$

where

$$A_\delta = \Omega A \tag{58}$$

$$B_\delta = \Omega B \tag{59}$$

$$\Omega = \frac{1}{\Delta} \int_0^\Delta e^{A\tau}\,d\tau \tag{60}$$

and where

$$\delta = \frac{q - 1}{\Delta} \tag{61}$$

and where the shift operator $q$ is defined by

$$q x_k = x_{k+1} \tag{62}$$

and where $\Delta$ is the sampling period.

The unified state space model proposed by Middleton and Goodwin (1990) is valid for both the discrete and continuous time cases simultaneously.
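The delta operator model of Eqs.(56)-(62) is easiest to see for a scalar plant, where the integral in Eq.(60) has a closed form. The sketch below is illustrative only: the function names `delta_model` and `step` and the numerical values are hypothetical, and the scalar pole is assumed to be nonzero.

```python
import math

def delta_model(a, b, Delta):
    """Scalar delta-operator model (A_delta, B_delta) of x' = a x + b u, with a != 0."""
    # Omega = (1/Delta) * integral_0^Delta exp(a*tau) dtau, Eq.(60)
    omega = (math.exp(a * Delta) - 1.0) / (a * Delta)
    return omega * a, omega * b

def step(x, u, a_d, b_d, Delta):
    """One sample update from Eqs.(56), (61), (62): x_{k+1} = x_k + Delta*(A_delta x_k + B_delta u_k)."""
    return x + Delta * (a_d * x + b_d * u)

# As Delta shrinks, the delta model approaches the continuous time model.
for Delta in (0.1, 0.01, 0.0001):
    print(Delta, delta_model(-2.0, 1.0, Delta))
```

For small sampling periods the pair $(A_\delta, B_\delta)$ approaches $(a, b)$, which is the property that lets one model cover both the continuous and the sampled data cases.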
This unified model is described by

$$\rho x(t) = A_\rho x(t) + B_\rho u(t) \tag{63}$$

$$y(t) = Cx(t) \tag{64}$$

where

$$A_\rho = \begin{cases} A & \text{in continuous time} \\ A_\delta & \text{in discrete time} \end{cases} \tag{65}$$

$$B_\rho = \begin{cases} B & \text{in continuous time} \\ B_\delta & \text{in discrete time} \end{cases} \tag{66}$$

and where

$$\rho = \begin{cases} \dfrac{d}{dt} & \text{in continuous time} \\ \delta & \text{in discrete time} \end{cases} \tag{67}$$

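Suppose that the nominal delta system is subject to linear time-invariant uncertainties in the entries of $A_\rho$, $B_\rho$ described by $dA_\rho$ and $dB_\rho$, respectively, where

$$dA_\rho = \begin{cases} dA & \text{in continuous time} \\ dA_\delta & \text{in discrete time} \end{cases} \tag{68}$$

$$dB_\rho = \begin{cases} dB & \text{in continuous time} \\ dB_\delta & \text{in discrete time} \end{cases} \tag{69}$$

and where

$$dA_\delta = \frac{1}{\Delta} \left[ e^{(A+dA)\Delta} - e^{A\Delta} \right] \tag{70}$$

$$dB_\delta = \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A+dA)\tau}\,d\tau\,(B + dB) - \int_0^\Delta e^{A\tau}\,d\tau\,B \right] \tag{71}$$

Then, the delta system with uncertainty is given by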
$$\rho x(t) = A_\rho x(t) + B_\rho u(t) + dA_\rho x(t) + dB_\rho u(t) \tag{72}$$

$$y(t) = Cx(t) \tag{73}$$

Further, suppose that bounds are available on the maximum absolute values of the elements of $dA$ and $dB$. That is,

$$|da_{ij}| \le (a_{ij})_{\max};\quad i = 1, \ldots, n;\ j = 1, \ldots, n \tag{74}$$

$$|db_{ij}| \le (b_{ij})_{\max};\quad i = 1, \ldots, n;\ j = 1, \ldots, m \tag{75}$$

Then, the corresponding bounds on the $\delta$ system, through Eqs.(70) and (71), are

$$|da_\delta(i,j)| \le [da_\delta(i,j)]_{\max};\quad i = 1, \ldots, n;\ j = 1, \ldots, n \tag{76}$$

$$|db_\delta(i,j)| \le [db_\delta(i,j)]_{\max};\quad i = 1, \ldots, n;\ j = 1, \ldots, m \tag{77}$$

Define $dA_\rho^+$ and $dB_\rho^+$ as the matrices obtained by replacing the entries of $dA_\rho$ and $dB_\rho$ by their absolute values. Also, define $A_{\rho\max}$ and $B_{\rho\max}$ as the matrices with entries $(a_{ij})_{\max}$ and $(b_{ij})_{\max}$, respectively, in continuous time or with entries $[da_\delta(i,j)]_{\max}$ and $[db_\delta(i,j)]_{\max}$, respectively, in discrete time. Then,

$$\{ dA_\rho : dA_\rho^+ \le A_{\rho\max} \} \tag{78}$$

$$\{ dB_\rho : dB_\rho^+ \le B_{\rho\max} \} \tag{79}$$

where

$$A_{\rho\max} = \begin{cases} A_{\max} & \text{in continuous time} \\ A_{\delta\max} & \text{in discrete time} \end{cases} \tag{80}$$
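$$B_{\rho\max} = \begin{cases} B_{\max} & \text{in continuous time} \\ B_{\delta\max} & \text{in discrete time} \end{cases} \tag{81}$$

and where

$$A_{\delta\max} = \frac{1}{\Delta} \left[ e^{(A^+ + A_{\max})\Delta} - e^{A^+\Delta} \right] \tag{82}$$

$$B_{\delta\max} = \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A^+ + A_{\max})\tau}\,d\tau\,(B^+ + B_{\max}) - \int_0^\Delta e^{A^+\tau}\,d\tau\,B^+ \right] \tag{83}$$

and where "≤" is applied element by element to matrices and $A_{\max} \in \mathbb{R}_+^{n \times n}$, $B_{\max} \in \mathbb{R}_+^{n \times m}$, where $\mathbb{R}_+$ is the set of non-negative numbers.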

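Consider the constant gain output feedback control law described by

$$u(t) = F_\rho\,y(t) \tag{84}$$

where

$$F_\rho = \begin{cases} F & \text{in continuous time} \\ F_\delta & \text{in discrete time} \end{cases} \tag{85}$$

Then, the nominal closed loop unified delta system is given by

$$\rho x(t) = A_{\rho c}\,x(t) \tag{86}$$

where

$$A_{\rho c} = \begin{cases} A + BFC & \text{in continuous time} \\ A_\delta + B_\delta F_\delta C & \text{in discrete time} \end{cases} \tag{87}$$

and the uncertain closed loop unified delta system is given by

$$\rho x(t) = A_{\rho c}\,x(t) + dA_{\rho c}\,x(t) \tag{88}$$

where

$$dA_{\rho c} = \begin{cases} dA + dB(FC) & \text{in continuous time} \\ dA_\delta + dB_\delta(F_\delta C) & \text{in discrete time} \end{cases} \tag{89}$$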

Finally, the stability robustness problem can be stated as follows: Given a feedback gain matrix $F_\rho \in \mathbb{R}^{m \times r}$ such that the nominal closed loop unified delta system exhibits desirable dynamic performance, determine if the uncertain closed loop unified delta system is asymptotically stable for all time-invariant $dA_\rho$ and $dB_\rho$ described by Eqs.(78) and (79), respectively.


3.2 Robustness Results

Theorem 4: (Delta Eigenvectors)

Consider the continuous time plant given by $\dot{x} = Ax + Bu$ and the sampled data plant given by $\delta x = A_\delta x + B_\delta u$. The i-th eigenvalues of $A$ and $A_\delta$ are $\lambda_i$ and $\gamma_i = \dfrac{\exp(\lambda_i\Delta) - 1}{\Delta}$, respectively. Let $M$ be a non-defective modal matrix and let $\Lambda$ be a diagonal matrix with the $\lambda_i$ on the diagonal. Then,

$$M^{-1}A_\delta M = \gamma_i I = \frac{1}{\Delta}\left( e^{\Lambda\Delta} - I \right) \ \text{ if and only if } \ M^{-1}AM = \lambda_i I = \Lambda \tag{90}$$

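Proof:

Sufficiency: Use the definition of $A_\delta$ to obtain

$$M^{-1}A_\delta M = \frac{1}{\Delta} M^{-1}\left( e^{A\Delta} - I \right) M \tag{91}$$

Substitute the infinite series for $\exp(A\Delta)$ into Eq.(91) to obtain

$$M^{-1}A_\delta M = \frac{1}{\Delta} M^{-1}\left( I + A\Delta + \frac{A^2\Delta^2}{2!} + \cdots - I \right) M \tag{92}$$

$$= \frac{1}{\Delta} \left[ I + M^{-1}AM\Delta + \frac{M^{-1}A^2M\Delta^2}{2!} + \cdots - I \right] \tag{93}$$

$$= \frac{1}{\Delta} \left[ I + M^{-1}AM\Delta + (M^{-1}AM)^2\frac{\Delta^2}{2!} + (M^{-1}AM)^3\frac{\Delta^3}{3!} + \cdots - I \right] \tag{94}$$

Substitute $M^{-1}AM = \Lambda$ into Eq.(94) to obtain

$$M^{-1}A_\delta M = \frac{1}{\Delta} \left[ I + \Lambda\Delta + \Lambda^2\frac{\Delta^2}{2!} + \Lambda^3\frac{\Delta^3}{3!} + \cdots - I \right] \tag{95}$$

$$= \frac{1}{\Delta}\left( e^{\Lambda\Delta} - I \right) \tag{96}$$

Necessity:

$$\frac{1}{\Delta} M^{-1}\left( e^{A\Delta} - I \right) M = \frac{1}{\Delta}\left( e^{\Lambda\Delta} - I \right) \tag{97}$$

$$\Rightarrow\ M^{-1}e^{A\Delta}M = e^{\Lambda\Delta} \tag{98}$$

$$\Rightarrow\ e^{M^{-1}AM\Delta} = e^{\Lambda\Delta} \tag{99}$$

$$\Rightarrow\ M^{-1}AM\Delta = \Lambda\Delta \tag{100}$$

$$\Rightarrow\ M^{-1}AM = \Lambda \tag{101}$$

Q.E.D.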
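The eigenvalue relation of Theorem 4 is a one-line computation. The following sketch maps a continuous time eigenvalue to its delta-domain counterpart and back; the function names are hypothetical, the sampling period of 0.01 sec is an assumed value, and the inverse map uses the principal branch of the logarithm, so it is only valid when $|\mathrm{Im}\,\lambda_i|\,\Delta < \pi$.

```python
import cmath

def to_delta(lam, Delta):
    """gamma_i = (exp(lambda_i * Delta) - 1) / Delta."""
    return (cmath.exp(lam * Delta) - 1.0) / Delta

def to_continuous(gamma, Delta):
    """Inverse map lambda_i = log(1 + Delta * gamma_i) / Delta (principal branch)."""
    return cmath.log(1.0 + Delta * gamma) / Delta

lam_dr = complex(-14.94, 16.68)    # dutch roll eigenvalue of the robust design (Table 1)
gamma_dr = to_delta(lam_dr, 0.01)  # assumed sampling period of 0.01 sec
print(gamma_dr, to_continuous(gamma_dr, 0.01))
```

As the sampling period tends to zero, `to_delta` returns a value that approaches the continuous time eigenvalue itself.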


Theorem 5: (Delta Settling Time and Damping Regions)

Consider the plant described in Theorem 4 with eigenvalues $\lambda_i$ in continuous time and eigenvalues $\gamma_i$ for sampled data operation. The s-plane settling time region described by

$$\mathrm{Re}\,\lambda_i < \sigma \tag{102}$$

maps into the $\gamma$-plane region described by

$$|1 + \Delta\gamma_i| < e^{\sigma\Delta} \tag{103}$$

and the s-plane damping region described by

$$\cos\left( \tan^{-1}\left( \mathrm{Im}\,\lambda_i / (-\mathrm{Re}\,\lambda_i) \right) \right) > \zeta \tag{104}$$

maps into the $\gamma$-plane region described by

$$|1 + \Delta\gamma_i| < \exp\left( -\zeta\phi / (1 - \zeta^2)^{1/2} \right) \tag{105}$$

where $\phi = \arg(1 + \Delta\gamma)$.
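Proof:

Settling Time Region:

Use $1 + \Delta\gamma_i = \exp(\lambda_i\Delta)$ to obtain $|1 + \Delta\gamma_i| = \exp(\Delta \cdot \mathrm{Re}\,\lambda_i)$. Then, $\mathrm{Re}\,\lambda_i < \sigma \Rightarrow |1 + \Delta\gamma_i| < \exp(\sigma\Delta)$.

Damping Ratio Region:

$$\lambda_i = -\zeta_i\omega_{ni} + j\omega_{di} \tag{106}$$

$$1 + \Delta\gamma_i = \exp(\lambda_i\Delta) = \exp(-\zeta_i\omega_{ni}\Delta + j\omega_{di}\Delta) \tag{107}$$

Use $\omega_s\Delta = 2\pi$ and $\omega_{di} = \omega_{ni}(1 - \zeta_i^2)^{1/2}$ to obtain

$$1 + \Delta\gamma_i = \exp\left[ \frac{-\zeta_i 2\pi}{(1 - \zeta_i^2)^{1/2}} \frac{\omega_{di}}{\omega_s} + j2\pi\frac{\omega_{di}}{\omega_s} \right] \tag{108}$$

which has a magnitude given by

$$|1 + \Delta\gamma_i| = \exp\left[ \frac{-\zeta_i 2\pi}{(1 - \zeta_i^2)^{1/2}} \frac{\omega_{di}}{\omega_s} \right] = \exp\left[ \frac{-\zeta_i\phi_i}{(1 - \zeta_i^2)^{1/2}} \right] \tag{109}$$

and a phase angle given by

$$\arg(1 + \Delta\gamma_i) = 2\pi\frac{\omega_{di}}{\omega_s} = \phi_i \tag{110}$$

So

$$\zeta_i > \zeta \ \Rightarrow\ |1 + \Delta\gamma_i| < \exp\left[ \frac{-\zeta\phi}{(1 - \zeta^2)^{1/2}} \right] \tag{111}$$

where $\phi = \arg(1 + \Delta\gamma)$.

Q.E.D.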
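The two regions of Theorem 5 can be tested directly on a delta-domain eigenvalue. The sketch below is a minimal illustration with hypothetical function names; it takes the absolute value of the phase angle so that both members of a complex conjugate pair are treated alike, which is an assumption made here and is not stated in the theorem.

```python
import cmath
import math

def in_settling_region(gamma, Delta, sigma):
    """Eq.(103): |1 + Delta*gamma| < exp(sigma*Delta)."""
    return abs(1.0 + Delta * gamma) < math.exp(sigma * Delta)

def in_damping_region(gamma, Delta, zeta):
    """Eq.(105): |1 + Delta*gamma| < exp(-zeta*phi / sqrt(1 - zeta^2))."""
    z = 1.0 + Delta * gamma
    phi = abs(cmath.phase(z))  # assumed: magnitude of the phase angle
    return abs(z) < math.exp(-zeta * phi / math.sqrt(1.0 - zeta ** 2))

Delta = 0.001                      # assumed sampling period (sec)
lam = complex(-40.55, 92.90)       # roll mode of the robust design (Table 1)
gamma = (cmath.exp(lam * Delta) - 1.0) / Delta
print(in_settling_region(gamma, Delta, -6.0))
print(in_damping_region(gamma, Delta, 0.3), in_damping_region(gamma, Delta, 0.5))
```

Since this mode has a damping ratio of about 0.40, it lies inside the region for $\zeta = 0.3$ and outside the region for $\zeta = 0.5$.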


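Theorem 6: (Delta Eigenstructure Assignment Feedback Gain Matrix)

Suppose $u = F_\delta y$ with $F_\delta \in \mathbb{R}^{m \times r}$ is such that the nominal closed loop system $A_\delta + B_\delta F_\delta C$ is asymptotically stable with a non-defective modal matrix. Let $\Lambda_{\delta r}$ be the $r \times r$ diagonal matrix whose entries are the assignable closed loop eigenvalues and let $M_r$ be the $n \times r$ matrix whose columns are the corresponding achievable eigenvectors. If a solution exists for

$$(A_\delta + B_\delta F_\delta C)\,M_r = M_r\Lambda_{\delta r} \tag{112}$$

then

$$F_\delta = V_\delta\Sigma_\delta^{-1}U_{\delta 0}^T \left( M_r\Lambda_{\delta r} - A_\delta M_r \right) V_r\Sigma_r^{-1}U_{r0}^T \tag{113}$$

where

$$B_\delta = \begin{bmatrix} U_{\delta 0} & U_{\delta 1} \end{bmatrix} \begin{bmatrix} \Sigma_\delta V_\delta^T \\ 0 \end{bmatrix} \tag{114}$$

and

$$CM_r = \begin{bmatrix} U_{r0} & U_{r1} \end{bmatrix} \begin{bmatrix} \Sigma_r V_r^T \\ 0 \end{bmatrix} \tag{115}$$

are the singular value decompositions of $B_\delta$ and $CM_r$, respectively.

Furthermore, when $\Delta \to 0$, $F_\delta \to F$ where

$$F = V_B\Sigma_B^{-1}U_{B0}^T \left( M_r\Lambda_r - AM_r \right) V_r\Sigma_r^{-1}U_{r0}^T \tag{116}$$

is the feedback gain for the continuous time plant which satisfies

$$(A + BFC)\,M_r = M_r\Lambda_r \tag{117}$$

and where

$$B = \begin{bmatrix} U_{B0} & U_{B1} \end{bmatrix} \begin{bmatrix} \Sigma_B V_B^T \\ 0 \end{bmatrix} \tag{118}$$

is a singular value decomposition of $B$ and $\Lambda_r$ is a diagonal $r \times r$ matrix containing the assignable eigenvalues $\lambda_i$ $(i = 1, 2, \ldots, r)$.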

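Proof: The $r$ assignable eigenvalues and their corresponding eigenvectors for the sampled data system satisfy the equations described by

$$(A_\delta + B_\delta F_\delta C)\,v_i = \gamma_i v_i;\quad i = 1, 2, \ldots, r \tag{119}$$

Combining the $r$ equations from Eq.(119) yields

$$(A_\delta + B_\delta F_\delta C)\,M_r = M_r\Lambda_{\delta r} \tag{120}$$

Rearrange to obtain

$$B_\delta F_\delta CM_r = M_r\Lambda_{\delta r} - A_\delta M_r \tag{121}$$

Substitute the singular value decompositions of $B_\delta$ and $CM_r$ to obtain

$$\begin{bmatrix} U_{\delta 0} & U_{\delta 1} \end{bmatrix} \begin{bmatrix} \Sigma_\delta V_\delta^T \\ 0 \end{bmatrix} F_\delta \begin{bmatrix} U_{r0} & U_{r1} \end{bmatrix} \begin{bmatrix} \Sigma_r V_r^T \\ 0 \end{bmatrix} = M_r\Lambda_{\delta r} - A_\delta M_r \tag{122}$$

$$U_{\delta 0}\Sigma_\delta V_\delta^T F_\delta U_{r0}\Sigma_r V_r^T = M_r\Lambda_{\delta r} - A_\delta M_r \tag{123}$$

Taking the required inverses and recalling that $U_{\delta 0}$, $U_{r0}$, $V_\delta$, and $V_r$ are unitary, we obtain

$$F_\delta = V_\delta\Sigma_\delta^{-1}U_{\delta 0}^T \left( M_r\Lambda_{\delta r} - A_\delta M_r \right) V_r\Sigma_r^{-1}U_{r0}^T \tag{124}$$

Next, we consider the limiting behavior of $F_\delta$ as $\Delta \to 0$. Middleton and Goodwin (1990) show that as $\Delta \to 0$, $A_\delta \to A$, $B_\delta \to B$, and $\Lambda_{\delta r} \to \Lambda_r$. Thus, $V_\delta\Sigma_\delta^{-1}U_{\delta 0}^T \to V_B\Sigma_B^{-1}U_{B0}^T$, and $F_\delta \to F$.

Q.E.D.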


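Theorem 7: (Unified Robustness Sufficient Condition)

Suppose that $F_\rho$ is such that the nominal closed loop system described by Eq.(86) is asymptotically stable with a non-defective modal matrix. Then, the uncertain closed loop system given by Eq.(88) is asymptotically stable for $dA_\rho$ and $dB_\rho$ described by Eqs.(78) and (79), respectively, if

$$\lambda_{\max}\left( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{\rho c\max} \right) < 1 \tag{125}$$

where

$$f(\gamma_i) = \begin{cases} -\mathrm{Re}(\gamma_i) & \text{continuous time} \\ \dfrac{1}{\Delta}\left[ 1 - (1 + \Delta\gamma_i)^+ \right] & \text{discrete time} \end{cases} \tag{126}$$

$$A_{\rho c\max} = \begin{cases} A_{\max} + B_{\max}(FC)^+ & \text{continuous time} \\ A_{\delta\max} + B_{\delta\max}(F_\delta C)^+ & \text{discrete time} \end{cases} \tag{127}$$

$$A_{\delta\max} = \frac{1}{\Delta} \left[ e^{(A^+ + A_{\max})\Delta} - e^{A^+\Delta} \right]$$

$$B_{\delta\max} = \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A^+ + A_{\max})\tau}\,d\tau\,(B^+ + B_{\max}) - \int_0^\Delta e^{A^+\tau}\,d\tau\,B^+ \right] \tag{128}$$

and where $\gamma_i$ is the i-th eigenvalue of $(A_\rho + B_\rho F_\rho C)$ with $v_i$ and $w_i$ the corresponding right and left eigenvectors, respectively; and where $(\cdot)^*$ denotes the complex conjugate transpose.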
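The weighting $f(\gamma_i)$ of Eq.(126) plays the role that $\alpha_i$ plays in Eq.(10), and the two branches agree as the sampling period shrinks. The sketch below evaluates both branches for one eigenvalue; the function names and the numerical values are hypothetical.

```python
import cmath

def f_continuous(lam):
    """Continuous time branch of Eq.(126): f = -Re(gamma_i), with gamma_i = lambda_i."""
    return -lam.real

def f_discrete(lam, Delta):
    """Discrete time branch of Eq.(126): f = (1 - |1 + Delta*gamma_i|) / Delta."""
    gamma = (cmath.exp(lam * Delta) - 1.0) / Delta
    return (1.0 - abs(1.0 + Delta * gamma)) / Delta

lam = complex(-2.0, 3.0)   # illustrative closed loop eigenvalue
for Delta in (0.1, 0.01, 0.0001):
    print(Delta, f_discrete(lam, Delta), f_continuous(lam))
```

For a stable eigenvalue the discrete time value is slightly smaller than $-\mathrm{Re}(\lambda_i)$ and approaches it as $\Delta \to 0$.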

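Proof: The proof starts by establishing the following lemma:

Lemma 1: (Delta Uncertainty Bounds)

$$(dA_\delta)^+ \le A_{\delta\max} = \frac{1}{\Delta} \left[ e^{(A^+ + A_{\max})\Delta} - e^{A^+\Delta} \right] \tag{129}$$

$$(dB_\delta)^+ \le B_{\delta\max} = \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A^+ + A_{\max})\tau}\,d\tau\,(B^+ + B_{\max}) - \int_0^\Delta e^{A^+\tau}\,d\tau\,B^+ \right] \tag{130}$$

Proof of Lemma 1:

$$\begin{aligned}
(dA_\delta)^+ &= \frac{1}{\Delta} \left[ e^{(A+dA)\Delta} - e^{A\Delta} \right]^+ \\
&= \frac{1}{\Delta} \left\{ \left[ I + (A+dA)\Delta + (A+dA)^2\frac{\Delta^2}{2!} + (A+dA)^3\frac{\Delta^3}{3!} + \cdots \right] - \left[ I + A\Delta + A^2\frac{\Delta^2}{2!} + A^3\frac{\Delta^3}{3!} + \cdots \right] \right\}^+ \\
&= \frac{1}{\Delta} \left\{ dA\,\Delta + \left[ dA\,A + A\,dA + (dA)^2 \right]\frac{\Delta^2}{2!} + \left[ A\,dA\,A + dA\,A\,dA + dA\,A^2 + A^2\,dA + (dA)^2 A + A\,(dA)^2 + (dA)^3 \right]\frac{\Delta^3}{3!} + \cdots \right\}^+ \\
&\le \frac{1}{\Delta} \left\{ A_{\max}\Delta + \left[ A_{\max}A^+ + A^+A_{\max} + A_{\max}^2 \right]\frac{\Delta^2}{2!} + \left[ A^+A_{\max}A^+ + A_{\max}A^+A_{\max} + A_{\max}(A^+)^2 + (A^+)^2A_{\max} + A_{\max}^2A^+ + A^+A_{\max}^2 + A_{\max}^3 \right]\frac{\Delta^3}{3!} + \cdots \right\} \\
&= \frac{1}{\Delta} \left\{ \left[ I + (A^+ + A_{\max})\Delta + (A^+ + A_{\max})^2\frac{\Delta^2}{2!} + (A^+ + A_{\max})^3\frac{\Delta^3}{3!} + \cdots \right] - \left[ I + A^+\Delta + (A^+)^2\frac{\Delta^2}{2!} + (A^+)^3\frac{\Delta^3}{3!} + \cdots \right] \right\} \\
&= \frac{1}{\Delta} \left[ e^{(A^+ + A_{\max})\Delta} - e^{A^+\Delta} \right]
\end{aligned}$$

$$\begin{aligned}
(dB_\delta)^+ &= \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A+dA)\tau}\,d\tau\,(B + dB) - \int_0^\Delta e^{A\tau}\,d\tau\,B \right]^+ \\
&= \frac{1}{\Delta} \left\{ \left[ I\Delta + (A+dA)\frac{\Delta^2}{2!} + (A+dA)^2\frac{\Delta^3}{3!} + (A+dA)^3\frac{\Delta^4}{4!} + \cdots \right](B + dB) - \left[ I\Delta + A\frac{\Delta^2}{2!} + A^2\frac{\Delta^3}{3!} + A^3\frac{\Delta^4}{4!} + \cdots \right]B \right\}^+ \\
&= \frac{1}{\Delta} \left\{ \left[ I\Delta + A\frac{\Delta^2}{2!} + A^2\frac{\Delta^3}{3!} + A^3\frac{\Delta^4}{4!} + \cdots \right]dB + \left[ dA\frac{\Delta^2}{2!} + \left( A\,dA + dA\,A + (dA)^2 \right)\frac{\Delta^3}{3!} \right.\right. \\
&\qquad \left.\left. + \left( A\,dA\,A + dA\,A\,dA + dA\,A^2 + A^2\,dA + (dA)^2A + A\,(dA)^2 + (dA)^3 \right)\frac{\Delta^4}{4!} + \cdots \right](B + dB) \right\}^+ \\
&\le \frac{1}{\Delta} \left\{ \left[ I\Delta + A^+\frac{\Delta^2}{2!} + (A^+)^2\frac{\Delta^3}{3!} + (A^+)^3\frac{\Delta^4}{4!} + \cdots \right]B_{\max} + \left[ A_{\max}\frac{\Delta^2}{2!} + \left( A^+A_{\max} + A_{\max}A^+ + A_{\max}^2 \right)\frac{\Delta^3}{3!} \right.\right. \\
&\qquad \left.\left. + \left( A^+A_{\max}A^+ + A_{\max}A^+A_{\max} + A_{\max}(A^+)^2 + (A^+)^2A_{\max} + A_{\max}^2A^+ + A^+A_{\max}^2 + A_{\max}^3 \right)\frac{\Delta^4}{4!} + \cdots \right](B^+ + B_{\max}) \right\} \\
&= \frac{1}{\Delta} \left\{ \left[ I\Delta + (A^+ + A_{\max})\frac{\Delta^2}{2!} + (A^+ + A_{\max})^2\frac{\Delta^3}{3!} + (A^+ + A_{\max})^3\frac{\Delta^4}{4!} + \cdots \right](B^+ + B_{\max}) - \left[ I\Delta + A^+\frac{\Delta^2}{2!} + (A^+)^2\frac{\Delta^3}{3!} + (A^+)^3\frac{\Delta^4}{4!} + \cdots \right]B^+ \right\} \\
&= \frac{1}{\Delta} \left[ \int_0^\Delta e^{(A^+ + A_{\max})\tau}\,d\tau\,(B^+ + B_{\max}) - \int_0^\Delta e^{A^+\tau}\,d\tau\,B^+ \right]
\end{aligned}$$

Q.E.D.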
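The bounds of Lemma 1 can be spot-checked numerically for a scalar plant, where the matrix exponentials and integrals reduce to elementary functions. The sketch below sweeps the perturbations over a grid and confirms Eqs.(129) and (130); the function names and all numerical values are hypothetical, and a grid check of this kind is an illustration rather than a proof.

```python
import math

def int_exp(a, D):
    """Integral of exp(a*tau) over tau in [0, D]."""
    return D if a == 0.0 else (math.exp(a * D) - 1.0) / a

def dA_delta(a, da, D):
    """Scalar version of Eq.(70)."""
    return (math.exp((a + da) * D) - math.exp(a * D)) / D

def dB_delta(a, da, b, db, D):
    """Scalar version of Eq.(71)."""
    return (int_exp(a + da, D) * (b + db) - int_exp(a, D) * b) / D

def A_delta_max(a, a_max, D):
    """Scalar version of the bound in Eq.(129)."""
    return (math.exp((abs(a) + a_max) * D) - math.exp(abs(a) * D)) / D

def B_delta_max(a, a_max, b, b_max, D):
    """Scalar version of the bound in Eq.(130)."""
    return (int_exp(abs(a) + a_max, D) * (abs(b) + b_max) - int_exp(abs(a), D) * abs(b)) / D

def lemma_holds(a, a_max, b, b_max, D, steps=20):
    """Check both bounds for da in [-a_max, a_max] and db in [-b_max, b_max] on a grid."""
    for i in range(steps + 1):
        da = -a_max + 2.0 * a_max * i / steps
        if abs(dA_delta(a, da, D)) > A_delta_max(a, a_max, D) + 1e-12:
            return False
        for j in range(steps + 1):
            db = -b_max + 2.0 * b_max * j / steps
            if abs(dB_delta(a, da, b, db, D)) > B_delta_max(a, a_max, b, b_max, D) + 1e-12:
                return False
    return True

print(lemma_holds(-2.0, 0.2, 1.0, 0.1, 0.1))
```

The bounds are conservative for a stable plant because they replace $A$ by $A^+$, which discards the sign information in the nominal dynamics.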


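The proof of Theorem 7 now continues by observing that the uncertain closed loop system may be written as

$$\rho x(t) = A_{\rho c}\,x(t) + dA_{\rho c}\,x(t) \tag{131}$$

where

$$A_{\rho c} = \begin{cases} A + BFC & \text{continuous} \\ A_\delta + B_\delta F_\delta C & \text{discrete} \end{cases}
\qquad
dA_{\rho c} = \begin{cases} dA + dBFC & \text{continuous} \\ dA_\delta + dB_\delta F_\delta C & \text{discrete} \end{cases}$$

which has a solution given by (Middleton and Goodwin 1990)

$$x(t) = E(A_{\rho c}, t)\,x(0) + S_0^t\, E(A_{\rho c}, t-\tau-\Delta)\,dA_{\rho c}\,x(\tau)\,d\tau \tag{132}$$

where

$$E(A_{\rho c}, t) = \begin{cases} e^{A_{\rho c}t} & \text{continuous time} \\ (I + A_{\rho c}\Delta)^{t/\Delta} & \text{discrete time} \end{cases} \tag{133}$$

$$S_0^t\, f(\tau)\,d\tau = \begin{cases} \displaystyle\int_0^t f(\tau)\,d\tau & \text{continuous time} \\ \displaystyle\Delta\sum_{k=0}^{t/\Delta - 1} f(k\Delta) & \text{discrete time} \end{cases} \tag{134}$$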
Next, use the real, positive, diagonal transformation

$$x(t) = D^{-1}z(t) \tag{135}$$

and the properties that (see Middleton and Goodwin 1990)

$$E(DA_{\rho c}D^{-1}, t) = D\,E(A_{\rho c}, t)\,D^{-1} \tag{136}$$

$$E(A_{\rho c}, t) = M\,E(\Lambda_\rho, t)\,M^{-1} = \sum_{i=1}^{n} v_i w_i^* E(\gamma_i, t) \tag{137}$$

to obtain

$$z(t) = D\sum_{i=1}^{n} v_i w_i^* E(\gamma_i, t)\,D^{-1}z(0) + S_0^t\, D\sum_{i=1}^{n} v_i w_i^* E(\gamma_i, t-\tau-\Delta)\,dA_{\rho c}(\tau)\,D^{-1}z(\tau)\,d\tau \tag{138}$$

where $M$ is a modal matrix of $A_{\rho c}$; $\gamma_i$ is the i-th eigenvalue of $A_{\rho c}$ given by

$$\gamma_i = \begin{cases} \lambda_i & \text{continuous time} \\ \dfrac{e^{\lambda_i\Delta} - 1}{\Delta} & \text{discrete time} \end{cases} \tag{139}$$

with $v_i$ and $w_i$ the corresponding right and left eigenvectors, respectively; $\Lambda_\rho$ is a diagonal matrix with the $\gamma_i$ on the diagonal; and $(\cdot)^*$ denotes complex conjugate transpose.

Note that $\|z(t)\| \to 0$ implies that $\|x(t)\| \to 0$. Next, we apply the absolute value operator, denoted by $(\cdot)^+$, to both sides of Eq.(138), where "+" and "≤" are applied element by element to vectors and matrices.

$$z^+(t) \le \left[ D\sum_{i=1}^{n} v_i w_i^* E(\gamma_i, t)\,D^{-1}z(0) \right]^+ + \left[ S_0^t\, D\sum_{i=1}^{n} v_i w_i^* E(\gamma_i, t-\tau-\Delta)\,dA_{\rho c}(\tau)\,D^{-1}z(\tau)\,d\tau \right]^+ \tag{140}$$

$$\le D\sum_{i=1}^{n} (v_i w_i^*)^+ E^+(\gamma_i, t)\,D^{-1}z^+(0) + S_0^t\, D\sum_{i=1}^{n} (v_i w_i^*)^+ E^+(\gamma_i, t-\tau-\Delta)\,A_{\rho c\max}\,D^{-1}z^+(\tau)\,d\tau \tag{141}$$


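where

$$A_{\rho c\max} = \begin{cases} A_{\max} + B_{\max}(FC)^+ & \text{continuous time} \\ A_{\delta\max} + B_{\delta\max}(F_\delta C)^+ & \text{discrete time} \end{cases} \tag{142}$$

with $A_{\delta\max}$ and $B_{\delta\max}$ given by Lemma 1; and where

$$E(\gamma_i, t) = \begin{cases} e^{\gamma_i t} & \text{continuous time} \\ (1 + \Delta\gamma_i)^{t/\Delta} & \text{discrete time} \end{cases} \tag{143}$$

$$E^+(\gamma_i, t) = \begin{cases} e^{\mathrm{Re}(\gamma_i)t} = e^{-\alpha_i t} & \text{continuous time} \\ \left[ (1 + \Delta\gamma_i)^+ \right]^{t/\Delta} = e^{-\alpha_i t} & \text{discrete time} \end{cases} \tag{144}$$

and $\alpha_i = -\mathrm{Re}(\lambda_i)$.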

Next,  apply  the  S  operator  to  both  sides  of  Eq.(141)  to  obtain 


S  z+(t)dt  s  DZ  (v.wV  S  E+(y..t)dtD"1z+(0) 
0  i=l  1  1  0  1 


<»  t  n  „ 

+  S  S  (v  w  )  E  (y,,t-r-A)A  (t)D  1z+(r)drdt 

t=0  x=0  i=l  1  1  1  pcmax 


(144) 


(145) 


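where

    S_0^\infty E^+(\gamma_i,t)\,dt = \begin{cases} \int_0^\infty e^{Re(\gamma_i)t}\,dt = \int_0^\infty e^{Re(\lambda_i)t}\,dt = 1/\alpha_i & \text{continuous time} \\ \Delta \sum_{k=0}^{\infty} [(1+\Delta\gamma_i)^+]^k = \Delta \sum_{k=0}^{\infty} e^{-\alpha_i k\Delta} = \dfrac{\Delta}{1-\exp(-\alpha_i\Delta)} & \text{discrete time} \end{cases}    (146)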


Consider the double S term in Eq.(145), which may be written as

    \lim_{R\to\infty} \Big[ S_{t=0}^{R} S_{\tau=0}^{t} D \sum_{i=1}^{n} (v_i w_i^*)^+ E^+(\gamma_i,t-\tau-\Delta)\,A_{pcmax}\,D^{-1} z^+(\tau)\,d\tau\,dt \Big]    (147)
In continuous time, Eq.(147) becomes

    \lim_{R\to\infty} \Big[ \int_{t=0}^{R} \int_{\tau=0}^{t} D \sum_{i=1}^{n} (v_i w_i^*)^+ e^{-\alpha_i(t-\tau)} [A_{max} + B_{max}(FC)^+] D^{-1} z^+(\tau)\,d\tau\,dt \Big]    (148)

which satisfies (see Yu et al. 1991)

    \le D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} [A_{max} + B_{max}(FC)^+] D^{-1} \int_0^\infty z^+(t)\,dt    (149)

In discrete time, let t = k\Delta and \tau = j\Delta. Then, Eq.(147) becomes

    \lim_{R\to\infty} \Big[ \Delta \sum_{k=0}^{R} \Delta \sum_{j=0}^{k} D \sum_{i=1}^{n} (v_i w_i^*)^+ [(1+\Delta\gamma_i)^+]^{k-j-1} A_{pcmax} D^{-1} z^+(j) \Big]    (150)

Upon changing the order of the summations and letting q = k-j-1,
Eq.(150) becomes

    \lim_{R\to\infty} \Big[ \Delta^2 \sum_{j=0}^{R} \sum_{q=-j-1}^{R-j-1} D \sum_{i=1}^{n} (v_i w_i^*)^+ [(1+\Delta\gamma_i)^+]^{q} A_{pcmax} D^{-1} z^+(j) \Big] ;  q \ge 0    (151)

    \le \lim_{R\to\infty} \Big[ \Delta^2 \sum_{j=0}^{R} \sum_{q=0}^{R} D \sum_{i=1}^{n} (v_i w_i^*)^+ [(1+\Delta\gamma_i)^+]^{q} A_{pcmax} D^{-1} z^+(j) \Big]    (152)

    \le \lim_{R\to\infty} \Big[ \Delta^2 D \sum_{i=1}^{n} (v_i w_i^*)^+ \sum_{q=0}^{R} [(1+\Delta\gamma_i)^+]^{q} A_{pcmax} D^{-1} \sum_{j=0}^{R} z^+(j) \Big]    (153)

    \le \lim_{R\to\infty} \Big[ \Delta^2 D \sum_{i=1}^{n} (v_i w_i^*)^+ \frac{1-[(1+\Delta\gamma_i)^+]^{R+1}}{1-(1+\Delta\gamma_i)^+} A_{pcmax} D^{-1} \sum_{j=0}^{R} z^+(j) \Big]    (154)

Use Eq.(139) to obtain (1+\Delta\gamma_i) = \exp(\lambda_i\Delta). Hence, (1+\Delta\gamma_i)^+ = \exp(-\alpha_i\Delta) and

    \lim_{R\to\infty} \{ 1 - [(1+\Delta\gamma_i)^+]^{R+1} \} = \lim_{R\to\infty} \{ 1 - \exp[-\alpha_i\Delta(R+1)] \} = 1

because \alpha_i > 0. Substitute this result into Eq.(154) to obtain

    \le \Delta^2 D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{1-(1+\Delta\gamma_i)^+} A_{pcmax} D^{-1} \sum_{j=0}^{\infty} z^+(j)    (155)
Combine Eqs.(149) and (155) and use Eq.(134) to obtain

    S_{t=0}^{\infty} S_{\tau=0}^{t} D \sum_{i=1}^{n} (v_i w_i^*)^+ E^+(\gamma_i,t-\tau-\Delta)\,A_{pcmax}\,D^{-1} z^+(\tau)\,d\tau\,dt
        \le D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{pcmax} D^{-1} S_0^\infty z^+(t)\,dt    (156)

where

    f(\gamma_i) = \begin{cases} -Re(\gamma_i) = -Re(\lambda_i) = \alpha_i & \text{continuous time} \\ \frac{1}{\Delta}[1-(1+\Delta\gamma_i)^+] = \frac{1}{\Delta}[1-e^{-\alpha_i\Delta}] & \text{discrete time} \end{cases}    (157)
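As a numerical illustration of Eqs.(139) and (157), the following minimal sketch maps a continuous-time eigenvalue to the delta domain and evaluates both branches of f. The eigenvalue value and the function names are hypothetical choices made only for this example; the sampling period matches the 0.0025 seconds used later in the report.

```python
import cmath
import math

def delta_eigenvalue(lam, delta):
    """Delta-domain eigenvalue of Eq.(139): (exp(lambda*Delta) - 1) / Delta."""
    return (cmath.exp(lam * delta) - 1.0) / delta

def f_continuous(lam):
    """Continuous-time branch of Eq.(157): alpha = -Re(lambda)."""
    return -lam.real

def f_discrete(gamma, delta):
    """Discrete-time branch of Eq.(157): (1/Delta) * [1 - |1 + Delta*gamma|]."""
    return (1.0 - abs(1.0 + delta * gamma)) / delta

lam = complex(-2.0, 3.0)   # hypothetical stable eigenvalue (assumption)
delta = 0.0025             # sampling period in seconds
gamma = delta_eigenvalue(lam, delta)
alpha = f_continuous(lam)
fd = f_discrete(gamma, delta)
print(gamma, alpha, fd)
```

For small sampling periods the discrete-time value of f approaches alpha, which is why the continuous and discrete results can share the single form of Eq.(156).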


Substitute Eq.(156) into Eq.(145) and use Eq.(146) to obtain

    S_0^\infty z^+(t)\,dt \le D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} D^{-1} z^+(0) + D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{pcmax} D^{-1} S_0^\infty z^+(t)\,dt    (158)


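Take norms in Eq.(158) and rearrange to obtain

    \Big\| S_0^\infty z^+(t)\,dt \Big\| \le \frac{ \Big\| D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} D^{-1} z^+(0) \Big\| }{ 1 - \Big\| D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{pcmax} D^{-1} \Big\| }    (159)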


Thus, \| S_0^\infty z^+(t)\,dt \| < \infty, which implies S_0^\infty \|z^+(t)\|\,dt < \infty, if

    \Big\| D \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{pcmax} D^{-1} \Big\| < 1    (160)


Note that in continuous time, x(t) and z(t) are uniformly continuous on (0,∞)
because of the linearity of the uncertain closed loop plant, which together
with Eq.(160) implies that ||z^+(t)|| → 0 as t → ∞ (see Hsu and Meyer 1968). This
implies that ||x(t)|| → 0 as t → ∞, which proves that the linear uncertain
closed loop plant is asymptotically stable. In discrete time, the same result
follows without the need for uniform continuity.

Finally, Perron weightings may be used for the matrix D in Eq.(160) in
the same manner as shown by Sobel et al. (1989) for an earlier robustness
result. Thus, to reduce conservatism, Eq.(160) may be replaced by

    \lambda_{max}\Big( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{f(\gamma_i)} A_{pcmax} \Big) < 1    (161)

where \lambda_{max}(\cdot) of a non-negative matrix denotes the real non-negative
eigenvalue \lambda_{max} \ge 0 such that \lambda_{max} \ge |\lambda_i| for all eigenvalues \lambda_i.

Q.E.D.
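The test in Eq.(161) needs the Perron eigenvalue of a non-negative matrix. The sketch below estimates it by power iteration, which is one possible numerical approach and not necessarily the one used for this report. It assumes the matrix is primitive (for example, strictly positive) so that the iteration converges; the 2x2 matrix N is a hypothetical stand-in for the bracketed matrix in Eq.(161).

```python
def perron_eigenvalue(N, iters=1000, tol=1e-13):
    """Estimate the largest (Perron) eigenvalue of a non-negative square matrix.

    Power iteration with max-norm scaling; convergence is assumed, which holds
    for primitive matrices but not for every non-negative matrix.
    """
    n = len(N)
    x = [1.0] * n
    lam = 0.0
    for _ in range(iters):
        y = [sum(N[i][j] * x[j] for j in range(n)) for i in range(n)]
        new_lam = max(y)
        if new_lam == 0.0:
            return 0.0
        x = [v / new_lam for v in y]
        if abs(new_lam - lam) < tol:
            return new_lam
        lam = new_lam
    return lam

# Hypothetical non-negative test matrix (assumption, not data from the report).
N = [[0.2, 0.1],
     [0.3, 0.4]]
lam_max = perron_eigenvalue(N)
is_robust = lam_max < 1.0   # sufficient condition in the form of Eq.(161)
print(lam_max, is_robust)
```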

3.3 Application to EMRAAT Missile

Consider the Extended Medium Range Air to Air Technology (EMRAAT)
bank-to-turn missile which is described by Bossi and Langehough (1988).
A seventh order model of the yaw/roll dynamics at a 10 degree angle of
attack is considered which includes the rigid body dynamics, two first
order actuator models, and a yaw rate washout filter. The state vector,
control vector, and measurement vector are given by
x = [\beta, r, p, p_I, \delta_r, \delta_a, x_7]^T, u = [\delta_{rc}, \delta_{ac}]^T, and y = [\beta, r_{wo}, p, p_I]^T,
respectively. Here \beta is the sideslip angle (deg), r is yaw rate
(deg/sec), p is roll rate (deg/sec), p_I is integrated roll rate (deg),
\delta_r is rudder deflection (deg), \delta_a is aileron deflection (deg), x_7 is
the yaw rate washout filter state, and r_{wo} is washed out yaw rate
(deg/sec). The state space matrices A, B, and C are shown below:
A =
    -.5007   -.9945    .1736    0     .109      .00691     0
    16.83    -.5748    .01233   0   -132.8     27.19       0
  -322.7      .3208  -2.099     0  -1620.0  -1240.0        0
     0        0        1        0      0         0         0
     0        0        0        0   -179         0         0
     0        0        0        0      0      -179         0
     0        5        0        0      0         0        -5

C =
     1   0   0   0   0   0   0
     0   1   0   0   0   0  -1
     0   0   1   0   0   0   0
     0   0   0   1   0   0   0

First, we design an eigenstructure assignment control law by using
an orthogonal projection. The delta state space matrices A_\delta and B_\delta are
computed by using the MATLAB™ Delta Toolbox. The sampling period \Delta is
chosen to be 0.0025 seconds for illustrative purposes. The desired
dutch roll and roll mode eigenvalues are achieved exactly because four
measurements are available for feedback. The desired dutch roll
eigenvectors are chosen to yield a complex mode which is composed of
sideslip angle and yaw rate with no coupling to roll rate and
integrated roll rate. The desired roll mode eigenvectors are chosen to
yield a complex mode which is composed of roll rate and integrated roll
rate with no coupling to sideslip angle and yaw rate. Then, the
achievable eigenvectors are computed by using the orthogonal projection
of the i-th desired eigenvector v_i^d onto the subspace which is spanned
by the columns of (\gamma_i I - A_\delta)^{-1} B_\delta. The closed loop delta eigenvalues
\gamma_i and the feedback gain matrix F_\delta are shown in Table 2. The
desired and achievable closed loop eigenvectors are shown in Table 3.
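The orthogonal projection step can be sketched as follows. This is a minimal illustration, assuming the columns of (\gamma_i I - A_\delta)^{-1} B_\delta have already been computed elsewhere and are passed in as lists; the helper names and the small numerical example are hypothetical. The projection is formed by Gram-Schmidt orthonormalization of the columns, which gives the same result as the least squares projection onto their span.

```python
def vdot(a, b):
    """Inner product a^H b for real or complex vectors stored as lists."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def project_onto_span(columns, v):
    """Orthogonal projection of v onto the span of the given column vectors."""
    basis = []
    for c in columns:
        w = list(c)
        for q in basis:                      # remove components along earlier basis vectors
            coef = vdot(q, w)
            w = [wi - coef * qi for wi, qi in zip(w, q)]
        norm = abs(vdot(w, w)) ** 0.5
        if norm > 1e-12:                     # skip (numerically) dependent columns
            basis.append([wi / norm for wi in w])
    out = [0j] * len(v)
    for q in basis:                          # sum of projections onto the orthonormal basis
        coef = vdot(q, v)
        out = [oi + coef * qi for oi, qi in zip(out, q)]
    return out

# Hypothetical 3-state example: two basis columns and a desired eigenvector.
columns = [[1, 0, 0], [1, 1, 0]]
v_desired = [1, 2, 3]
v_achievable = project_onto_span(columns, v_desired)
print(v_achievable)
```

The component of the desired eigenvector that lies outside the achievable subspace (the third entry here) is discarded, which is why the achievable eigenvectors in general differ from the desired ones.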


TABLE 2. COMPARISON OF EMRAAT DESIGNS (\Delta = 0.0025 sec)

Design                  Closed Loop Eigenvalues          Feedback Gain Matrix
                                                               \beta     r_wo     p       p_I
Orthogonal Projection   \gamma_dr     = -23.65 ± j16.97  \delta_rc  -5.43    .231    .0043    .959
Design                  \gamma_roll   = -9.98 ± j10.16   \delta_ac   4.42   -.285    .0050  -1.09
                        \gamma_act    = -130.3
                        \gamma_act    = -103.8
                        \gamma_filter = -6.97

Robust Design           \gamma_dr     = -18.26 ± j14.15  \delta_rc  -4.16    .196    .0154   1.27
                        \gamma_roll   = -61.39 ± j99.70  \delta_ac   2.36   -.215    .0757   2.16
                        \gamma_act    = -112.8
                        \gamma_act    = -48.61
                        \gamma_filter = -7.42

Eigenvalues are computed by using feedback gains which are rounded to
three significant digits.
TABLE 3. EIGENVECTORS FOR THE EMRAAT DESIGNS^a


Desired Closed Loop Eigenvectors:


^a Eigenvalues are computed by using feedback gains which are rounded to
three significant digits. Actuator mode eigenvectors are not shown.
In this report, we propose a new robust design which minimizes
the integrated roll rate (which is approximately the bank angle)
subject to constraints on the time constants of the dutch roll and roll
modes, the damping ratios of the dutch roll and roll modes, the aileron
and rudder deflection rates, and the new sufficient condition for
robust stability. Mathematically, the objective function to be
minimized is given by

    J = \sum_{k} [p_I(k\Delta)]^2    (162)
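Eq.(162) can be evaluated as in the following minimal sketch. It assumes the integrated roll rate is available as a callable of time (in practice it would come from the linear simulation mentioned in the text) and sums over all samples with k\Delta in [0, 0.3]; the function name and the toy response used in the example are hypothetical.

```python
import math

def objective(p_i, delta=0.0025, t_end=0.3):
    """Sum of squared integrated roll rate samples, J = sum_k p_I(k*Delta)^2.

    p_i   : callable returning the integrated roll rate (deg) at time t (sec)
    delta : sampling period (sec)
    t_end : end of the summation interval, so that k*delta covers [0, t_end]
    """
    k_last = int(round(t_end / delta))
    return sum(p_i(k * delta) ** 2 for k in range(k_last + 1))

# Toy decaying response, used only to exercise the function (assumption).
J = objective(lambda t: -0.1 * math.exp(-20.0 * t))
print(J)
```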


The upper limit on the index k is chosen to include the time interval
k\Delta ∈ [0, 0.3] during which most of the transient response occurs. Of
course, computation of Eq.(162) requires that a linear simulation be
performed during each function evaluation of the optimization. The
constraints are shown below where \zeta is the damping ratio.

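    |1+\Delta\gamma_{dr}| \in [e^{-50\Delta},\, e^{-6\Delta}]    (163)

    |1+\Delta\gamma_{roll}| \in [e^{-50\Delta},\, e^{-6\Delta}]    (164)

    |1+\Delta\gamma_{rudder}| < e^{-50\Delta}    (165)

    |1+\Delta\gamma_{aileron}| < e^{-50\Delta}    (166)

    |1+\Delta\gamma_{dr}| \in \Big[ \exp\Big( \frac{-0.80\,\phi_{dr}}{[1-(0.8)^2]^{1/2}} \Big),\; \exp\Big( \frac{-0.40\,\phi_{dr}}{[1-(0.4)^2]^{1/2}} \Big) \Big]    (167)

where \phi_{dr} = \arg(1+\Delta\gamma_{dr})

    |1+\Delta\gamma_{roll}| \in \Big[ \exp\Big( \frac{-0.80\,\phi_{roll}}{[1-(0.8)^2]^{1/2}} \Big),\; \exp\Big( \frac{-0.40\,\phi_{roll}}{[1-(0.4)^2]^{1/2}} \Big) \Big]    (168)

where \phi_{roll} = \arg(1+\Delta\gamma_{roll})

    \frac{ |\delta_a[(k+1)\Delta] - \delta_a(k\Delta)| }{ \Delta } < 275 deg/sec    (169)

    \frac{ |\delta_r[(k+1)\Delta] - \delta_r(k\Delta)| }{ \Delta } < 275 deg/sec    (170)

    \lambda_{max}\Big\{ \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} [A_{\delta max} + B_{\delta max}(F_\delta C)^+] \Big\} < 0.999    (171)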


where for illustrative purposes we have chosen A_{max} = 0.04·A^+ and
B_{max} = 0 with the exception that the elements of A_{max} which correspond
to actuator or washout filter time constants have been set to zero. The
matrices A_{\delta max} and B_{\delta max} are computed from Eqs.(82) and (83),
respectively.
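The damping ratio constraints of Eqs.(167) and (168) follow from writing 1+\Delta\gamma = exp(\lambda\Delta) with \lambda = -\zeta\omega_n ± j\omega_n(1-\zeta^2)^{1/2}, so that |1+\Delta\gamma| = exp(-\zeta\phi/(1-\zeta^2)^{1/2}) with \phi = arg(1+\Delta\gamma). The sketch below checks this relation numerically; the function names, the natural frequency, and the damping values are hypothetical choices for illustration.

```python
import cmath
import math

def damping_ratio(z):
    """Recover zeta from z = 1 + Delta*gamma using |z| = exp(-zeta*phi/sqrt(1-zeta^2)).

    Not defined for z = 1 (log magnitude and angle both zero).
    """
    log_r = math.log(abs(z))
    phi = abs(cmath.phase(z))
    return -log_r / math.hypot(log_r, phi)

def within_damping_bounds(z, zeta_min=0.4, zeta_max=0.8):
    """Magnitude test in the form of Eqs.(167)-(168) for damping in [zeta_min, zeta_max]."""
    phi = abs(cmath.phase(z))
    lower = math.exp(-zeta_max * phi / math.sqrt(1.0 - zeta_max ** 2))
    upper = math.exp(-zeta_min * phi / math.sqrt(1.0 - zeta_min ** 2))
    return lower <= abs(z) <= upper

delta = 0.0025
# Hypothetical mode: zeta = 0.6, natural frequency 20 rad/sec (assumption).
lam_ok = complex(-0.6 * 20.0, 20.0 * math.sqrt(1.0 - 0.6 ** 2))
# Hypothetical lightly damped mode: zeta = 0.2 (assumption).
lam_bad = complex(-0.2 * 20.0, 20.0 * math.sqrt(1.0 - 0.2 ** 2))
z_ok = cmath.exp(lam_ok * delta)
z_bad = cmath.exp(lam_bad * delta)
print(damping_ratio(z_ok), within_damping_bounds(z_ok), within_damping_bounds(z_bad))
```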

The actuator deflection rates are computed from the slopes of the time
responses of the deflections during the time interval k\Delta ∈ [0, 0.03].
This interval is chosen because the slopes of the deflections are
largest during this time interval. Mathematically,

    \frac{ |\delta_a[(k+1)\Delta] - \delta_a(k\Delta)| }{ \Delta } \approx \max_m \frac{ |\delta_a(m\Delta)| }{ m\Delta }    (172)

    \frac{ |\delta_r[(k+1)\Delta] - \delta_r(k\Delta)| }{ \Delta } \approx \max_m \frac{ |\delta_r(m\Delta)| }{ m\Delta }    (173)

where m = 0, 1, ..., 12. The maximum deflection rates chosen for the
constraints are well within the expected 400 deg/sec limit for the
advanced state of the art electromechanical actuator described by
Langehough and Simons (1988).
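A deflection rate constraint of the form of Eqs.(169) and (170) can be checked on sampled data with a simple finite difference, as in this minimal sketch. The function name and the sample values are hypothetical; a real check would use the simulated deflection histories.

```python
def max_deflection_rate(samples, delta):
    """Largest finite-difference slope |d[k+1] - d[k]| / delta over the samples (deg/sec)."""
    return max(abs(b - a) / delta for a, b in zip(samples, samples[1:]))

delta = 0.0025                      # sampling period (sec)
deflection = [0.0, 0.5, 0.6]        # hypothetical deflection samples (deg)
rate = max_deflection_rate(deflection, delta)
satisfies_limit = rate < 275.0      # rate limit used in the constraints
print(rate, satisfies_limit)
```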

The parameter vector contains the quantities which may be varied by
the optimization. This twelve dimensional vector includes Re \gamma_{dr},
Im \gamma_{dr}, Re \gamma_{roll}, Im \gamma_{roll}, Re z_1(1), Re z_1(2), Im z_1(1), Im z_1(2),
Re z_3(1), Re z_3(2), Im z_3(1), Im z_3(2). Here, the two dimensional
complex vectors z_i contain the free eigenvector parameters. That is,
the i-th eigenvector v_i may be written as

    v_i = L_i z_i    (174)

where the columns of L_i = (\gamma_i I - A_\delta)^{-1} B_\delta are a basis for the subspace in
which the i-th eigenvector must reside. Thus, the free parameters are
the vectors z_i rather than the eigenvectors v_i.

The optimization is performed by using subroutine constr from the
MATLAB™ Optimization Toolbox (Grace 1990) and subroutine delsim from
the MATLAB™ Delta Toolbox on a 486™ 25MHz personal computer. The
optimization is initialized with the orthogonal projection design which
yields an initial value of 9.1360 for the objective function of
Eq.(162) and a value of 2.2014 for the left hand side (LHS) of the
robustness condition of Eq.(10). The optimization is complete after
3640 function evaluations and yields an optimal objective function of
0.0895 and a value of 0.999 for the LHS of the robustness condition of
Eq.(10). We observe from Table 1 that the dutch roll mode is dominant
in the robust design whereas the roll mode was chosen to be dominant in
the orthogonal projection design.
The time histories of sideslip angle, yaw rate, roll rate,
integrated roll rate, rudder deflection, and aileron deflection to a
one degree initial sideslip are shown in figures 10, 11, 12, 13, 14,
and 15, respectively. We observe a significant improvement in the
integrated roll rate response (which is desired to be zero) when
compared to the initial orthogonal projection eigenstructure assignment
design. The initial design has a minimum p_I(t) of -.464 deg but the new
design of this paper has a minimum p_I(t) of -.103 deg which is an
improvement of approximately 78%. We note that this improved response
is obtained with both smaller aileron and rudder deflections.
Furthermore, it is interesting that the aileron in the initial design
exhibits an initial positive deflection of approximately three degrees
before becoming negative, whereas this positive initial aileron
deflection is only approximately one degree in the robust design.
Fig. 10. Sideslip angle (delta model; \beta(0) = 1 deg)

Fig. 11. Yaw rate (delta model; \beta(0) = 1 deg)

Fig. 12. Roll rate (delta model; \beta(0) = 1 deg)

Fig. 13. Integrated roll rate (delta model; \beta(0) = 1 deg)

Fig. 14. Rudder deflection (delta model; \beta(0) = 1 deg)

Fig. 15. Aileron deflection (delta model; \beta(0) = 1 deg)
Next, we compute the time responses due to a "1-cosine" sideslip
gust as described in MIL-F-8785-C. The state equations for the lateral
dynamics are shown by McRuer et al. (1973) to be given by


For the flight condition of the EMRAAT missile considered in this
paper, the parameters N_r' and L_r' are -0.5748 and 0.3208, respectively.
The inertias corresponding to a full fuel condition are I_x = 11451,
I_z = 456282, and I_xz = -1189. Using these values, the gust derivative
coefficients are computed to be (N_r)_g = -0.5742 and (L_r)_g = 0.05962.

The gust is defined as shown in MIL-F-8785-C and is described by

    \beta_g = \begin{cases} 0 & t < 0 \\ 0.5(1-\cos 24\pi t) & 0 \le t \le 1/24 \\ 1 & t > 1/24 \end{cases}    (181)

where the natural frequency of the open loop complex eigenvalue pair is
24.04 rad/sec.
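The gust profile of Eq.(181) is straightforward to evaluate; the following minimal sketch (with a hypothetical function name) may be convenient when driving a simulation with this input.

```python
import math

def sideslip_gust(t):
    """'1-cosine' sideslip gust of Eq.(181); t is time in seconds."""
    if t < 0.0:
        return 0.0
    if t <= 1.0 / 24.0:
        return 0.5 * (1.0 - math.cos(24.0 * math.pi * t))
    return 1.0

samples = [sideslip_gust(k / 96.0) for k in range(6)]   # t = 0, 1/96, ..., 5/96 sec
print(samples)
```

The two branches agree at t = 1/24 sec, where cos(24\pi t) = -1, so the gust rises smoothly from 0 to 1 and then stays constant.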

The time histories of sideslip angle, yaw rate, roll rate,
integrated roll rate, rudder deflection, and aileron deflection to
the "1-cosine" sideslip gust are shown in figures 16, 17, 18, 19, 20,
and 21, respectively. We observe a significant improvement in the
integrated roll rate response (which is desired to be zero) when
compared to the initial orthogonal projection eigenstructure assignment
design. The initial design has a maximum p_I(t) of 18.87 deg but the new
design of this paper has a maximum p_I(t) of 0.6990 deg which is an
improvement of approximately 96%. We note that this improved response
is obtained with both smaller aileron and rudder deflections.
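The quoted improvement percentages can be reproduced from the peak values of the integrated roll rate given in the text. The helper below is a hypothetical name used only for this arithmetic check.

```python
def percent_improvement(initial_peak, new_peak):
    """Relative reduction in peak magnitude, in percent."""
    return 100.0 * (abs(initial_peak) - abs(new_peak)) / abs(initial_peak)

initial_sideslip_case = percent_improvement(-0.464, -0.103)   # one degree initial sideslip
gust_case = percent_improvement(18.87, 0.6990)                # "1-cosine" sideslip gust
print(round(initial_sideslip_case), round(gust_case))
```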


Fig. 20. Rudder deflection (delta model; \beta_g = 1-cosine)
4. ROBUST EIGENSTRUCTURE ASSIGNMENT FOR TIME DELAY SYSTEMS


Consider a nominal linear time invariant system with time delay
described by

    \dot{x}(t) = A x(t) + A_o x(t-\tau) + B u(t)    (182)

    y(t) = C x(t)    (183)

where x ∈ R^n is the state vector, u ∈ R^m is the input vector, y ∈ R^r is
the output vector, and A, B, C, and A_o are constant matrices.

Suppose that the nominal time delay system is subject to linear
time-varying parametric uncertainty in the entries of A, B, C, and A_o
described by \Delta A(t), \Delta B(t), \Delta C(t), and \Delta A_o(t), respectively. We shall
assume that the entries of \Delta A(t), \Delta B(t), \Delta C(t), and \Delta A_o(t) are
uniformly continuous for t ∈ (-∞, ∞). Then, the uncertain time delay
system is given by

    \dot{x}(t) = [A+\Delta A(t)] x(t) + [A_o+\Delta A_o(t)] x(t-\tau) + [B+\Delta B(t)] u(t)    (184)

    y(t) = [C+\Delta C(t)] x(t)    (185)

Further, suppose that bounds are available on the absolute values of
the maximum variations in the elements of \Delta A(t), \Delta B(t), \Delta C(t), and
\Delta A_o(t). That is,

    |\Delta a_{ij}(t)| \le (a_{ij})_{max} ;   i = 1,...,n;  j = 1,...,n    (186)

    |\Delta b_{ij}(t)| \le (b_{ij})_{max} ;   i = 1,...,n;  j = 1,...,m    (187)

    |\Delta c_{ij}(t)| \le (c_{ij})_{max} ;   i = 1,...,r;  j = 1,...,n    (188)

    |\Delta a_{oij}(t)| \le (a_{oij})_{max} ;  i = 1,...,n;  j = 1,...,n    (189)
Define \Delta A^+(t), \Delta B^+(t), \Delta C^+(t), and \Delta A_o^+(t) as the matrices obtained by
replacing the entries of \Delta A(t), \Delta B(t), \Delta C(t), and \Delta A_o(t) by their
absolute values. Also, define A_m, B_m, C_m, and A_{mo} as the matrices with
entries (a_{ij})_{max}, (b_{ij})_{max}, (c_{ij})_{max}, and (a_{oij})_{max}, respectively.
Then,

    \{ \Delta A(t) : \Delta A^+(t) \le A_m \}    (190)

    \{ \Delta B(t) : \Delta B^+(t) \le B_m \}    (191)

    \{ \Delta C(t) : \Delta C^+(t) \le C_m \}    (192)

    \{ \Delta A_o(t) : \Delta A_o^+(t) \le A_{mo} \}    (193)

where "≤" is applied element by element to matrices and A_m ∈ R_+^{n×n},
B_m ∈ R_+^{n×m}, C_m ∈ R_+^{r×n}, A_{mo} ∈ R_+^{n×n}, where R_+ is the set of non-negative
numbers.

Consider the constant gain output feedback control law described
by

    u(t) = F y(t)    (194)

Then, the nominal closed loop system is given by

    \dot{x}(t) = (A+BFC) x(t) + A_o x(t-\tau)    (195)

and the uncertain closed loop system is given by

    \dot{x}(t) = (A+\Delta A) x(t) + (A_o+\Delta A_o) x(t-\tau) + (B+\Delta B) F (C+\Delta C) x(t)
               = (A+BFC) x(t) + (\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) x(t) + (A_o+\Delta A_o) x(t-\tau)    (196)

Finally, the stability robustness problem can be stated as follows:
Given a feedback gain matrix F ∈ R^{m×r} such that the nominal closed loop
system exhibits desirable dynamic performance, determine if the
uncertain closed loop system is asymptotically stable for all \Delta A(t),
\Delta B(t), \Delta C(t), and \Delta A_o(t) described by Eqs.(190)-(193).


We now present the first robust stability theorem which extends the
result of Sobel et al. (1989) to time delay systems.

Theorem 8: Suppose that F is such that the nominal closed loop system
without time delay described by \dot{x}(t) = (A+BFC)x(t) is asymptotically
stable with a non-defective modal matrix. Then, the uncertain closed
loop system with time delay given by Eq.(196) is asymptotically stable
for all \Delta A(t), \Delta B(t), \Delta C(t) and \Delta A_o(t) described by Eqs.(190)-(193) if

    \alpha > \| (M^{-1})^+ [A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m] M^+ \|_2 + \exp(\alpha\tau)\, \| (M^{-1})^+ (A_o^+ + A_{mo}) M^+ \|_2    (197)

where

    \alpha = -\max_i Re[\lambda_i(A+BFC)]    (198)

and where M is a modal matrix of (A+BFC).

Proof:

    \dot{x}(t) = (A+\Delta A) x(t) + (A_o+\Delta A_o) x(t-\tau) + (B+\Delta B) u(t)    (199)

    y(t) = (C+\Delta C) x(t)    (200)

    u(t) = F y(t)    (201)

    \dot{x}(t) = (A+\Delta A) x(t) + (A_o+\Delta A_o) x(t-\tau) + (B+\Delta B) F (C+\Delta C) x(t)
               = (A+BFC) x(t) + (\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) x(t) + (A_o+\Delta A_o) x(t-\tau)    (202)

We begin with two preliminary lemmas.


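Lemma 2: \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \|_2 \le \| (M^{-1})^+ \{ A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m \} M^+ \|_2

Proof: Use the result given by Kouvaritakis and Latchman (1985) that
for any matrix A ∈ C^{n×m}

    \| A \|_2 \le \| A^+ \|_2

Thus

    \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \|_2 \le \| \{ M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \}^+ \|_2

        = \sup_x \frac{ \| \{ M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \}^+ x \|_2 }{ \| x \|_2 }
        = \sup_x \frac{ \| \{ M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \}^+ x^+ \|_2 }{ \| x^+ \|_2 }

        \le \sup_x \frac{ \| (M^{-1})^+ (\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C)^+ M^+ x^+ \|_2 }{ \| x^+ \|_2 }

        \le \sup_x \frac{ \| (M^{-1})^+ \{ A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m \} M^+ x^+ \|_2 }{ \| x^+ \|_2 }

        = \| (M^{-1})^+ \{ A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m \} M^+ \|_2    (203)

Q.E.D.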

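Lemma 3: \| M^{-1}(A_o+\Delta A_o) M \|_2 \le \| (M^{-1})^+ (A_o^+ + A_{mo}) M^+ \|_2

Proof:

    \| M^{-1}(A_o+\Delta A_o) M \|_2 \le \| \{ M^{-1}(A_o+\Delta A_o) M \}^+ \|_2

        = \sup_x \frac{ \| \{ M^{-1}(A_o+\Delta A_o) M \}^+ x \|_2 }{ \| x \|_2 }
        = \sup_x \frac{ \| \{ M^{-1}(A_o+\Delta A_o) M \}^+ x^+ \|_2 }{ \| x^+ \|_2 }

        \le \sup_x \frac{ \| (M^{-1})^+ (A_o+\Delta A_o)^+ M^+ x^+ \|_2 }{ \| x^+ \|_2 }
        \le \sup_x \frac{ \| (M^{-1})^+ (A_o^+ + A_{mo}) M^+ x^+ \|_2 }{ \| x^+ \|_2 }

        = \| (M^{-1})^+ (A_o^+ + A_{mo}) M^+ \|_2    (204)

Q.E.D.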
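Returning to the proof of theorem 8, we let x(t) = M z(t);
then z(t) = M^{-1} x(t), \Lambda = M^{-1}(A+BFC)M, and

    z(t) = e^{\Lambda t} z(0) + \int_0^t e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s)\,ds
           + \int_0^t e^{\Lambda(t-s)} M^{-1}(A_o+\Delta A_o) M z(s-\tau)\,ds    (205)

Since z(t) is causal, the second integral should start from \tau,

    z(t) = e^{\Lambda t} z(0) + \int_0^t e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s)\,ds
           + \int_\tau^t e^{\Lambda(t-s)} M^{-1}(A_o+\Delta A_o) M z(s-\tau)\,ds    (206)

Let u = s-\tau, i.e. s = u+\tau, ds = du, then

    z(t) = e^{\Lambda t} z(0) + \int_0^t e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s)\,ds
           + \int_0^{t-\tau} e^{\Lambda(t-u-\tau)} M^{-1}(A_o+\Delta A_o) M z(u)\,du    (207)

Take the 2-norm on both sides of the above equation to obtain

    \|z(t)\| = \Big\| e^{\Lambda t} z(0) + \int_0^t e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s)\,ds
               + \int_0^{t-\tau} e^{\Lambda(t-u-\tau)} M^{-1}(A_o+\Delta A_o) M z(u)\,du \Big\|    (208)

    \le \| e^{\Lambda t} z(0) \| + \Big\| \int_0^t e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s)\,ds \Big\|
        + \Big\| \int_0^{t-\tau} e^{\Lambda(t-u-\tau)} M^{-1}(A_o+\Delta A_o) M z(u)\,du \Big\|    (209)

    \le \| e^{\Lambda t} z(0) \| + \int_0^t \| e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s) \|\,ds
        + \int_0^{t-\tau} \| e^{\Lambda(t-u-\tau)} M^{-1}(A_o+\Delta A_o) M z(u) \|\,du    (210)

    \le \| e^{\Lambda t} z(0) \| + \int_0^t \| e^{\Lambda(t-s)} M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M z(s) \|\,ds
        + \int_0^{t} \| e^{\Lambda(t-u-\tau)} M^{-1}(A_o+\Delta A_o) M z(u) \|\,du    (211)

    \le \| e^{\Lambda t} \| \cdot \| z(0) \|
        + \int_0^t \| e^{\Lambda(t-s)} \| \cdot \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \| \cdot \| z(s) \|\,ds
        + \int_0^t \| e^{\Lambda(t-s-\tau)} \| \cdot \| M^{-1}(A_o+\Delta A_o) M \| \cdot \| z(s) \|\,ds    (212)

Using \| e^{\Lambda t} \| \le \exp(-\alpha t), where \alpha = -\max_i Re[\lambda_i(A+BFC)], we obtain

    \|z(t)\| \le \exp(-\alpha t) \cdot \|z(0)\|
        + \int_0^t \exp(-\alpha t+\alpha s) \cdot \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \| \cdot \|z(s)\|\,ds
        + \int_0^t \exp(-\alpha t+\alpha s+\alpha\tau) \cdot \| M^{-1}(A_o+\Delta A_o) M \| \cdot \|z(s)\|\,ds    (213)

    \|z(t)\| \exp(\alpha t) \le \|z(0)\|
        + \int_0^t \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \| \cdot \|z(s)\| \exp(\alpha s)\,ds
        + \int_0^t \exp(\alpha\tau) \| M^{-1}(A_o+\Delta A_o) M \| \cdot \|z(s)\| \exp(\alpha s)\,ds    (214)

Use the Gronwall lemma to obtain

    \|z(t)\| \exp(\alpha t) \le \|z(0)\| \cdot \exp\Big\{ \int_0^t [ \| M^{-1}(\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) M \| + \exp(\alpha\tau) \| M^{-1}(A_o+\Delta A_o) M \| ]\,ds \Big\}    (215)

Use Lemmas 2 and 3 to obtain

    \|z(t)\| \exp(\alpha t) \le \|z(0)\| \cdot \exp\{ ( \| (M^{-1})^+ A_{cm} M^+ \| + \exp(\alpha\tau) \| (M^{-1})^+ A_{cmo} M^+ \| )\, t \}    (216)

where A_{cm} = A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m
and A_{cmo} = A_o^+ + A_{mo}. Hence

    \|z(t)\| \le \|z(0)\| \cdot \exp\{ ( \| (M^{-1})^+ A_{cm} M^+ \| + \exp(\alpha\tau) \| (M^{-1})^+ A_{cmo} M^+ \| - \alpha )\, t \}

Thus \|z(t)\| → 0 if

    \alpha > \| (M^{-1})^+ [A_m + B_m(FC)^+ + (BF)^+ C_m + B_m F^+ C_m] M^+ \| + \exp(\alpha\tau) \| (M^{-1})^+ (A_o^+ + A_{mo}) M^+ \|

Q.E.D.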
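For a first order system (n = m = r = 1) the modal matrix is M = 1 and the 2-norms in the condition of theorem 8 reduce to absolute values, so the test can be written in a few lines. The following is a minimal sketch of that scalar special case only; the function name and all numerical values are hypothetical and are not taken from the report.

```python
import math

def theorem8_scalar(a, b, c, f, a_o, a_m, b_m, c_m, a_mo, tau):
    """Scalar form of the sufficient condition of theorem 8.

    a, b, c, a_o         : nominal plant, input, output, and delayed-state coefficients
    f                    : output feedback gain
    a_m, b_m, c_m, a_mo  : non-negative bounds on the uncertainties
    tau                  : time delay (sec)
    """
    alpha = -(a + b * f * c)            # alpha = -Re(closed loop eigenvalue)
    if alpha <= 0.0:
        return False                    # nominal delay-free loop is not asymptotically stable
    a_cm = a_m + b_m * abs(f * c) + abs(b * f) * c_m + b_m * abs(f) * c_m
    a_cmo = abs(a_o) + a_mo
    return alpha > a_cm + math.exp(alpha * tau) * a_cmo

# Hypothetical numbers: the same loop passes for a short delay and fails for a longer one.
short_delay = theorem8_scalar(-5.0, 1.0, 1.0, -1.0, 1.0, 0.5, 0.1, 0.1, 0.2, tau=0.1)
long_delay = theorem8_scalar(-5.0, 1.0, 1.0, -1.0, 1.0, 0.5, 0.1, 0.1, 0.2, tau=0.3)
print(short_delay, long_delay)
```

The factor exp(\alpha\tau) multiplying the delayed-state bound makes the condition more restrictive as the delay grows, which the two cases above illustrate. Because the condition is only sufficient, a failed test does not by itself show instability.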


We next present the second stability robustness theorem which
extends the result of theorem 2 (section 2.1) to time delay systems.

Theorem 9: Suppose that F is such that the nominal closed loop system
without time delay described by \dot{x}(t) = (A+BFC)x(t) is asymptotically
stable with a non-defective modal matrix. Then, the uncertain closed
loop system with time delay given by Eq.(196) is asymptotically stable
for all \Delta A(t), \Delta B(t), \Delta C(t) and \Delta A_o(t) described by Eqs.(190)-(193)
if

    \lambda_{max}\Big( \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} A_{cmax} + \sum_{i=1}^{n} \frac{(v_i w_i^*)^+}{\alpha_i} e^{\alpha_i\tau} A_{comax} \Big) < 1    (217)

where

    \alpha_i = -Re[\lambda_i(A+BFC)]    (218)

and where \lambda_i is the i-th eigenvalue of (A+BFC) with v_i and w_i the
corresponding right and left eigenvectors, respectively; and where (\cdot)^*
denotes the complex conjugate transpose.

Proof: The uncertain closed loop plant may be written as

    \dot{x}(t) = (A+\Delta A) x(t) + (A_o+\Delta A_o) x(t-\tau) + (B+\Delta B) F (C+\Delta C) x(t)
               = (A+BFC) x(t) + (\Delta A+\Delta BFC+BF\Delta C+\Delta BF\Delta C) x(t) + (A_o+\Delta A_o) x(t-\tau)
               = A_c x(t) + \Delta A_c x(t) + \Delta A_{co} x(t-\tau)    (219)

where

    A_c = A + BFC    (220)

    \Delta A_c = \Delta A(t) + \Delta B(t)FC + BF\Delta C(t) + \Delta B(t)F\Delta C(t)    (221)

    \Delta A_{co} = A_o + \Delta A_o(t)    (222)

which has a solution given by

    x(t) = \exp(A_c t) x(0) + \int_0^t \exp[A_c(t-s)] \Delta A_c(s) x(s)\,ds + \int_0^t \exp[A_c(t-s)] \Delta A_{co}(s) x(s-\tau)\,ds    (223)

Next, use the real, positive, diagonal transformation

    x(t) = D^{-1} z(t)    (224)

and the property that

    \exp(D A_c D^{-1} t) = D \exp(A_c t) D^{-1}    (225)

and

    \exp(A_c t) = M \exp(\Lambda t) M^{-1} = \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t}    (226)

to obtain

    z(t) = D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} D^{-1} z(0)
           + \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-s)} \Delta A_c(s) D^{-1} z(s)\,ds
           + \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-s)} \Delta A_{co}(s) D^{-1} z(s-\tau)\,ds    (227)


where M is a modal matrix of A_c; \lambda_i is the i-th eigenvalue of A_c with v_i
and w_i the corresponding right and left eigenvectors, respectively; \Lambda is
a diagonal matrix with the \lambda_i on the diagonal; and (\cdot)^* denotes complex
conjugate transpose. Note that

    \|z(t)\| → 0 implies that \|x(t)\| → 0    (228)

So it is sufficient to prove that \|z(t)\| → 0.

Since z(t) is causal, the second integral in Eq.(227) should start from
\tau.

    z(t) = D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} D^{-1} z(0)
           + \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-s)} \Delta A_c(s) D^{-1} z(s)\,ds
           + \int_\tau^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-s)} \Delta A_{co}(s) D^{-1} z(s-\tau)\,ds    (229)
In the second integral of Eq.(229) let u = s-\tau, i.e. s = u+\tau, ds = du. Then

    z(t) = D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i t} D^{-1} z(0)
           + \int_0^t D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-s)} \Delta A_c(s) D^{-1} z(s)\,ds
           + \int_0^{t-\tau} D \sum_{i=1}^{n} v_i w_i^* e^{\lambda_i(t-u-\tau)} \Delta A_{co}(u+\tau) D^{-1} z(u)\,du    (230)

Note that u is a dummy variable of integration. Therefore, we may
substitute s for u in Eq.(230).


z  (t)  s 


n  A.  t 

DT  v.w*  e  1  D  z(0) 

iii  1  1 


,t  n  A. (t-s) 


p  ^  ^  .  V  V.  o  /  __  - 

D  Y.  v.w*  e  1  AA  (s)D  z(s)ds 
Jo  i  =  l  1  1 


t-x  n  A. ( t-s-r) 


J  “Evi'1 

0  1  =  1 


AA  (s+t)D  z(s)ds 
co 


n  -a.  t 

s  D£  (v.w*)+  e  1  D  z  (0) 
i=l  1  1 


^  A. (t-s)  -I 

D  V  v.w*  e  1  AA  (s)D  z(s) 

i=l  1  1  c  J 


ds 


.t-T  r  n  A.  ( t-S-T ) 

■*  1  AA  (s+t)D  z(s) 

CO 


-t-Tr  n  ) 

'  D.£  Vl  6 

0  L  1  =  1 


ds 


n  •  -a. t 

s  D£  (v.w*)*  e  1  D  z  (0) 
i  =  l  1  1 


ft  n  +  -a. (t-s)  _  + 

1  D  F  (v.w.)  e  1  A  D  z  (s)ds 

J0  i=l  11  cmax 
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.t  n 


+  D  £  (v. w* )+e  1 

I  .  ,  11 

o  1=1 


a. ( t-s-x ) 


cmax  max  max 


comax  o  omax 


.  .  V  V.  x.  !  ^ 

A  D  z  (s)ds 

(231) 

comax 

» 

(232) 

,(FC)  + 

(233) 

(t) 

(234) 

and where we have used the property that [\exp(\lambda_i t)]^+ = \exp(-\alpha_i t).
Next, integrate both sides of Eq.(231) to obtain

-oo  n  .oo  -a.  t  , 

z  (t)dt  s  DJ]  (v.w*)  |  e  1  dt*D  z+(0) 

^o  i=l  1  1  o 

.oo  t  n  ^  -a.(t-s)  , 

+  D  £  (v.w*)+  e  1  A  D  z  (s)dsdt 
11  cmax 


oJ  o  i  =  l 
-oo  ,t  n 


aV  »*  VA.  . 

D  £  (v  w*)  e  1 
o  i=l 


-a. (t-s-x) 


A  D  1z+(s)dsdt 
comax 


.oo  -a .  t 


+  lim 

R-*» 


(0) 

-a. (t-s) 


=  D£  (v.w*)+r  e  1  dt*D  *z+ 
i=l  1  1  J  o 

fH.t  n  — oc . 

D  £  (v.w*)  e  1 

oo  i = 1  11 

-B-t  n  -a. 

D  £  (v.w*)+  e  1 

oo  i=l  1  1 


A  D  1z+(s)dsdt 
cmax 


+  lim 

R-*n 


-a. (t-s-x) 


A  D  *z+(s)dsdt 
comax 


; Oss^t 


(235) 

The order of integration in Eq.(235) may be interchanged because all of
the functions inside the integrals are continuous functions of t and s
(Churchill 1972). Thus, Eq.(235) is equal to


1  im 

R-XJ0 


-R-t  n  -a. 

D  £  (v.w*)+  e  1 
oo  i=l  1  1 


-a. ( t-s ) 


A  D  *z+(s)dsdt 
cmax 
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-R-t  n  m  +  -a.(t-s-r) 

+  lim  D  [  (v.w. )  e  1  A  D  z+(s)dsdt  ;  Ossst 

„  11  comax 

R-XD  J  0  0  1  =  1 


t  .R  n 


-a. (t-s) 


=  lim  D  [  (v.w?)  e  1  AD  z+(s)dtds 

„  J _ _ J  .  „  ii  cmax 

r-xd  s=o  t=o  1  =  1 

-t  .R  n  -a.  { t-s-r ) 

+  lim  D  [  (v.w?)  e  1  AD  z+(s) 

II.  .  .  ii  comax 

R-XD  S=0J  t=0  1  =  1 


A  D  1z+(s)dtds  ;Ossst 
comax 


(236) 

Now  use  the  change  of  variables  given  by  y= t-r,  dy=dt,  y>0.  Then, 
Eq. (236)  is  equal  to 


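$$
\begin{aligned}
&\lim_{R\to\infty}\int_{s=0}^{t}\!\int_{y=-s}^{R-s} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,A_{\mathrm{cmax}}\,D^{-1}z^{+}(s)\,dy\,ds \\
&\quad+\lim_{R\to\infty}\int_{s=0}^{t}\!\int_{y=-s}^{R-s} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,e^{a_i\tau}\,A_{\mathrm{comax}}\,D^{-1}z^{+}(s)\,dy\,ds\;;\quad 0\le s\le t,\ y>0 \\
&=\lim_{R\to\infty}\int_{s=0}^{t}\!\int_{y=0}^{R} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,A_{\mathrm{cmax}}\,D^{-1}z^{+}(s)\,dy\,ds \\
&\quad+\lim_{R\to\infty}\int_{s=0}^{t}\!\int_{y=0}^{R} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,e^{a_i\tau}\,A_{\mathrm{comax}}\,D^{-1}z^{+}(s)\,dy\,ds\;;\quad 0\le s\le t,\ y>0 \\
&=\lim_{R\to\infty}\int_{y=0}^{R} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,A_{\mathrm{cmax}}\,D^{-1}\,dy\int_{s=0}^{t}z^{+}(s)\,ds \\
&\quad+\lim_{R\to\infty}\int_{y=0}^{R} D\sum_{i=1}^{n}(v_i w_i^{*})^{+}\,e^{-a_i y}\,e^{a_i\tau}\,A_{\mathrm{comax}}\,D^{-1}\,dy\int_{s=0}^{t}z^{+}(s)\,ds\;;\quad a_i>0 \\
&=\lim_{R\to\infty} D\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{-a_i}\left(e^{-a_i R}-1\right)A_{\mathrm{cmax}}\,D^{-1}\int_{0}^{\infty}z^{+}(s)\,ds \\
&\quad+\lim_{R\to\infty} D\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{-a_i}\left(e^{-a_i R}-1\right)e^{a_i\tau}\,A_{\mathrm{comax}}\,D^{-1}\int_{0}^{\infty}z^{+}(s)\,ds\;;\quad a_i>0
\end{aligned}
\tag{237}
$$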
Evaluate the limit of the term outside the integral in Eq. (237), note
that s is now a dummy variable of integration, recall that the $a_i$'s are
positive, and substitute the result into Eq. (235) to obtain

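$$
\int_{0}^{\infty}z^{+}(t)\,dt \;\le\; D\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,D^{-1}z^{+}(0)
+ D\left[\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,A_{\mathrm{cmax}}
+\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,e^{a_i\tau}\,A_{\mathrm{comax}}\right]D^{-1}\int_{0}^{\infty}z^{+}(t)\,dt
\tag{238}
$$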


Take  norms  in  Eq. (238)  and  rearrange  to  obtain 


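$$
\left\|\int_{0}^{\infty}z^{+}(t)\,dt\right\| \;\le\;
\frac{\left\|D\displaystyle\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,D^{-1}z^{+}(0)\right\|}
{1-\left\|D\left(\displaystyle\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,A_{\mathrm{cmax}}
+\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,e^{a_i\tau}\,A_{\mathrm{comax}}\right)D^{-1}\right\|}
\tag{239}
$$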
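Thus, $\left\|\int_{0}^{\infty}z^{+}(t)\,dt\right\| < \infty$ if

$$
\left\|D\left(\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,A_{\mathrm{cmax}}
+\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,e^{a_i\tau}\,A_{\mathrm{comax}}\right)D^{-1}\right\| < 1
\tag{240}
$$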


Note that

$$
\left\|\int_{0}^{\infty}z^{+}(t)\,dt\right\| < \infty \quad\text{implies that}\quad \int_{0}^{\infty}\|z(t)\|\,dt < \infty
\tag{241}
$$


Also note that x(t) and z(t) are uniformly continuous on $(0,\infty)$ because
of the linearity of the uncertain closed loop plant, which together with
Eqs. (240)-(241) implies that $\|z^{+}(t)\| \to 0$ as $t \to \infty$ (Hsu and Meyer (19??)).
This implies that $\|x(t)\| \to 0$ as $t \to \infty$, which proves that the linear
uncertain closed loop plant is asymptotically stable.

Finally, Perron weightings may be used for the matrix D in Eq. (240)
in the same manner as shown by Sobel et al. (1989) for an earlier
robustness result. Thus, to reduce conservatism, Eq. (240) may be
replaced by


$$
\lambda_{\max}\left[\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,A_{\mathrm{cmax}}
+\sum_{i=1}^{n}\frac{(v_i w_i^{*})^{+}}{a_i}\,e^{a_i\tau}\,A_{\mathrm{comax}}\right] < 1
\tag{242}
$$


where $\lambda_{\max}(\cdot)$ of a non-negative matrix denotes the real non-negative
eigenvalue $\lambda_{\max} \ge 0$ such that $\lambda_{\max} \ge |\lambda_i|$ for all eigenvalues $\lambda_i$.


Q.E.D. 
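
As a numerical illustration of the Perron eigenvalue test in Eq. (242), the following sketch estimates the largest eigenvalue of a non-negative matrix by power iteration and compares it with 1. The 2x2 matrix `M` is a made-up stand-in for the bracketed matrix in Eq. (242); it is not taken from the missile design, and the function names are hypothetical. Power iteration is used here only for illustration and assumes a primitive matrix (for example, one with all entries positive).

```python
def perron_root(M, iters=200):
    """Estimate the Perron (largest) eigenvalue of a non-negative square matrix.

    Plain power iteration with max-norm scaling. Assumes M is primitive
    (e.g. all entries positive) so that the iteration converges.
    """
    n = len(M)
    x = [1.0] * n
    lam = 0.0
    for _ in range(iters):
        # y = M x
        y = [sum(M[i][j] * x[j] for j in range(n)) for i in range(n)]
        lam = max(y)
        if lam == 0.0:
            return 0.0
        # rescale so the largest component of x is 1
        x = [v / lam for v in y]
    return lam


def satisfies_perron_test(M):
    """True when the Perron eigenvalue is below 1, as required by the test."""
    return perron_root(M) < 1.0


# Hypothetical values standing in for the bracketed matrix in Eq. (242).
M = [[0.2, 0.3],
     [0.1, 0.4]]
print(perron_root(M), satisfies_perron_test(M))
```

The eigenvalues of this example matrix are 0.5 and 0.1, so the test is passed with some margin. In a real design the matrix would be assembled from the closed loop eigenvectors, the $a_i$, the delay, and the uncertainty bounds.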


We remark that the term $\exp(a_i\tau)$ appears in Eq. (240). Therefore, it
is more difficult to satisfy the sufficient condition as the time delay
$\tau$ becomes larger. If $\tau \to \infty$, the norm in Eq. (240) goes to infinity
because $a_i$ is positive for a stable system. This implies that the
sufficient condition for robust stability cannot be satisfied for a
system with infinite time delay.
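
The effect of the delay can be seen in a scalar special case. The sketch below assumes a first order system (n = 1, D = 1, and a unit eigenvector product), for which the left-hand side of the sufficient condition reduces to (dA_c + exp(a*tau)*dA_d)/a. The names `dA_c` and `dA_d` are hypothetical stand-ins for the two scalar uncertainty bounds, and the numbers are illustrative only.

```python
import math


def scalar_lhs(a, dA_c, dA_d, tau):
    """Left-hand side of the scalar condition: (dA_c + exp(a*tau)*dA_d) / a."""
    return (dA_c + math.exp(a * tau) * dA_d) / a


def delay_bound(a, dA_c, dA_d):
    """Supremum of delays tau >= 0 for which scalar_lhs(...) < 1.

    Returns 0.0 when the condition fails even without delay.
    """
    if a <= 0 or dA_d <= 0 or dA_c < 0:
        raise ValueError("expects a > 0, dA_d > 0, dA_c >= 0")
    margin = (a - dA_c) / dA_d
    if margin <= 1.0:
        return 0.0
    return math.log(margin) / a


# Illustrative numbers, not from the report.
a, dA_c, dA_d = 2.0, 0.5, 0.5
for tau in (0.0, 0.25, 0.5, 1.0):
    print(tau, scalar_lhs(a, dA_c, dA_d, tau))
print("delay bound:", delay_bound(a, dA_c, dA_d))
```

Because the exponential grows without bound in tau, any fixed pair of uncertainty bounds eventually violates the condition, which is the scalar version of the remark above.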
5. CONCLUSIONS


A  new  sufficient  condition  for  the  robust  stability  of  a  linear  time 
invariant  system  subject  to  time  varying  structured  state  space 
uncertainty  has  been  proven.  The  proof  is  based  upon  showing  that  the 
integral  of  the  norm  of  the  uniformly  continuous  system  state  is 
bounded.  A  robust  design  method  was  proposed  for  the  EMRAAT  missile 
which  minimizes  the  excursions  in  roll  rate  due  to  an  initial  sideslip 
subject  to  constraints  on  settling  time,  damping,  control  surface 
deflection  rates,  and  robustness.  The  new  robust  design  compares 
favorably  with  an  earlier  design  which  used  eigenstructure  assignment 
with  an  orthogonal  projection  for  computing  the  achievable 
eigenvectors. In particular, the maximum of the absolute value of the
integrated roll rate is reduced by approximately 80% when compared to
the earlier design.

We  have  extended  eigenstructure  assignment  to  linear  time  invariant 
plants  which  are  represented  by  the  so-called  unified  delta  model  which 
is  valid  both  for  continuous  time  and  sampled  data  operation  of  the 
plant.  We  have  shown  that  the  eigenvectors  of  the  delta  model  are 
identical  to  the  eigenvectors  of  the  continuous  time  plant  and  an 
expression  was  derived  for  the  eigenstructure  assignment  feedback  gain 
matrix  for  the  delta  model.  We  have  proposed  a  sufficient  condition 
for  the  robust  stability  of  a  linear  time  invariant  unified  delta  plant 
subject  to  linear  time  invariant  structured  state  space  uncertainty.  A 
robust  sampled  data  design  is  computed  for  the  Extended  Medium  Range 
Air  to  Air  Missile  by  minimizing  the  integral  of  the  roll  rate  with 
constraints on selected eigenvalues, actuator deflection rates, and the
sufficient  condition  for  robust  stability.  The  robust  design  is 
compared  with  an  orthogonal  projection  eigenstructure  assignment 
design.  In  particular,  the  maximum  integrated  roll  rate  to  a  "1-cosine" 
sideslip  gust  is  reduced  from  18.87  degrees  for  the  orthogonal 
projection  design  to  0.6990  degrees  for  the  robust  design. 

Finally,  we  have  presented  two  new  sufficient  conditions  for  the 
robust  stability  of  a  continuous  time  system  with  time  delay  which  is 
subjected  to  linear  time-varying  structured  state  space 
uncertainty. 
ABSTRACT


The voltage transients across the MOS-Controlled Thyristor (MCT)
are reduced to less than 50 V at 270 V dc by proper selection of the
filtering capacitor and snubber circuit. The voltage/current
characteristics of the MCT are obtained with a shunt dc motor. The
voltage/current characteristics of the MCT with a dynamic load
differ from those with a static load. A resistive load switched at a
different frequency in parallel with an inductive load reduces the
voltage transients. A digital simulation using PSpice is developed
for static and dynamic loads. The digital simulation results
verify the experimental results.
INTRODUCTION


Power electronic devices can be used for control and efficient
utilization of electrical energy. However, these devices may be
damaged by the electrical transients due to switching of the
device or the load. Unlike the gate turn-off thyristor, which can be
turned on by applying a small positive pulse to the gate but
requires about one fifth of the normal current to turn off, the
MOS-Controlled Thyristor (MCT) can be turned on and off with a
very small current pulse applied between gate and anode.
This device combines the ease of MOS gate control with the high
voltage and high current capability of the SCR [1]. The MOS-Controlled
Thyristor has the potential to be used for medium and high power
applications in the "More Electric Aircraft" at 270 V dc. The voltage
and current characteristics of the MCT depend on the nature of the
load, the voltage, and the switching frequency [2].

Very large voltage transients appear with an inductive load during
turn-on of the MCT without a snubber circuit. Voltage transients of
less than 50 volts are observed with a proper choice of polarized
snubber circuit and filtering capacitor at 270 V dc. The voltage
and current characteristics are also obtained with a shunt dc
motor. A comparison is made between the digital simulation and the
experimental results. The voltage waveforms for two static loads
connected in parallel and switched at different frequencies are
also obtained.


VOLTAGE AND CURRENT CHARACTERISTICS OF MCT WITH A STATIC LOAD

Figure 1 shows the single line diagram of the circuit used for
evaluating the voltage/current characteristics of the MCT at 270 V dc.
A three-phase 15 kVA, 240 V (line-to-line) ac generator supplies
power to a three-phase diode-bridge rectifier. Two 800 μF
capacitors are connected at the rectifier output to reduce the
ripple in the dc voltage. A filtering capacitor and a polarized
snubber circuit are connected at the MCT. A free-wheeling diode is
connected across the load to reduce the switching voltage
transients across the MCT.
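
For orientation, the dc level produced by an ideal three-phase diode bridge can be estimated from the line-to-line voltage. The sketch below uses the textbook relations for an ideal six-pulse bridge (average output 3*sqrt(2)/pi times the rms line-to-line voltage, peak output sqrt(2) times it). It ignores diode drops, source impedance, and the effect of the output capacitors, and the generator voltage setting actually used in the tests is not stated in the text, so the numbers are only indicative.

```python
import math


def bridge_average_dc(v_ll_rms):
    """Average output of an ideal six-pulse diode bridge (no filter, no drops)."""
    return 3.0 * math.sqrt(2.0) / math.pi * v_ll_rms


def bridge_peak_dc(v_ll_rms):
    """Peak output; a lightly loaded capacitor filter charges toward this value."""
    return math.sqrt(2.0) * v_ll_rms


def line_voltage_for_average_dc(v_dc):
    """RMS line-to-line voltage giving the requested ideal average dc output."""
    return v_dc * math.pi / (3.0 * math.sqrt(2.0))


print(bridge_average_dc(240.0))            # ideal average at rated 240 V
print(bridge_peak_dc(240.0))               # ideal peak at rated 240 V
print(line_voltage_for_average_dc(270.0))  # line voltage for a 270 V dc average
```

Under these idealizations, a 270 V dc average corresponds to a line-to-line voltage of roughly 200 V, somewhat below the 240 V rating quoted for the generator.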


Fig. 1. Single line diagram of the circuit to evaluate
voltage/current characteristics of MCT at 270 V dc with
a static load.

Figure 2 shows the input and output voltages of the gate drive unit.
The input is a 0 to 5 V signal supplied from a function
generator. Figure 3 shows a voltage waveform at a switching
frequency of 1 kHz for a resistive load at 263 V dc. Figures 4 and
5 show the expanded voltage waveforms during turn-on and turn-off


Fig. 2. Input and output voltage of gate drive circuit.
Top Trace: Input voltage, 5 V/cm, 0 V = 6 cm
Bottom Trace: Output voltage, 10 V/cm, 0 V = 3 cm
Sweep: 200 μs/cm
Fig. 3. Voltage waveform across the resistive load at a switching
frequency of 1 kHz.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 200 μs/cm


Fig. 4. Voltage waveform across the resistive load during
turn-on of MCT.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 500 ns/cm


Fig. 5. Voltage waveform across the resistive load during
turn-off of MCT.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 1 μs/cm
of MCT. Figure 6 shows a voltage waveform at a switching frequency
of 1 kHz for an inductive load at 263 V dc. Figures 7 and 8 show
the expanded voltage waveforms during turn-on and turn-off of the MCT
with the inductive load. A small overshoot in voltage is observed
during turn-on of the MCT. However, no voltage transient is observed
during turn-off of the MCT.

Figure 9 shows the voltage and current waveforms in the
inductive load at 260 V dc at a switching frequency of 1 kHz. The
current increases and decreases linearly during turn-on and
turn-off of the MCT due to the inductance in the load. Figure 10 shows
the single line diagram of the circuit for digital simulation
using PSpice. The supply voltage is modeled by an ideal source.
Figure 11 shows the voltage waveform across the inductive load
obtained from digital simulation at a switching frequency of 1
kHz. Figure 12 shows the expanded waveform during turn-on of the MCT.
Voltage transients appear only during turn-on of the MCT. The digital
simulation does not match the experimental results exactly
because of the difficulty of modeling the physical circuit elements in
the digital simulation.

EFFECT OF FILTERING CAPACITOR ON TURN-ON TRANSIENT VOLTAGE OF MCT

The value of the filtering capacitor of the MCT has a considerable effect
on the voltage transients during turn-on of the MCT with an inductive
load. Figure 13 shows the voltage at the cathode of the MCT during
turn-on with C = 100.2 μF and 0.2 μF. A large transient voltage
appears with C = 0.2 μF. Figure 14 shows the cathode voltage of the MCT
during turn-on for C = 200 μF and 100 μF. An increase in voltage
overshoot is observed with C = 200 μF. A value of C = 100.2 μF appears the best
Fig. 6. Voltage waveform across the inductive load at a switching
frequency of 1 kHz.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 200 μs/cm


Fig. 7. Voltage waveform across the inductive load during
turn-on of MCT.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 500 ns/cm


Fig. 8. Voltage waveform across the inductive load during
turn-off of MCT.
Vertical: 50 V/cm, 0 V = 1 cm   Sweep: 1 μs/cm


Fig. 9. Voltage and current waveforms across the inductive load
at a switching frequency of 1.4 kHz.
Top Trace: Voltage, 50 V/cm, 0 V = 1 cm, Sweep: 100 μs/cm
Bottom Trace: Current, 3.03 A/cm, 0 A = 3 cm, Sweep: 100 μs/cm


Fig. 11. Voltage waveform from digital simulation across the
inductive load at a switching frequency of 1 kHz.


Fig. 12. Voltage waveform from digital simulation across the
inductive load during turn-on of MCT.


Fig. 13. Voltage waveforms across the inductive load
at a switching frequency of 1 kHz.
Top Trace: 100.2 μF, 20 V/cm, 0 V = 1 cm, Sweep: 500 ns/cm
Bottom Trace: 0.2 μF, 50 V/cm, 0 V = 1 cm, Sweep: 10 μs/cm


Fig. 14. Voltage waveforms across the inductive load
at a switching frequency of 1 kHz.
Left Trace: 200 μF, 20 V/cm, 0 V = 1 cm, Sweep: 500 ns/cm
Right Trace: 100.2 μF, 20 V/cm, 0 V = 1 cm, Sweep: 500 ns/cm
choice for the filtering capacitor. However, the digital
simulation showed less overshoot in voltage with a higher value of
filtering capacitor.

VOLTAGE AND CURRENT CHARACTERISTICS OF MCT WITH A DYNAMIC LOAD

Figure 15 shows the single line diagram of the circuit for
evaluating the voltage/current characteristics of the MCT with a
dynamic load. A 7.5 hp, 240 V shunt dc motor is supplied current
through an MCT. The MCT is turned on during the starting of the dc
motor. Figure 16 shows the gate voltage of the MCT and the voltage
across the shunt motor. A voltage of 180 V is applied to the dc
motor when the MCT is on. The terminal voltage of the motor goes
to zero when the MCT is turned off. The terminal voltage of the
motor remains zero for approximately 450 μs before increasing to
150 V. The terminal voltage goes to 180 V when the MCT is
turned on again. Figure 17 shows the line current and the motor
terminal voltage during MCT turn-on and turn-off. The line current
immediately goes to zero when the MCT is turned off and increases
linearly due to the armature inductance when the MCT is turned on.
Figure 18 shows the armature current and the terminal voltage of
the motor during MCT turn-on and turn-off. The armature current
flows through the free-wheeling diode and decreases linearly when
the MCT is turned off. The field current remains approximately
constant during the MCT turn-off period due to the large inductance of
the field winding. Figure 19 shows the single line diagram of the
circuit for digital simulation using PSpice. The back emf of the
motor is represented by a constant voltage source. Figure 20 shows
the terminal voltage of the motor from the digital simulation when
Fig. 16. Gate voltage of MCT and terminal voltage across the motor.
Top Trace: Gate voltage, 50 V/cm, 0 V = 3 cm, Sweep: 500 μs/cm
Bottom Trace: Voltage, 50 V/cm, 0 V = 1 cm, Sweep: 500 μs/cm


Fig. 17. Line current and terminal voltage across the motor.
Top Trace: Current, 6.06 A/cm, 0 A = 5 cm, Sweep: 500 μs/cm
Bottom Trace: Voltage, 50 V/cm, 0 V = 1 cm, Sweep: 500 μs/cm


Fig. 18. Armature current and terminal voltage across the motor.
Top Trace: Current, 6.06 A/cm, 0 A = 5 cm, Sweep: 500 μs/cm
Bottom Trace: Voltage, 50 V/cm, 0 V = 1 cm, Sweep: 500 μs/cm


Fig. 19. Single line diagram of the circuit for digital simulation.



the  MCT  is  switched  at  167  Hz.  The  voltage  waveform  from  digital 
simulation  is  similar  to  the  experimental  results. 


PARALLEL LOAD SWITCHING AT DIFFERENT FREQUENCIES

Figure 21 shows the voltage waveforms across the resistive and
the inductive loads connected in parallel. The resistive load is
turned on and off by the first MCT at a frequency of 1 kHz and the
inductive load is switched at 1.5 kHz by the second MCT. Figure 22
shows the voltages across the two loads during turn-on. The turn-on
time of the MCT supplying the inductive load is greater than that of the
MCT supplying the resistive load. The addition of the resistive
load in parallel with the inductive load reduces the turn-on
transient voltage of the inductive load.
Fig. 21. Voltage waveforms across the resistive and inductive loads
connected in parallel.
Top Trace: Inductive load, 50 V/cm, 0 V = 2 cm, Sweep: 200 μs/cm
Bottom Trace: Resistive load, 50 V/cm, 0 V = 1 cm, Sweep: 200 μs/cm


Fig. 22. Voltage waveforms across the resistive and inductive loads
connected in parallel during turn-on.
Left Trace: Resistive load, 50 V/cm, 0 V = 1 cm, Sweep: 100 ns/cm
Right Trace: Inductive load, 50 V/cm, 0 V = 1 cm, Sweep: 100 ns/cm
Non-Newtonian Effects of Powder Lubricant Slurries
In Hydrostatic and Hydrodynamic Bearings

by

Dr. Don W. Dareing

OBJECTIVE OF THE RESEARCH

The  objective  of  the  research  was  to  evaluate  the  behavior  of  powder  lubricant  slurries  in 
thick  film  lubrication  applications.  The  powder  lubricant  was  graphite  and  the  carrier  fluid 
was  ethylene  glycol.  The  evaluation  was  based  on  laboratory  testing  and  mathematical 
formulations  accounting  for  non  Newtonian  rheological  properties  of  these  types  of  slurries. 

Laboratory  test  rigs  were  designed  and  fabricated  to  evaluate  the  slurry  in  a  hydrodynamic 
journal  film,  a  hydrostatic  fluid  film  and  a  squeeze  film.  Fluid  film  pressure  measurements 
were  also  taken  for  ethylene  glycol  alone  to  establish  a  baseline  set  of  data  for  the 
evaluation. Even though the potential application of this particular powder lubricant slurry
is for high temperature environments, the testing during this pilot program was conducted
at room temperature.

One of the goals was to formulate fluid film pressure equations which would account for the
non-Newtonian behavior of powder lubricant slurries. Mathematical pressure formulations
were developed concurrently with the collection of laboratory pressure data.


SIGNIFICANCE  OF  THE  RESEARCH 

The  Department  of  Defense,  NASA,  and  the  US  gas  turbine  industry  have  recently 
embarked  on  a  major  new  initiative,  the  Integrated  High  Performance  Turbine  Engine 
Technology  (IHPTET)  program.  The  goal  of  IHPTET  is  to  double  propulsion  capability 
shortly  after  the  turn  of  the  century.  With  the  advent  of  IHPTET  has  come  the  requirement 
for  significantly  advanced  gas  turbine  engine  lubricants  and  lubrication  system  components. 

One approach currently being pursued in trying to meet the IHPTET requirements is the
use of powder solid lubricants for the various lubrication system components. These
powders could be supplied to the components via a pressurized air stream or through pump
pressurized slurries containing the powder lubricant.

In developing the powder lubricant slurry approach, it is clear that the fundamental
lubrication principles must be fully understood. The proposed research will provide
fundamental insight into the rheological effects of powder slurries on hydrostatic, squeeze
and hydrodynamic lubrication films.
The results from the research will provide a basis for formulating the more complex
problem of elastohydrodynamic (EHD) lubrication of rolling contact bearings and gears.


METHOD  OF  ANALYSIS 

Much  of  the  effort  during  this  one  year  project  was  directed  at  the  design  and  fabrication 
of  laboratory  test  rigs.  These  rigs  were  needed  to  experimentally  determine  the  effects  of 
powder  lubricants  on  the  lubrication  fluid  film  pressure  of  a  carrier  fluid.  The  carrier  fluid 
used  for  this  study  is  ethylene  glycol. 

Two  test  rigs  were  built.  One  test  rig  is  based  on  a  hydrodynamic  journal  bearing.  The 
other  test  rig  is  designed  to  develop  hydrostatic  and  squeeze  films.  These  test  rigs  were 
designed  and  fabricated  during  the  first  half  of  1991.  They  were  debugged  and  instrumented 
during  the  third  quarter  of  the  year.  The  mathematical  analysis,  data  collection  and  report 
writing  took  place  during  the  last  quarter  of  1991. 

Two graduate students, Mr. Zhenming Wu and Mr. Anup Batra, were assigned to work on
this project under the direction of the principal investigator, Dr. D. W. Dareing. Even
though the funding of this research by the Air Force will formally end on December 31,
1991, these two students will continue to collect and analyze data during the spring semester
of 1992 and use the results of this research as the basis for their master's theses.
DESCRIPTION OF LABORATORY TEST EQUIPMENT


Laboratory  testing  was  an  important  aspect  of  this  one  year  project  because  it  helped  to  give 
insight  into  the  behavior  of  powder  lubricant  slurries  in  thick  film  lubrication  applications. 
Laboratory  testing  also  gave  both  faculty  and  students  an  opportunity  to  observe  lubrication 
film  performance  and  to  expand  experimental  skills  in  this  area  of  research. 

Two  test  rigs  were  designed  and  built  especially  for  this  research  project.  One  rig  was 
designed  to  study  lubricants  in  a  hydrostatic  condition.  This  rig  was  also  used  to  observe 
film  pressures  in  a  squeeze  film  mode.  The  second  rig  was  designed  to  study  lubricants 
under  a  hydrodynamic  condition.  This  rig  is  basically  an  instrumented  laboratory  journal 
bearing. 


These  two  laboratory  test  rigs  are  described  below. 

Hydrostatic  Test  Rig 

The  hydrostatic  test  rig  (shown  in  Figures  A1  and  A2)  contains  two  parallel  surfaces 
separated  by  a  lubricant.  The  lubricant  is  supplied  to  the  surfaces  by  an  external  pump 
through  an  internal  recessed  pocket.  The  pump  has  the  following  specifications: 


Make                    -  Micropump Corporation
Model                   -  415
Motor/Pump Coupling     -  Magnetic Drive
Maximum Flow Rate       -  630 ml/min
Rated Maximum Pressure  -  20 psi


The active film is dictated by the geometry of the upper surface, which has an outside
diameter of 4 inches and an inside diameter of 2 inches. The thickness of the film is
constant for a given test, but this film thickness can be changed by means of a fine
adjustment thread (48 NF). This fine thread connects the upper bearing pad to the frame.

The two surfaces in this particular test rig do not rotate, so the temperature within the
fluid film does not change appreciably from the bulk temperature of the lubricant. The
main reason for fixing the two surfaces was limited funds for the project. This feature of
the test rig, however, allowed precise measurements of predetermined settings in film
thickness by using fixed LVDT displacement sensors:

Make   -  Trans-Tek, Inc.
Model  -  DC-DC Series 240
          Model 02141-0000
The upper thrust bearing pad is mounted within its vertically moveable support so that
adjustments  can  be  made  to  the  plane  of  the  bearing  surface.  Adjustment  screws  are 
mounted  at  each  of  three  LVDT  locations  to  maintain  parallelism  between  the  upper  and 
lower  bearing  surfaces.  This  feature  was  very  important  in  collecting  consistent  data 
because  parallelism  of  the  surfaces  greatly  affected  volume  flow  rate  between  the  bearing 
surfaces. 

Laboratory  equipment  in  the  Mechanical  Engineering  Department  at  the  University  of 
Florida  does  not  include  a  profilometer.  Therefore  it  was  not  possible  to  measure  surface 
waviness  or  surface  roughness.  However,  pressure  and  flow  rate  measurements  indicate 
some  waviness  in  the  surfaces  as  measured  flow  rates  were  higher  than  calculated  flow  rates 
for  film  thickness  settings  less  than  0.002  inches. 

Fluid film pressures are measured at four (4) different locations across the film. These
pressure points are located

a)  within  the  recess  pocket 

b)  1/4  inch  from  the  inside  pocket 

c)  1/2  inch  from  the  inside  pocket 

d)  3/4  inch  from  the  inside  pocket 

The  pressure  at  the  outside  radius  of  the  film  is  ambient.  Fluid  pressures  at  these  points 
are  measured  by  dial  pressure  gages. 

The flow rate through the bearing is measured by use of a rotameter flow meter. The flow
meter was calibrated for pure ethylene glycol and for the powder slurry. The calibration
was made by pumping the liquid into a beaker of known volume and recording the time.

Hydrodynamic  Test  Rig 

The hydrodynamic test rig (shown in Figures A3 and A4) contains a rotating journal and a
nonrotating bearing. The outside diameter of the journal is 2.0 inches. The radial
clearance, c, is 0.003 inches and the clearance ratio is c/r = 0.003. The length of the active
journal bearing film is three (3) inches. The nonrotating bearing is mounted on a vertically
moveable support. The vertical position of the bearing relative to the journal can be
adjusted by means of a fine thread (48 threads per inch). By moving the bearing relative to the
journal, the fluid film geometry, and in particular the minimum film thickness, can be set
and fixed for a given test.

The  vertical  position  of  the  bearing  relative  to  the  shaft  is  measured  by  use  of  an  LVDT 
displacement  transducer.  This  LVDT  is  used  to  establish  desired  minimum  film  thicknesses 
between  the  journal  and  bearing. 

The  journal  shaft  is  mounted  on  taper  roller  bearings  that  are  fixed  to  the  frame  of  the  rig. 
The shaft is driven by an electric motor:

Make     -  Dayton Electric Mfg Co.
Model    -  2M168B
Volts    -  90 volts DC
Current  -  5.5 amps
Speed    -  1725 rpm
Power    -  0.5 hp

A variable speed control allows the motor speed to be varied continuously up to about 1725
rpm. The motor output speed is stepped up by use of a 4 to 1 timing gear pair. Journal shaft
speed can therefore go as high as 6900 rpm with this drive arrangement.

Lubricant  is  supplied  to  the  test  journal  bearing  at  the  top  side  of  the  bearing  by  a  low 
volume/low  pressure  pump.  The  lubricant  drains  into  the  pump  from  a  funnel  type 
reservoir  located  beneath  the  bearing. 

Small  holes  are  drilled  through  the  bearing  to  provide  access  for  measuring  fluid  film 
pressure  around  the  journal.  A  diagram  showing  these  pressure  ports  is  given  in  Figure  11. 
These  pressure  ports  are  located  in  a  transverse  plane  across  the  center  of  the  bearing.  In 
addition,  one  pressure  port  is  located  off  of  this  center  plane  to  indicate  the  effects  of  end 
leakage  on  fluid  film  pressure. 

A  thermocouple  was  placed  in  the  bearing  to  measure  the  effects  of  fluid  film  friction  on 
local  temperature  in  the  film.  The  temperature  rise  was  significant  and  greatly  affected  the 
viscosity  of  the  lubricants.  This  temperature  rise  is  discussed  in  a  later  section  of  the  report. 

A  special  pressure  transducer  was  made  to  measure  fluid  film  pressures  around  the  journal. 
The  design  of  the  pressure  transducer  was  suggested  by  Professor  A.  Seireg.  The  pressure 
transducer  consisted  of  a  small  cylinder  containing  strain  gauges  cemented  to  its  surface. 
The  small  cylinder  expanded  as  a  thin  walled  cylinder  when  subjected  to  pressure.  Surface 
strains  could  be  calibrated  to  indicate  pressure.  This  pressure  transducer  was  inexpensive, 
easy  to  make  and  gave  accurate  pressure  readings. 
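
The calibration idea behind such a transducer can be sketched with the standard thin-walled pressure vessel relations. The example below assumes a closed-end, linearly elastic thin-walled cylinder, for which the hoop strain is (p r / (t E)) (1 - nu/2). The dimensions and material constants are hypothetical, since the report does not give them, and the actual transducer was calibrated experimentally rather than from this formula.

```python
def hoop_strain(p, r, t, E, nu):
    """Hoop strain of a closed-end thin-walled cylinder under internal pressure.

    p: pressure (psi), r: mean radius (in), t: wall thickness (in),
    E: Young's modulus (psi), nu: Poisson's ratio.
    """
    return p * r / (t * E) * (1.0 - nu / 2.0)


def pressure_from_strain(strain, r, t, E, nu):
    """Invert hoop_strain(): pressure indicated by a measured hoop strain."""
    return strain * t * E / (r * (1.0 - nu / 2.0))


# Hypothetical aluminum cylinder: 0.25 in radius, 0.010 in wall.
r, t, E, nu = 0.25, 0.010, 10.0e6, 0.33
eps = hoop_strain(100.0, r, t, E, nu)
print(eps)                                   # strain at 100 psi
print(pressure_from_strain(eps, r, t, E, nu))  # recovers the pressure
```

The linear relation between strain and pressure is what allows the gauge output to be calibrated directly in pressure units.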
MATHEMATICAL FORMULATIONS


The mathematical formulations in this section were developed to predict the pressure
distributions in hydrostatic and squeeze films. Hydrostatic pressure equations were used to
predict pressures in the hydrostatic laboratory test rig. A comparison of predicted and
measured data is discussed in the next section. Squeeze film pressure equations and test
data are not yet available for this report; they will be gathered this spring by Mr. Wu. In
addition, hydrodynamic pressure equations have not been developed because of time
constraints. In this case, Reynolds' differential equation of lubrication is nonlinear. Mr.
Batra will focus on the solution of this equation during the spring term.

Hydrostatic  Film  Equations 

The hydrostatic pressure equations below are developed for the thrust bearing configuration
shown in Figure 1. Because of the different properties of Newtonian and non-Newtonian fluids,
two different mathematical models are applied.

For ethylene glycol, a Newtonian fluid model is applied. The following equation
describes the pressure profile beneath the working surface of the bearing.

$$
p = \frac{6\mu Q}{\pi h_0^{3}}\,\ln\frac{R}{r}
\tag{1}
$$

where

p:  Pressure (psi) at any point along the radius of the bearing,
R:  Radius (in) of the bearing,
r:  Any radius (in) between R0 and R,
Q:  Flow rate (in3/sec),
h0: Film thickness (in),
μ:  Viscosity (reyns).

This equation is from Theory and Practice of Lubrication for Engineers, by Dudley
D. Fuller (see ref. 1).
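
As a small worked example, the Newtonian radial-flow relation p = 6 μ Q ln(R/r) / (π h0^3) can be evaluated at the pressure tap locations of the hydrostatic rig. The sketch assumes R = 2 in and a recess radius R0 = 1 in (from the 4 in and 2 in diameters quoted earlier) and reads "1/4 inch from the inside pocket" as r = R0 + 0.25 in; that reading, and the function names, are assumptions. The ratio p(r)/p(R0) is independent of viscosity, flow rate, and film thickness, so no fluid properties are needed for it.

```python
import math


def newtonian_pressure(r, R, Q, h0, mu):
    """Film pressure at radius r for Newtonian radial flow between parallel plates."""
    return 6.0 * mu * Q / (math.pi * h0 ** 3) * math.log(R / r)


def pressure_fraction(r, R, R0):
    """Pressure at radius r as a fraction of the recess pressure at R0."""
    return math.log(R / r) / math.log(R / R0)


R, R0 = 2.0, 1.0  # inches, assumed from the quoted diameters
for offset in (0.0, 0.25, 0.5, 0.75):
    r = R0 + offset
    print(f"r = {r:.2f} in, p/p_recess = {pressure_fraction(r, R, R0):.3f}")
```

Under these assumptions the three taps outside the recess would read roughly 68%, 42%, and 19% of the recess pressure for a Newtonian fluid, which gives a simple check on the dial gage readings.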


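For the mixture of graphite powder and ethylene glycol, the Power Law Model is applied.
The following equation shows the relation between flow rate Q and the related parameters.

$$
Q = 2b\left(-\frac{1}{K}\frac{dp}{dx}\right)^{\frac{1}{n}}\left(\frac{h_0}{2}\right)^{\frac{2n+1}{n}}\left(\frac{n}{2n+1}\right)
\tag{2}
$$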
where

K:  Power Law constant,
n:  Power Law exponent.


This equation is given in "Non-Newtonian Behavior of Powder Lubricants Mixed with
Ethylene Glycol", by Dareing and Dayton (ref. 3). Replacing b by 2πr and dx by dr, and solving
for pressure, gives

$$
p = Q^{n} K \left(\frac{2n+1}{4\pi n}\right)^{n}\left(\frac{2}{h_0}\right)^{2n+1}\left(\frac{R^{1-n}-r^{1-n}}{1-n}\right)
$$


Figure 1 - Hydrostatic Thrust Bearing Geometry

PRESENTATION OF THE RESULTS


Initial testing with the two laboratory test rigs was conducted with pure ethylene glycol as
the lubricant. The data from this initial testing provided a check on equipment and
instrumentation through the comparison of measured pressure data with analytical
predictions of pressure based on well established equations of lubrication for Newtonian
fluids. The experimental data obtained with pure ethylene glycol also provided a
baseline from which to compare the effects of adding powdered graphite to the carrier fluid,
ethylene glycol.

Laboratory data collected with both hydrostatic and hydrodynamic test rigs are presented
below. The data include measurements for pure ethylene glycol and for the graphite slurry.


Measurements  of  Hydrostatic  Pressure  and  Flow  Rates 
Discussion  of  measurements  with  pure  ethylene  glycol. 

Initial testing with pure ethylene glycol indicated the importance of surface geometry and
parallelism between the two mating bearing surfaces. The effects of these factors on
pressure and flow rate through the bearing were magnified when the minimum film
thickness was of the order of 0.001 to 0.002 inches. Surface geometry was not measured
because of a lack of equipment. However, parallelism of the two mating surfaces was
controlled by three adjusting screws.

To control the parallelism of the mating surfaces, three LVDTs (linear variable differential
transformers) were attached to the circular edge of one bearing, 120 degrees apart. Adjusting
screws  were  also  located  at  these  points.  Since  the  planar  position  of  a  surface  is 
determined  by  three  points,  properly  adjusting  these  devices  allowed  the  film  thickness  to 
be  controlled  manually. 
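
The three-point argument above can be made concrete with a short sketch. It assumes, hypothetically, that the three LVDTs sit at 0, 120, and 240 degrees on a circle of radius `r_lvdt` and that each reports the change in gap at its own location. Fitting a plane through the three readings then gives the mean change in film thickness and the tilt of the bearing surface. The function name, the radius, and the sample readings are illustrative and are not taken from the test rig.

```python
import math

def plane_from_lvdts(d1, d2, d3, r_lvdt):
    """Fit the gap change dz = a + b*x + c*y through three LVDT readings.

    Assumption: the LVDTs sit at 0, 120 and 240 degrees on a circle of
    radius r_lvdt (inches). Returns (a, b, c), where a is the mean change
    in film thickness (in) and b, c are the slopes (in/in) along x and y.
    """
    angles = [0.0, 2.0 * math.pi / 3.0, 4.0 * math.pi / 3.0]
    readings = [d1, d2, d3]
    # The cosines and sines of three equally spaced angles sum to zero,
    # so the mean reading is the offset at the bearing center.
    a = sum(readings) / 3.0
    # Sum of cos^2 (or sin^2) over the three angles is 3/2, hence the 2/3 factor.
    b = 2.0 / (3.0 * r_lvdt) * sum(d * math.cos(t) for d, t in zip(readings, angles))
    c = 2.0 / (3.0 * r_lvdt) * sum(d * math.sin(t) for d, t in zip(readings, angles))
    return a, b, c

# Hypothetical readings (inches) for a surface lifted 0.001 in and tilted about y.
a, b, c = plane_from_lvdts(0.0014, 0.0008, 0.0008, r_lvdt=2.0)
print(f"mean gap change = {a:.4f} in, slope x = {b:.5f}, slope y = {c:.5f}")
```

Returning the screws to the position where `a`, `b`, and `c` are all zero corresponds to restoring the original LVDT readings.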

The readings of the three LVDTs change whenever the film thickness changes. When this
happens, the adjusting screws are turned until the LVDT readings return to their
original values, which restores the film thickness.

However, because the surfaces of the bearing and the base of the rig are not perfectly flat,
surface waviness causes an uneven film thickness even when the position of the bearing
surface is controlled ideally. This was a serious problem that affected
the flow-rate and pressure relations.

With small film thickness, the effect of surface unflatness is apparent and the relationship
between input pressure and flow-rate deviated significantly from theoretical predictions.
With larger film thickness, such as 0.004 or 0.005 inch, the influence of surface unflatness on
film pressure and flow rate is not so strong.

A rotameter was used in the test rig system to measure flow through the hydrostatic test
bearing. This flow meter was calibrated simply by measuring the volume that flowed into a
beaker and recording the time for this flow to take place. The calibration curve for pure
ethylene glycol is given in Figure 2.


Figure  2  -  Flow  Meter  Calibration  Curve  for  Pure  Ethylene  Glycol 

Since the flow-rate is less than 630 ml/min and both bearing surfaces are nonrotating,
the temperature increase due to fluid friction was negligible.


The purpose of the initial testing was to experimentally determine the relation between
input pump pressure and flow rate between the bearing surfaces. These experimental data
were then compared with theoretical predictions. The predicted input pressure vs flow-rate
(Figure 3) is linear according to the equations given earlier, i.e., as the flow-rate Q
increases, the inlet pressure p of the bearing increases proportionally.

The  experimental  data  shown  in  Figure  3  closely  match  the  predicted  data.  There  is 
however, a small deviation at high values of flow-rate and input pressure. This difference is
possibly due to the influence of surface unflatness. The obtained data show that a flow-rate
of  200  ml/min  requires  a  14  psi  input  pressure  if  the  film  thickness  is  0.004  inches.  But  if 
the  film  thickness  is  increased  to  0.005  inches,  a  flow-rate  of  211  ml/min  requires  only  an 
input  pressure  of  7.5  psi. 
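
As a rough check on these two operating points, the sketch below scales the first measurement to the conditions of the second. It assumes only that, for a Newtonian film with fixed viscosity and bearing geometry, inlet pressure is proportional to flow rate and inversely proportional to the cube of film thickness. The function name is illustrative.

```python
def scaled_inlet_pressure(p_ref, q_ref, h_ref, q_new, h_new):
    """Scale a reference inlet pressure to a new flow rate and film thickness.

    Assumes p is proportional to Q / h**3 (Newtonian film, same viscosity
    and bearing geometry). Units only need to be consistent between the
    reference and the new operating point.
    """
    return p_ref * (q_new / q_ref) * (h_ref / h_new) ** 3

# Reference point from the text: 200 ml/min at 0.004 in needs 14 psi.
p_est = scaled_inlet_pressure(p_ref=14.0, q_ref=200.0, h_ref=0.004,
                              q_new=211.0, h_new=0.005)
print(f"Estimated inlet pressure at 0.005 in and 211 ml/min: {p_est:.2f} psi")
```

The estimate comes out at about 7.6 psi, close to the measured 7.5 psi, which suggests the two data points are mutually consistent with the cubic dependence on film thickness.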


Figure 3 - Pump Pressure versus Flow Rate
(Pure Ethylene Glycol)

The predicted input pressure vs radius (Figures 4 and 5) is a logarithmic curve. The
theoretical curves are based on the equations given in the mathematical portion of the
report. While the measured pressure in the recess closely matches the predicted input
pressure, measured pressures elsewhere are somewhat lower than predicted. This difference
is probably due to the unevenness of the bearing surfaces.


[Plot axis: Radius (in)]


Figure 4 - Hydrostatic Pressure Distribution

* Pure Ethylene Glycol

* Film Thickness = 0.004 inches

* Flow Rate = 297 ml/min

Figure 5 - Hydrostatic Pressure Distribution

* Pure Ethylene Glycol

* Film Thickness = 0.005 inches

* Flow Rate = 475 ml/min

Discussion of measurements with the graphite/ethylene glycol slurry

The slurry mixture is made of graphite powder and ethylene glycol in a ratio of
one part by weight of graphite powder to eight parts of ethylene glycol. The rheological
properties of the slurry mixture are given by Dareing and Dayton (ref. 3). Their work shows
the non-Newtonian features of the slurry.

To  minimize  the  effects  of  settling  on  the  results,  fresh  slurry  mixtures  were  used 
periodically. 

The  rotameter  had  to  be  recalibrated  for  the  (1:8)  slurry  mixture.  The  calibration  procedure 
in  this  case  was  the  same  as  for  the  pure  ethylene  glycol.  The  calibration  curve 
corresponding  to  this  particular  slurry  mixture  is  given  in  Figure  6. 


[Plot axis: Flowmeter Reading]


Figure  6  -  Flow  Meter  Calibration  Curve  for  Powder  Slurry 
(1:8  Graphite/Ethylene  Glycol  Mixture) 
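
The beaker-and-stopwatch calibration described above can be sketched as follows. Each run gives a flow rate as collected volume divided by elapsed time, and a straight line is then fitted through the rotameter readings. The run data below are hypothetical, and the linear form of the calibration curve is an assumption rather than something stated in the report.

```python
def flow_rate_ml_per_min(volume_ml, seconds):
    """Flow rate from a volume collected in a beaker over a timed interval."""
    return volume_ml / seconds * 60.0

def fit_line(xs, ys):
    """Least-squares straight line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical runs: (rotameter reading, beaker volume in ml, time in s).
runs = [(10, 100.0, 60.0), (20, 200.0, 60.0), (30, 150.0, 30.0)]
readings = [reading for reading, _, _ in runs]
flows = [flow_rate_ml_per_min(volume, time_s) for _, volume, time_s in runs]
slope, intercept = fit_line(readings, flows)
print(f"flow (ml/min) = {slope:.2f} * reading + {intercept:.2f}")
```

Because the slurry and the pure carrier fluid differ in density and viscosity, a separate fit of this kind would be needed for each fluid, which matches the recalibration described in the text.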


[Plot axis: Input Pressure (psi)]


Film thickness h0 is the most decisive parameter in the system because it enters the
pressure and flow relations raised to nearly the third power (see the mathematical
discussion section). Therefore, the flow-rate Q is very sensitive to any change in film
thickness. A small increase or decrease in film thickness causes a large increase or
decrease in flow-rate. The test data confirm this sensitivity.

The  relation  between  input  pressure  p  and  flow-rate  Q  is  not  linear  for  the  powder  lubricant 
slurry  (Figure  7).  This  is  one  of  the  main  differences  between  the  performances  of 
Newtonian and non-Newtonian fluids. If other parameters are fixed, the input pressure is
proportional  to  the  nth  power  of  flow-rate  Q.  In  this  case,  n  is  less  than  one. 
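
The difference between the two fluid models can be illustrated with a minimal sketch. It assumes the Newtonian profile p = 6 mu Q ln(R/r) / (pi h^3) and a power law profile of the form p = K [Q (2n+1) / (4 pi n)]^n (2/h)^(2n+1) (R^(1-n) - r^(1-n)) / (1-n), which is how Equations (1) and (3) are read here. The function names and the numerical values are hypothetical and serve only to show the scaling.

```python
import math

def newtonian_pressure(q, mu, h, r, r_outer):
    """Newtonian hydrostatic film pressure at radius r (zero at r_outer)."""
    return 6.0 * mu * q / (math.pi * h ** 3) * math.log(r_outer / r)

def power_law_pressure(q, k, n, h, r, r_outer):
    """Power law film pressure at radius r (zero at r_outer).

    Assumed form: radial flow between parallel plates with b = 2*pi*r.
    For n = 1 the expression reduces to the Newtonian result with mu = k.
    """
    if abs(n - 1.0) < 1e-12:
        return newtonian_pressure(q, k, h, r, r_outer)
    coeff = (k * (q * (2.0 * n + 1.0) / (4.0 * math.pi * n)) ** n
             * (2.0 / h) ** (2.0 * n + 1.0))
    return coeff * (r_outer ** (1.0 - n) - r ** (1.0 - n)) / (1.0 - n)

# Hypothetical values: flow in in^3/s, lengths in inches, K in consistent units.
p1 = power_law_pressure(q=0.2, k=3.0e-6, n=0.8, h=0.004, r=1.0, r_outer=2.0)
p2 = power_law_pressure(q=0.4, k=3.0e-6, n=0.8, h=0.004, r=1.0, r_outer=2.0)
print(f"Doubling Q multiplies pressure by {p2 / p1:.3f} (2**0.8 = {2 ** 0.8:.3f})")
```

With n below one, doubling the flow rate raises the pressure by less than a factor of two, which is the curvature described for Figure 7. The exponent 2n+1 on film thickness is also close to three when n is close to one.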

The measured data (also in Figure 7) match the predictions well while the flow-rate is small,
and deviate somewhat when the flow rate becomes larger (Q is about 140 ml/min when h0
is 0.004 inches; Q is about 240 ml/min when h0 is 0.005 inches).


Figure 7 - Pump Pressure versus Flow Rate

(1:8 Graphite/Ethylene Glycol Mixture)

There are several reasons for these deviations. First, the change in viscosity due to the
settling of graphite powder particles makes the load carrying ability of the slurry lower
than theoretically calculated. Second, the film thickness is uneven because of the
unflatness of the bearing surface. This is the main reason for the deviation because film
thickness enters with a power of nearly three.

The predicted pressure profile along the radius of the bearing (Figures 8 and 9) is
approximately proportional to a negative nth power of r, for any r between the radius
of the recess and the edge of the bearing, according to the power law. The power factor n is
less than one. The measured data (also in Figures 8 and 9) show a close match to the
predicted data. Though there are small deviations from the predicted curve, the trend
of pressure along the radius is clear. The unevenness of film thickness
contributes to the deviation.

The data clearly show that the slurry mixture has about twice the load carrying ability of
pure ethylene glycol. This means that the apparent viscosity of the graphite slurry
mixture is about twice that of pure ethylene glycol. If the slurry mixture is used in a heavy
duty hydrostatic thrust bearing system, the system will have a larger mechanical stiffness as
a result of the addition of the graphite.

[Plot axis: Radius (in)]


Figure 8 - Hydrostatic Pressure Profile

* 1:8 Graphite/Ethylene Glycol Mixture

* Film Thickness = 0.004 inches

* Flow Rate = 155 ml/min

[Plot axes: Input Pressure (psi) versus Radius (in)]


Figure 9 - Hydrostatic Pressure Distribution

* 1:8 Graphite/Ethylene Glycol Mixture

* Film Thickness = 0.005 inches

* Flow Rate = 270 ml/min

Measurements of Hydrodynamic Pressure

The  experimental  test  rig  shown  in  Figures  A3  and  A4  was  used  to  collect  the  data  given 
in  this  section  of  the  report.  The  objective  here  is  to  collect  performance  data  for  the 
graphite/ethylene  glycol  slurry  under  a  hydrodynamic  fluid  film  condition.  Initially,  the 
hydrodynamic  journal  bearing  test  rig  was  used  to  collect  fluid  film  pressure  data  using  only 
pure  ethylene  glycol,  which  is  the  carrier  fluid,  as  the  lubricant.  Figure  10  shows  a 
schematic  of  a  journal  bearing  and  a  typical  pressure  profile  for  a  Newtonian  fluid.  The 
initial  testing  with  strictly  ethylene  glycol  gave  baseline  data  from  which  to  compare  slurry 
pressure  data. 

A unique feature of the hydrodynamic test rig is the ability to control fluid film geometry.
In a typical journal bearing the fluid film geometry adjusts to the externally applied load.
In this case the applied load and fluid pressure profile adjust to the film geometry. This
feature allows the minimum film thickness to be varied in a controlled manner. With such
an arrangement, the position of the minimum clearance (or minimum film thickness) is at the
bottom of the bearing and 180° from the lubricant inlet position (i.e., θ = 0).

Since fluid film geometry was one of the controlled variables, it was important to assure
proper alignment of the journal within the test bearing. To accomplish this, the bearing
block was anchored to the test rig housing to prevent the bearing from turning or from
shifting laterally due to hydrodynamic fluid film pressure. Red coloring dye was applied on
the inner circumference of the bearing and the journal was rotated by hand under slight
bearing contact conditions to check for parallelism. Also, the journal was set concentrically
and run within the bearing to confirm proper alignment; under this condition fluid film
pressure measurements were zero. One reason for slight errors in pressure measurements is
the inability to judge the precision of the alignment. The clearance between
the journal and the bearing is adjusted by off-centering the bearing with the fine threaded
mechanism shown in the photograph. A Linear Variable Differential Transformer (LVDT)
is used to measure the adjustments made to minimum film thickness.

Figure 11 shows the positions of the 5 pressure taps around the test bearing. Four of the
taps (taps 4, 5, 6, and 1) are 30° apart around the bearing circumference. The fifth pressure
tap (tap 2) is located longitudinally away from the plane containing the other four pressure
taps in order to determine the effect of end leakage on pressure profiles. Pressure from
these five taps is transmitted to a pressure transducer by means of 1/8 inch copper tubing.

The transducer is a thin wall cylinder containing four strain gauges. The strain gauges sense
the expansion of the cylinder caused by the internally applied fluid pressure. The nominal
diameter of the cylinder is 2 inches and its length is 3 inches. The high sensitivity of the
pressure transducer is obtained by having its wall made up of 0.05 inch brass shim stock.

For the thin walled cylinder,

Hoop stress σ = (d/2t) P
Internal pressure P = 2σt/d = 2Eεt/d

where

Modulus of Elasticity for brass, E = 15.4 x 10^6 psi
Wall Thickness t = 0.05 in and cylinder diameter d = 2 in.

Strains are measured using foil type strain gages connected in a 4-arm Wheatstone bridge
configuration; this electrical network allows compensation for temperature effects as
well as cancellation of signals caused by extraneous loading. A Bruel & Kjaer strain gage
apparatus gives direct readout of the strains. The apparatus has a sensitivity ranging from
a minimum of 100 μ strain to a maximum of 30,000 μ strain. A standard compressed air
supply  of  known  pressure  was  used  to  calibrate  the  pressure  transducer  and  instrumentation. 
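
The thin walled cylinder relation can be turned into a small conversion between measured hoop strain and film pressure. The sketch below uses the values quoted in the text (E = 15.4 x 10^6 psi, t = 0.05 in, d = 2 in) and, like the relation it implements, treats the hoop stress as uniaxial. The function names are illustrative.

```python
E_BRASS_PSI = 15.4e6          # modulus of elasticity for brass, from the text
WALL_THICKNESS_IN = 0.05      # wall thickness t
CYLINDER_DIAMETER_IN = 2.0    # cylinder diameter d

def pressure_from_hoop_strain(strain):
    """Internal pressure (psi) from hoop strain, P = 2 E eps t / d."""
    return 2.0 * E_BRASS_PSI * strain * WALL_THICKNESS_IN / CYLINDER_DIAMETER_IN

def hoop_strain_from_pressure(pressure_psi):
    """Hoop strain produced by an internal pressure (psi), eps = P d / (2 E t)."""
    return pressure_psi * CYLINDER_DIAMETER_IN / (2.0 * E_BRASS_PSI * WALL_THICKNESS_IN)

# 100 microstrain corresponds to 77 psi under these assumptions.
print(f"{pressure_from_hoop_strain(100e-6):.1f} psi per 100 microstrain")
# Strain expected for a 49 psi film pressure, in microstrain.
print(f"{hoop_strain_from_pressure(49.0) * 1e6:.1f} microstrain at 49 psi")
```

In practice the calibration against a known air pressure, as described above, would replace this idealized conversion.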

The maximum rotating speed capability of the input drive was about 6800 rpm. All testing
was conducted at a speed of 3100 rpm. The journal speed was determined by using a digital
tachometer. One of the aspects of the testing program was to measure the amount of
temperature increase in the fluid film due to journal rotational speed. A thermocouple with
a resolution of 1/10th of a degree was installed at port 3 to measure the film
temperature.

Figure 10. Schematic View of a Hydrodynamic Journal Bearing

[Figure 11 annotations: Hydrodynamic Bearing; Lubricant Inlet; Radius of shaft = 1.0 in;
Bearing length = 3.0 in; L/D Ratio = 1.5; Radial Clearance = 0.003 in; Pressure Ports 1 to 6]

Figure 11. Bearing Showing the Position of Pressure Taps


Pressure  and  Temperature  Measurements  with  Pure  Ethylene  Glycol 

Fluid film pressures were measured at each of the 5 ports mentioned above for various film
thicknesses during the course of this study. Typical pressure data are given in Table 1 for
three different values of minimum thickness. A polar plot of these data follows a typical
pressure profile as shown in Figure 10. Highest pressures were obtained at pressure port
5. With the pressure taps located 30 degrees apart, it was difficult to estimate the precise
magnitude and position of the peak pressure. These pressure readings were taken after
fluid film temperature stabilized.


Table 1

Measured Pressures for Pure Ethylene Glycol

                      Pressure (psi)
hmin (in.)   Port 1   Port 6   Port 5   Port 4   Port 2
0.0015       0        5        28       7        5
0.0012       5        18       40       10       16
0.0010       6        25       49       12       24

Table  2  shows  how  the  fluid  film  temperature  increased  as  a  result  of  fluid  friction.  The 
data  indicates  there  was  about  a  15  degree  Fahrenheit  increase  in  temperature  over  a  25 
minute  period  of  operation.  This  temperature  rise  was  large  enough  to  cause  a  substantial 
drop  in  measured  pressure  as  indicated  in  Table  2.  This  steady  drop  in  pressure  is  due  to 
a  reduction  in  the  viscosity  of  ethylene  glycol. 
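
The size of this effect can be read directly from the Table 2 values. The sketch below computes the temperature rise and the fractional pressure drop over the run. Interpreting the pressure ratio as a viscosity ratio is an assumption: it holds only if film pressure scales linearly with viscosity at fixed speed and geometry.

```python
# Table 2 values for pure ethylene glycol:
# (time in min, pressure in psi, temperature in deg F)
TABLE_2 = [
    (0, 22.0, 75.0),
    (5, 19.0, 80.0),
    (10, 17.5, 83.0),
    (15, 16.0, 86.0),
    (20, 15.5, 88.0),
    (25, 15.2, 89.0),
]

def pressure_ratios(rows):
    """Pressure at each time divided by the pressure at the start of the run."""
    p_start = rows[0][1]
    return [(time_min, pressure / p_start) for time_min, pressure, _ in rows]

temperature_rise = TABLE_2[-1][2] - TABLE_2[0][2]
pressure_drop_fraction = 1.0 - TABLE_2[-1][1] / TABLE_2[0][1]

print(f"Temperature rise: {temperature_rise:.0f} deg F")
print(f"Pressure drop: {100.0 * pressure_drop_fraction:.1f} %")
for time_min, ratio in pressure_ratios(TABLE_2):
    print(f"t = {time_min:2d} min, p/p0 = {ratio:.3f}")
```

Under the stated assumption, the final ratio of about 0.69 would correspond to a viscosity loss of roughly 31 percent over the 14 degree rise recorded in the table.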


Table 2

Rise in Lubricant Film Temperature During Test Run
(Pure Ethylene Glycol)

Time (min)   Pressure (psi)   Temp (°F)
0            22               75
5            19               80
10           17.5             83
15           16               86
20           15.5             88
25           15.2             89

The next step in the study was to compare these pressure measurements with lubrication
film pressures predicted by classical analytical methods. Analytical predictions of pressure
corresponding to the above test conditions are discussed below.

Theoretical  Predictions  of  Pressure  for  Ethylene  Glycol 

D.D. Fuller (Ref. 1) summarizes the state-of-the-art for predicting lubrication film pressures.
It is common practice to account for the effects of end leakage through a load reduction
factor, η. Following Fuller's approach,

$$ h_{min} = m\,r\,(1 - \varepsilon) \qquad (4) $$

For a film thickness of 0.0012 in., the corresponding eccentricity ratio of 0.6, a journal speed
of 3100 rpm, and a viscosity of ethylene glycol of 14 cP,

$$ P_{avg} = \frac{A\,\eta\,z\,N}{132\,(1000\,m)^2} = 49.84 \text{ psi} \qquad (5) $$


Factor A = 2.12 relates to the vector that represents, in direction and magnitude, the resultant
load carrying capacity of the film pressure. Table 3 gives the theoretically predicted
pressures  for  other  film  thickness  based  on  Fuller’s  equations. 
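
The pairing of film thickness and eccentricity ratio used in these predictions can be checked with a short sketch. It assumes the relation h_min = m r (1 - ε) with m = c/r, so that h_min = c (1 - ε), and it takes the radial clearance of 0.003 in and the shaft radius of 1.0 in from the Figure 11 annotations. The function names are illustrative.

```python
RADIAL_CLEARANCE_IN = 0.003   # assumed from the Figure 11 annotations
JOURNAL_RADIUS_IN = 1.0       # assumed from the Figure 11 annotations

def min_film_thickness(eccentricity_ratio,
                       clearance=RADIAL_CLEARANCE_IN,
                       radius=JOURNAL_RADIUS_IN):
    """Minimum film thickness (in) from h_min = m * r * (1 - eps), m = c / r."""
    clearance_modulus = clearance / radius
    return clearance_modulus * radius * (1.0 - eccentricity_ratio)

def eccentricity_from_film(h_min, clearance=RADIAL_CLEARANCE_IN):
    """Eccentricity ratio that gives the requested minimum film thickness."""
    return 1.0 - h_min / clearance

for eps in (0.4, 0.5, 0.6):
    print(f"eps = {eps:.1f} -> h_min = {min_film_thickness(eps):.4f} in")
```

With these assumed dimensions, eccentricity ratios of 0.4, 0.5, and 0.6 give film thicknesses of 0.0018, 0.0015, and 0.0012 in, the values used in the theoretical predictions.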


Table 3

Theoretical Pressures Predicted Using Fuller's Equations

hmin (inches)   ε     Pavg (psi)
0.0018          0.4   21.84
0.0015          0.5   32.48
0.0012          0.6   49.84


Raimondi and Boyd (Ref. 2) have evaluated performance characteristics for full journal
bearings. Their characteristic plots for

hmin = 0.0012 inches
ε = 0.6
L/D ratio = 1.5

predict

S (Sommerfeld No) = 0.0389
θpmax = 18 degrees
Pmax = 1.397 Pavg
     = 60 psi (at Pavg = 49.84 psi)

The data obtained for pure ethylene glycol, although in reasonable agreement, are lower than
the theoretical predictions for the same journal speed and the same eccentricity ratios.

The calculations by Raimondi and Boyd are based on (a) a full journal bearing having an oil
film extending completely around the bearing, (b) no end leakage, and (c) the same viscosity
at all points in the lubricant film. These conditions do not exist in practice, and the pressures
in a practical bearing would be less than those indicated. A quantitative
comparison of the analysis and the experiments presented is difficult for the above reasons.

Temperature had a strong effect on the pressures because viscosity depends
on temperature. The variation of viscosity with temperature for ethylene glycol is illustrated
in Figure 12. As stated previously, a rise of approximately 15°F is observed during
a test run.

Pressure and Temperature Measurements for the Graphite/Ethylene Glycol Slurry

The slurry mixture, consisting of one part (weight) of graphite to eight parts (weight) of
ethylene  glycol  was  mixed  and  tested  in  the  hydrodynamic  journal  bearing  testing  rig. 
Minimum  film  thickness  was  varied  much  the  same  as  for  the  pure  ethylene  glycol  tests. 
Pressure  measurements  for  the  graphite  slurry  are  given  in  Table  4. 
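
A port-by-port comparison of the slurry readings in Table 4 with the pure ethylene glycol readings in Table 1 can be sketched as below. The values are copied from the two tables; the data layout and the function name are illustrative.

```python
PORTS = ["Port 1", "Port 6", "Port 5", "Port 4", "Port 2"]

# Measured pressures (psi) keyed by minimum film thickness (in).
PURE = {
    0.0015: [0, 5, 28, 7, 5],
    0.0012: [5, 18, 40, 10, 16],
    0.0010: [6, 25, 49, 12, 24],
}
SLURRY = {
    0.0015: [10, 20, 20, 5, 20],
    0.0012: [25, 32, 32, 8, 30],
    0.0010: [34, 60, 45, 10, 59],
}

def pressure_change(h_min):
    """Slurry pressure minus pure ethylene glycol pressure (psi) at each port."""
    return {port: slurry - pure
            for port, pure, slurry in zip(PORTS, PURE[h_min], SLURRY[h_min])}

for h_min in sorted(PURE, reverse=True):
    print(f"hmin = {h_min:.4f} in: {pressure_change(h_min)}")
```

The differences are positive at ports 1, 6, and 2 and slightly negative at ports 5 and 4, so the increase in pressure with graphite is not uniform around the bearing.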

Table 4

Measured Pressures for Non-Newtonian Lubricant
(1:8 (by weight) Graphite and Ethylene Glycol Slurry)

                      Pressure (psi)
hmin (in.)   Port 1   Port 6   Port 5   Port 4   Port 2
0.0015       10       20       20       5        20
0.0012       25       32       32       8        30
0.0010       34       60       45       10       59

Measurements of film temperature for the slurry testing are given in Table 5.


Table 5

Rise in Lubricant Film Temperature During Test Run
(Graphite Slurry)

Pressure (psi)   Temp (°F)
24               75
20               83
18               88
16               90
15.5             93
15               95

The film temperature history for the graphite slurry, under the same conditions as used for
pure ethylene glycol, reveals that fluid friction is significantly greater when graphite is added
to  pure  ethylene  glycol.  The  limiting  coefficient  of  friction  for  the  slurry  corresponds  to  the 
coefficient  of  graphite  acting  alone.  Union  Carbide  (ref  4)  gives  the  coefficient  of  friction 
of  graphite  as  0.24. 
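
The temperature comparison can be made explicit with the values from Tables 2 and 5. The sketch below compares only the first and last readings of each run; it assumes the two runs covered the same duration, since Table 5 lists no times.

```python
# Film temperatures (deg F) in the order listed in Tables 2 and 5.
PURE_TEMPS_F = [75, 80, 83, 86, 88, 89]
SLURRY_TEMPS_F = [75, 83, 88, 90, 93, 95]

def temperature_rise(temps):
    """Rise from the first to the last reading of a test run."""
    return temps[-1] - temps[0]

pure_rise = temperature_rise(PURE_TEMPS_F)
slurry_rise = temperature_rise(SLURRY_TEMPS_F)
rise_ratio = slurry_rise / pure_rise

print(f"Pure ethylene glycol rise: {pure_rise} deg F")
print(f"Graphite slurry rise: {slurry_rise} deg F")
print(f"Ratio of rises: {rise_ratio:.2f}")
```

The slurry run warms by 20 degrees against 14 degrees for the pure fluid, a ratio of about 1.4, which is consistent with the greater fluid friction described above.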

[Plot: Viscosity (cP) vs. Temperature, Ethylene Glycol (C2H6O2)]


Figure 12 - Variation of Viscosity with Temperature for
Pure Ethylene Glycol

CONCLUSIONS


The objective of this project was to determine the effects of adding a powder lubricant to
a carrier fluid on lubrication film pressure. The effects are best evaluated in terms of the
difference in fluid film pressure between the pure carrier fluid and the powder slurry.
Because of the short duration of this project, only graphite was considered and the carrier
or transport fluid was ethylene glycol.

Two  types  of  lubrication  films  were  studied.  One  film  type  was  hydrostatic.  The  other  was 
hydrodynamic.  Two  separate  laboratory  test  rigs  were  designed,  built  and  instrumented. 

Test data taken with the hydrostatic test equipment agreed
with theoretical predictions based on the non-Newtonian rheological properties of the graphite
slurry. This agreement gives a high level of confidence in the use of the fluid film pressure
equations derived and given in this report. As the data collected from the hydrostatic rig
are based on nonrotating conditions, more work is needed to include the effects of friction and
inertia forces in the mathematical and experimental studies.

Test data from the hydrodynamic test equipment were taken under dynamic or rotating
conditions. The addition of graphite to the ethylene glycol produced an increase in
hydrodynamic pressure above that developed by the pure ethylene glycol lubricant. The
graphite increased the effective viscosity of the carrier fluid, causing not only an increase in
hydrodynamic pressure but also an increase in fluid temperature.

This work was meant to be a pilot study for the more direct application of powder lubricants
in high temperature applications. The results, however, have possible application to low
temperature equipment such as squeeze film dampers and to pumping losses in supplying
slurries to high temperature areas.

NOMENCLATURE


hmin      Minimum film thickness, in.
c         Radial clearance, in.
ε = e/c   Eccentricity ratio, dimensionless
r         Journal radius, in.
D         Journal diameter, in.
L         Axial length of bearing, in.
m = c/r   Clearance modulus of bearing, dimensionless
          Load, lb.
N         Journal speed, rpm.
z         Viscosity, centipoise
S         Sommerfeld number, dimensionless
P         Film pressure, psi.
Pmax      Max. pressure developed in the film, psi.
θ         Angular coordinate; measured in direction of rotation, deg.
φ         Position of minimum film thickness, deg.
θpmax     Position of maximum film pressure, deg.
θp0       Position at which film terminates, deg.
η         Side leakage factor

APPENDIX

Photographs of Experimental Test Rigs

ABSTRACT


This  report  contains  a  brief  summary  of  the  work  completed  on  a  Research  Initiation  Grant 
that  was  sponsored  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR)  and  funded  through 
Universal Energy Systems, Inc. (UES). The main thrust of the Research Initiation Grant was to
install a burner in the Combustion Laboratory at Brigham Young University that had been
designed to "specifically reproduce recirculation patterns and LBO (lean blow out) processes that
occur in a real gas turbine combustor" (Sturgess, et al., 1990). Secondary objectives of the study
were  to  conduct  operational  checkout  tests  of  both  burner  configurations,  conduct  limited  lean 
blow  out  experiments  with  the  burner,  and  begin  the  design  and  installation  of  the  laser  based 
diagnostic  capability  that  exists  at  BYU  onto  the  burner. 

The  burner,  which  was  provided  to  BYU  by  the  Aero  Propulsion  and  Power  Laboratory  at 
Wright Patterson Air Force Base (APPL, WPAFB), was designed by researchers at Pratt and
Whitney  Aircraft,  Inc.  (Sturgess,  et  al.,  1990),  and  was  fabricated  in  the  machine  shops  at  Wright 
Patterson  Air  Force  Base.  There  are  two  configurations  used  in  the  burner.  The  first,  which  is 
referred  to  as  the  Pratt-Whitney  Task  100  burner,  uses  a  central  fuel  tube  surrounded  by  a 
concentric  air  jet.  The  second  configuration,  which  is  referred  to  as  the  Pratt-Whitney  Task  150 
Burner,  has  replaced  the  central  fuel  and  air  tubes  with  an  actual  high  swirl  injector  out  of  a 
commercial  Pratt-Whitney  jet  engine. 

During  this  study,  a  test  facility  (which  includes  a  translating  test  stand,  a  water  quenched 
exhaust  hood,  and  various  air  and  fuel  feed  systems  for  the  combustor)  was  designed,  installed, 
and  successfully  checked  out.  Optical  systems  were  also  designed  to  interface  the  laser  based 
diagnostic  capabilities  that  exist  in  the  Chemical  Engineering  Department  at  BYU  with  the  burner 
test facility. The actual interfacing of the laser based instruments will be completed under a
second AFOSR Research Initiation Grant that is being funded through Research and
Development  Laboratories,  Inc.  (RDL).  Additional  combustion  experiments  including  CARS 
temperature  measurements  and  laser  sheet  lighting  of  the  combustor  flames  will  be  conducted 
under  the  new  RDL  Research  Initiation  Grant. 

Limited LBO data were collected in this study with both Task 100 and Task 150
configurations. Preliminary comparisons have been made between results from the Task 100 and
the Task 150 configurations operating at BYU and with data from the identical burner located at
WPAFB. The values of φ_LBO for the BYU Task 100 test results were in the range of 0.72 - 0.83.
Similar values for this same burner obtained at WPAFB were in the range of 0.51 - 0.55.
Preliminary analysis of these data suggests that the reduced ambient pressure at BYU (ca 12.6 psia)
compared to the ambient pressure at WPAFB (ca 14.2 psia) may be responsible for the
differences in φ_LBO that have been observed. Further investigations of this effect are planned
under the new RDL Research Initiation Grant.

Values of φ_LBO for the Task 150 configuration ranged from 0.45 at air flow rates of about
250 slpm to a peak of about 0.6 at air flow rates of about 350 slpm. A decrease in φ_LBO was
observed as air flow was increased beyond the 350 slpm range, to a value of about 0.34 at an air
flow rate of about 1000 slpm. In general, the φ_LBO results for the Task 150 burner measured at
BYU agreed with values of φ_LBO obtained at WPAFB during the summer of 1991 (Hedman and
Warren, 1991). However, the values of φ_LBO obtained at BYU for the high swirl fuel injector (Task
150) were found to be significantly different from the results obtained at BYU for the Task 100
burner.

The research on the Task 100 and Task 150 burners will be continued under the support of
the new RDL Research Initiation Grant. This new study will continue to investigate the
complex flow fields and flow patterns in the Task 150 burner, with the intent of providing an
in depth understanding of its complexities and obtaining data to verify current
mathematical models of the combustion process inside a turbojet engine combustor.

INTRODUCTION


BACKGROUND 

This  report  contains  a  brief  summary  of  the  work  completed  on  a  Research  Initiation  Grant 
that  was  sponsored  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR)  and  funded  through 
Universal  Energy  Systems,  Inc.  (UES).  The  main  thrust  of  the  Research  Initiation  Grant  was  to 
install  a  burner  in  the  Combustion  Laboratory  at  Brigham  Young  University  that  had  been 
designed  to  "specifically  reproduce  recirculation  patterns  and  LBO  (lean  blow  out)  processes  that 
occur  in  a  real  gas  turbine  combustor"  (Sturgess,  et  al.,  1990).  Secondary  objectives  of  the  study 
were  to  conduct  operational  checkout  tests,  conduct  limited  lean  blow  out  experiments,  and 
begin  the  design  and  installation  of  the  laser  based  diagnostic  capability  that  exists  at  BYU  onto 
the  burner.  The  grant  from  UES  was  received  by  the  Department  of  Chemical  Engineering  in 
November  1990.  Prior  to  receiving  the  grant,  the  principal  investigator  had  spent  the  summer  of 
1990  participating  in  the  Summer  Faculty  Research  Program  at  the  Aero  Propulsion  and  Power 
Laboratory, Wright-Patterson Air Force Base (APPL, WPAFB). During this time extensive
measurements were made in the combustion flame of a burner with a confined, coannular jet
with a sudden expansion (Hedman, 1990). This burner, which operates on gaseous propane fuel,
is known as the Pratt and Whitney Task 100 Combustor.

A schematic of the Task 100 Combustor is shown in Figure 1. The Task 100 combustor consists
of coaxial jets with a 29 mm diameter central fuel jet surrounded by a 40 mm diameter annular
air jet. The jets are located in the center of a 150 mm diameter duct. A sudden expansion,
rearward facing bluff body, with a step height of 55 mm, is located at the exit plane of the
coaxial jets.

Figure 1 - P & W Task 100 Burner with 62% Orifice Plate.

The combustor test section incorporates flat quartz windows to accommodate laser and other
optical access, but uses a metal shell with metal corner fillets to reduce the vorticity and eliminate
its effect on the bulk flow field in the combustor. This box-section combustor with corner fillets
allows reasonable optical access, while providing a cross section that approximates a
two-dimensional axisymmetric cross section. The bluff body provides a recirculation region that can
stabilize the flame.

Also being studied at the Aero Propulsion and Power Laboratory, WPAFB, is a rectangular
combustion chamber with four swirling fuel injectors from a commercial Pratt & Whitney jet engine.
This burner is known as the Task 200 combustor. It was designed to simulate a segment of a real
jet engine, but the four swirling fuel injectors produced very complex flow patterns and
recirculation zones that were extremely difficult to measure and analyze.

During the summer faculty research program of 1990, the idea of implementing a swirl fuel
injector from the Task 200 burner in the Task 100 burner was conceived. This combination has
become known as the Task 150 configuration. The advantage of the Pratt and Whitney Task 150
Combustor was that it would allow the combustion characteristics of a jet engine fuel injector to be
investigated in a simpler geometry where various diagnostic measurements (primarily
laser based optical measurements) could be more easily made. A schematic drawing of the
Task 150 Combustor is shown in Figure 2. A drawing that shows the installation of the high
swirl fuel injector in greater detail is presented in Figure 3.

Figure 2 - P & W Task 150 Burner With High Swirl Fuel Injector.

Only a high swirl P & W fuel injector has been provided for this UES (AFOSR) study. The high
swirl injector is referred to by representatives of Pratt and Whitney as a "bill of materials
injector" used in production engines. A low swirl injector was used at WPAFB during the 1991
summer faculty research program, but this injector was not available to this program. The Task 150
burner swirls the entering fuel and air flows, as is commonly done in a real turbojet engine
combustor, and could eventually allow the investigation of liquid jet engine fuels. The work in the
Task 150 combustor provides a bridge between the work in the Task 100 Combustor and the
Task 200 combustor.

The main thrust of this Research Initiation Grant has been to incorporate an actual swirling
fuel injector like that used in the Pratt and Whitney Task 200 Combustor into the Pratt and
Whitney Task 100 Combustor hardware and to examine the feasibility of making meaningful
combustion diagnostic measurements in more complex swirling flows.

The burner, which was provided to BYU by the Aero Propulsion and Power Laboratory at
Wright-Patterson Air Force Base (APPL, WPAFB), was designed by researchers at Pratt and
Whitney Aircraft, Inc. (Sturgess, et al., 1990), and was fabricated in the machine shops at
Wright-Patterson Air Force Base. In this study, the Aero Propulsion and Power Laboratory at
Wright-Patterson Air Force Base provided BYU the Pratt and Whitney Task 100 combustor hardware.
Pratt and Whitney Aircraft, East Hartford, Connecticut, provided a high swirl injector from an actual
Pratt and Whitney turbojet engine. The Aero Propulsion and Power Laboratory began the
fabrication of the Task 150 combustor in the fall of 1990, and the combustor was delivered to BYU
at the end of March 1991. The schedule of the Research Initiation Grant was based on a delivery of
hardware in November 1990. Also, the high swirl fuel injector, the aluminum dome, and the
aluminum diffuser section were taken back to WPAFB during a related 1991 summer faculty
research program (Hedman and Warren, 1991). The March arrival of the Task 150 hardware and
the use of some of the hardware at WPAFB during the summer of 1991 delayed the installation of
the research combustor until the end of October 1991.

OBJECTIVE 

The objectives of the Research Initiation Program were limited in scope and
restricted to 1) the design, fabrication, and installation of a test facility for the P & W research
combustor in the existing Combustion and Reactions Laboratory Building (B-41) at Brigham
Young University, 2) the performance of initial checkout tests and a limited number of lean blow
out limit tests so that BYU data can be compared with previous work done at WPAFB,
and 3) the initial interfacing of the burner with the existing laser based diagnostics
available at BYU.

APPROACH 

This  research  program  was  based  on  the  Aero  Propulsion  and  Power  Laboratory,  Wright- 
Patterson  AFB  providing  the  burner  hardware,  and  Pratt  and  Whitney  Aircraft  providing  the  Task 
150  high  swirl  injector.  The  translating  test  stand,  water  quenched  exhaust  system,  and  air  and 
fuel  feed  systems  were  to  be  designed,  fabricated  and  installed  by  researchers  at  Brigham  Young 
University.  The  funding  of  the  study  was  rather  limited.  Consequently,  facility  components 
(particularly  air  flow  system  components)  that  were  available  from  other  facilities  that  have  since 
been  disassembled  were  used.  Additionally,  limited  checkout  and  lean  blow  out  combustion 
experiments  were  to  be  performed.  The  design  of  the  burner  facility  was  governed  by  the 
following  constraints:  1)  high  temperature  exhaust,  2)  rigidity  of  a  movable  test  stand, 
3)  potential  for  computer  controlled  translating  test  stand  and  data  collection,  and  4) 
limited  funds. 

The water quenched hood was designed to permit air and propane flow rates of up to
4000 and 104 slpm (70 °F), respectively. The peak flow rate of propane gives a heat
release of about 544,000 Btu/hr. The adiabatic flame temperatures near the lean
flammability limit of propane and air and near the stoichiometric fuel/air ratio are
about 1508 K (2255 °F) and 2275 K (3635 °F), respectively (Gordon and McBride,
1976). Consequently, the water quenched exhaust hood was overdesigned to
accommodate a heat release of one million Btu/hr and reduce the exhaust products to a
temperature of about 100 °C (212 °F) or below.

Since the combustion and flame characteristics in the burner were to be analyzed
by laser techniques, the laser diagnostic volume needed to be translated in three
dimensions inside the burner. The approach was to design a translating test stand that
would move in two dimensions, with motion in the third dimension being provided by
motion of the optical components on the laser optical tables. The translating test stand
needed to be very sturdy, with minimal deflection. The test stand also needed to be
movable: because of the physical space constraints of the building that contains the burner
facility, it is necessary to move the burner out of the way so that the laser beams can be
directed to other reactors located in the Combustion Laboratory. It is anticipated that the
translating stand and movable laser system components will eventually be integrated
into a computer automated control system. Therefore, provisions were made in the
designs to facilitate the conversion to computer control.

The budget for the research project was reasonable, yet limited. Therefore,
simple designs and simple fabrication of hardware components were needed. All of these
constraints were considered during the design process of the burner facility.

TECHNICAL  DISCUSSION 


TEST  FACILITY 

The burner test facility, located in the Combustion and Reactions
Laboratory Building (B-41) on the Brigham Young University campus, consists of four main
systems: 1) a 2-dimensional translating test stand, 2) a water cooled exhaust system, 3)
the utility systems, and 4) the laser diagnostic system. The translating test stand allows
the burner to be positioned vertically and horizontally so that different laser
diagnostic measurements may be performed. The exhaust hood cools the exhaust from the
combustor and discharges it outside of B-41. The utility system consists of three
subsystems: the water system, the air system, and the fuel system. Figure 4, an overview of
the burner test facility, shows a front view of the water cooled exhaust system and a
side view of the translating test stand.

2-D Translating Test Stand. The AFOSR burner is mounted on the 2-D translating test
stand as shown in Figure 4. The test stand is used to interface the burner to the existing laser
instruments available at BYU, and provides the translation capability to align the laser diagnostic
volume inside the burner. The test stand has three main components: the translator frame,
the x-axis translator, and the z-axis translator. The translator frame is constructed of
3 in. (76 mm) square steel tubing and a 1/2 in. (13 mm) mounting plate. The frame is
mounted on four casters that allow it to be rolled about easily. The stand can be secured
by positioning four leveling screws. The x-axis translator is mounted inside the z-axis
carriage assembly. The x-axis translator consists of four hardened steel rods that slide
through four linear bearings. The combustor is mounted in the burner mounting frame
located at one end of the x-axis. The burner can be moved manually along the x-axis.
The z-axis translator is made up of a carriage assembly that rides up or down along two
square steel posts. The z-axis is driven by a ball screw attached to an
electric motor. The translating test stand allows the diagnostic window of the burner to
be positioned over a maximum travel of 6" (150 mm) along the x-axis and 20" (508
mm) along the z-axis. The third axis of movement is obtained optically through
the laser optical components and is described later in the Laser Diagnostic Design
section.

Figure 4 - Overview of the BYU (Pratt and Whitney Task 100 or Task 150) Test Facility


Water Cooled Exhaust System. The exhaust system was designed to handle a heat
release of up to one million Btu/hr. The system consists of a water cooled hood, sheet
metal ducting, and an exhaust fan as depicted in Figure 4. The water cooled hood is
comprised of two concentric stainless steel cylindrical shells welded together at both ends
to form a jacket that water enters and exits. The water enters through three inlet pipes.
The pipes duct the inlet water to the bottom of the cylindrical shell. The water then flows
upward through the shell and exits through three pipes that are manifolded together
and directed to a drain pipe. The stainless steel hood is suspended from the second floor
I-beams. The top of the burner can extend 8" (200 mm) up into the hood. This allows
for radiation absorption when the burner is operated in a fuel rich, high flow condition.
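
The design margin of the hood can be checked with a short calculation. The following is a minimal sketch, not part of the original design work: it assumes ideal gas behavior, a standard state of 70 °F and 1 atm for the slpm figures, and a higher heating value for propane of about 2220 kJ/mol. The function name and constants are illustrative.

```python
# Sketch: heat release for a propane flow given in standard liters per minute.
# Assumptions (not from the report): ideal gas, standard state of 70 °F and
# 1 atm, and a higher heating value of propane of about 2220 kJ/mol.

R_UNIVERSAL = 8.314462                      # J/(mol K)
T_STD_K = (70.0 - 32.0) / 1.8 + 273.15      # 70 °F expressed in kelvin
P_STD_PA = 101325.0                         # 1 atm
HHV_PROPANE_KJ_PER_MOL = 2220.0             # assumed heating value
KJ_PER_BTU = 1.055056
HOOD_DESIGN_BTU_PER_HR = 1.0e6              # design value quoted in the text


def heat_release_btu_per_hr(propane_slpm: float) -> float:
    """Convert a propane flow in slpm to a heat release in Btu/hr."""
    molar_volume_l = R_UNIVERSAL * T_STD_K / P_STD_PA * 1000.0  # L/mol
    mol_per_hr = propane_slpm / molar_volume_l * 60.0
    return mol_per_hr * HHV_PROPANE_KJ_PER_MOL / KJ_PER_BTU


peak = heat_release_btu_per_hr(104.0)       # peak propane flow from the text
margin = HOOD_DESIGN_BTU_PER_HR / peak
print(f"peak heat release: {peak:,.0f} Btu/hr, design margin: {margin:.2f}x")
```

Under these assumptions the peak propane flow of 104 slpm gives a value close to the 544,000 Btu/hr quoted in the Approach section, so the one million Btu/hr design point leaves a margin of a little under a factor of two.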

Suspended and stacked inside the stainless steel hood is a set of four compact heat
exchangers. These heat exchangers have been designed to cool the exhaust to a
temperature within the range of 180 - 200 °F (82 - 93 °C). The cooled exhaust is
channeled through sheet metal ducting, as shown in Figure 4, and discharged outside
of the building by a heat resistant fan. The heat resistant fan has a three speed
control and pulls a maximum of 5450 CFM. Louvers were installed in the wall of the
building at the fan exit to prevent backdraft into the building when the hood is not in
use.


Utility Systems. The utility systems designed for the BYU burner test facility
consist of three flow systems: the cooling water system, the combustion air system, and
the propane fuel system. Each of the three systems has controls and/or
instrumentation devices (i.e., pressure and temperature readouts) located on a main
control panel.

Water System. This system supplies cooling water to the exhaust hood and
radiators. The cooling water is brought in from the building's main water supply and
regulated to a water pressure of 5 psig. A thermocouple has been centered in the
exit duct of the exhaust system hood to monitor the temperature
of the exhaust gases. The flow rate of cooling water can be regulated to ensure that the
temperature of the exhaust gases remains less than about 200 °F (ca. 90 °C). The flow
rate of the cooling water is regulated either by adjusting the on/off valve or by setting the
pressure regulator to a higher or lower pressure.

Air System. The air flow system was designed to control the upstream pressure
to a choked flow nozzle. This provides a very simple control system, since the flow
through a choked flow nozzle is directly proportional to the absolute upstream pressure
at a fixed inlet temperature. In order to provide the flow range desired, 250 - 4000 slpm
(70 °F), three separate choked flow nozzles were designed and fabricated. The throat
diameters of 0.1485 in., 0.214 in., and 0.296 in. give flow rate ranges of 250-1000,
500-2000, and 1000-4000 slpm (70 °F), respectively, over the operating pressure range
available with the BYU air system.

The air flow control system can be operated with either closed loop feedback
control or open loop control. A schematic of the closed feedback control loop is
shown in Figure 5. The closed feedback control loop consists of a Val-Tek™ pneumatic
control valve, a pressure gauge, a thermocouple, the set of three different choked flow
control nozzles (one for each flow range), a pressure transducer, a signal amplifier, and
the electronic feedback control unit.

Dry air is supplied to the burner facility from low pressure tanks located outside
of the Combustion Laboratory building. The Val-Tek™ control valve regulates the
pressure of the incoming air to the choked flow nozzle. The air flow rate is regulated in
the closed feedback control loop by adjusting the gain control on the feedback control
unit. This setting establishes the desired upstream pressure on the choked flow control
nozzle. The actual upstream pressure on the control nozzle is monitored by the pressure
gauge. A pressure transducer provides the feedback pressure signal to the control
unit, where it is amplified and compared to the gain control setting. The Val-Tek™
pneumatic control valve opens or closes accordingly. The closed loop feedback control
system will hold a nearly constant pressure over an extended period of time. However,
slight pressure fluctuations (ca. ± 1 psi) were caused by minor oscillations in the
control system. The stability of this control system is very similar to that experienced
in the test facility used in Building 450 at WPAFB.

Figure 5 - Schematic of Test Facility Air Delivery System.

In the open loop control system, the transducer, amplifier, and gain
control are replaced by a variable voltage supply connected directly to the
electro-pneumatic transducer on the Val-Tek™ control valve. Since the valve position is
proportional to the voltage applied, its position can be set by adjusting the voltage. A
steady voltage gives a very steady valve position and a very steady pressure to the
choked flow control nozzle. There is a slight reduction in pressure over a long time
interval (tens of minutes), caused by the slight reduction in tank pressure as
the tanks bleed down. This can be easily corrected by making slight
adjustments in the voltage applied to the electro-pneumatic transducer on the Val-Tek™
control valve.

The closed loop control system will be preferred when performing diagnostic
measurements that require a nearly steady flow rate over a long time period, but where
minor fluctuations in flow rate will have no appreciable effect (e.g., CARS temperature
mapping or sheet lighting experiments). The open loop control system will be
preferred when short term fluctuations cannot be tolerated and exact flow rates are
critical (e.g., when determining lean blow out limits).
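
The size of the closed loop fluctuation can be put in rough numbers. The sketch below assumes that the flow through the choked nozzle is proportional to absolute upstream pressure and takes the laboratory ambient pressure as 12.6 psia, the value the report gives with the nozzle correlations. The function name and the 50 psig set point are illustrative.

```python
# Sketch: fractional air flow variation caused by a pressure oscillation
# upstream of a choked nozzle. Assumes flow is proportional to absolute
# upstream pressure and an ambient pressure of 12.6 psia.

AMBIENT_PSIA = 12.6


def flow_fluctuation_fraction(setpoint_psig: float, oscillation_psi: float = 1.0) -> float:
    """Return the fractional flow change for a +/- oscillation_psi swing."""
    absolute_psia = setpoint_psig + AMBIENT_PSIA
    return oscillation_psi / absolute_psia


# A +/- 1 psi oscillation at a 50 psig set point:
fraction = flow_fluctuation_fraction(50.0)
print(f"flow varies by about +/- {100.0 * fraction:.1f} percent")
```

At a 50 psig set point this works out to roughly 1.6 percent of the set flow, and the fraction grows as the set point pressure is lowered.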


In order to achieve a designed range of air flow, one of three choked flow nozzles
is used. The flow rate through a given choked flow nozzle is determined from Equation 1:

$$ \text{Air flow} = C_d \, A_t \, P_1 \sqrt{\frac{g\,k}{R\,T_1}\left(\frac{2}{k+1}\right)^{(k+1)/(k-1)}} \tag{1} $$

where $C_d$ is the discharge coefficient, $A_t$ is the area of the throat of the nozzle, $P_1$ is the
upstream nozzle pressure, $g$ is the gravitational constant, $T_1$ is the nozzle inlet temperature,
$k$ is the specific heat ratio of air, and $R$ is the ideal gas constant.


The initial air flow rate correlations, which are primarily a function of upstream pressure, were
determined by assuming a discharge coefficient (Cd) of 0.98 for each of the three
nozzles (eventually Cd will be determined with a flow calibration system). Using
Equation 1, assuming 70 °F, and using an upstream pressure range of 15-120 psi, a flow
curve has been developed for each nozzle diameter. Figure 6 presents the final correlations
(Equations 2, 3, and 4) for the three air flow nozzles to be used in the burner facility.
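
As a rough cross-check on this procedure, the choked flow relation of Equation 1 can be evaluated directly. The sketch below works in SI units, where the constant g equals 1 and drops out. It assumes k = 1.4, a specific gas constant for air of 287.05 J/(kg K), and a standard state of 70 °F and 1 atm for converting mass flow to slpm; none of these constants are stated in the report, and the function names are illustrative.

```python
import math

# Sketch: slope of the air flow correlation (slpm per psia of upstream
# pressure) from the choked flow relation, in SI units.
# Assumed constants (not given in the report):
K_AIR = 1.4                                   # specific heat ratio of air
R_AIR = 287.05                                # J/(kg K)
T_STD_K = (70.0 - 32.0) / 1.8 + 273.15        # 70 °F in kelvin
P_STD_PA = 101325.0                           # standard pressure, 1 atm
PA_PER_PSI = 6894.757
M_PER_IN = 0.0254


def choked_mass_flow_kg_s(cd, throat_diameter_in, p1_psia, t1_k=T_STD_K):
    """Mass flow through a choked nozzle for an absolute inlet pressure."""
    throat_area = math.pi / 4.0 * (throat_diameter_in * M_PER_IN) ** 2
    p1_pa = p1_psia * PA_PER_PSI
    exponent = (K_AIR + 1.0) / (K_AIR - 1.0)
    flow_term = math.sqrt(K_AIR / (R_AIR * t1_k) * (2.0 / (K_AIR + 1.0)) ** exponent)
    return cd * throat_area * p1_pa * flow_term


def slope_slpm_per_psia(cd, throat_diameter_in):
    """Standard volume flow per psia of absolute upstream pressure."""
    rho_std = P_STD_PA / (R_AIR * T_STD_K)    # kg/m^3 at the standard state
    m_dot = choked_mass_flow_kg_s(cd, throat_diameter_in, 1.0)
    return m_dot / rho_std * 1000.0 * 60.0    # L/min per psia


# Slopes reported in Equations 2, 3, and 4 of the report:
for diameter_in, reported in [(0.1485, 9.00), (0.214, 18.69), (0.296, 35.76)]:
    computed = slope_slpm_per_psia(0.98, diameter_in)
    print(f"{diameter_in} in.: computed {computed:.2f}, reported {reported:.2f} slpm/psia")
```

With these assumed constants the computed slopes fall within a couple of percent of the reported ones; the small difference presumably reflects a slightly different choice of standard conditions or gas properties in the original calculation.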


The calibration equations obtained for each nozzle are shown below.

Air Flow Rate      Nozzle
(slpm, 70 °F)      Diameter, in.    Nozzle Flow Rate Correlation Equation

250-1000           0.1485           Air Flow (slpm, 70 °F) = 9.00*(P + 12.6)     (2)
500-2000           0.214            Air Flow (slpm, 70 °F) = 18.69*(P + 12.6)    (3)
1000-4000          0.296            Air Flow (slpm, 70 °F) = 35.76*(P + 12.6)    (4)

Where P is the upstream pressure (psig) of the nozzle, and 12.6 psia is the mean ambient
pressure in the BYU Combustion Laboratory.
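
Equations 2 through 4 can be applied in either direction: to read a flow from a gauge pressure, or to find the pressure set point for a desired flow. The sketch below is illustrative only. The rule of picking the smallest nozzle whose range covers the requested flow is an assumption made here for the example, not a procedure described in the report, and the function names are hypothetical.

```python
# Sketch: using the nozzle correlations (Equations 2-4) to convert between
# upstream gauge pressure and air flow.

AMBIENT_PSIA = 12.6   # mean ambient pressure in the BYU laboratory

# (throat diameter in., slope in slpm/psia, minimum slpm, maximum slpm)
NOZZLES = [
    (0.1485, 9.00, 250.0, 1000.0),
    (0.214, 18.69, 500.0, 2000.0),
    (0.296, 35.76, 1000.0, 4000.0),
]


def air_flow_slpm(slope: float, upstream_psig: float) -> float:
    """Air flow in slpm (70 °F) for a given upstream gauge pressure."""
    return slope * (upstream_psig + AMBIENT_PSIA)


def required_psig(slope: float, flow_slpm: float) -> float:
    """Upstream gauge pressure needed to obtain the requested flow."""
    return flow_slpm / slope - AMBIENT_PSIA


def pick_nozzle(flow_slpm: float):
    """Return (diameter, set point psig) for the smallest suitable nozzle.

    Choosing the smallest nozzle is an assumption made for illustration.
    """
    for diameter_in, slope, low, high in NOZZLES:
        if low <= flow_slpm <= high:
            return diameter_in, required_psig(slope, flow_slpm)
    raise ValueError("requested flow is outside the 250-4000 slpm range")


diameter, setpoint = pick_nozzle(1500.0)
print(f"use the {diameter} in. nozzle at about {setpoint:.1f} psig")
```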


Fuel Flow System. The fuel flow system has been designed to provide a maximum
propane flow rate of 104 slpm (70 °F). Figure 7 shows a schematic of the fuel system.
This system consists of a 100 lb (ca. 20 gal.) bottle of liquefied propane with a
pressure regulator, a pressure valve, a thermocouple, and two rotameters. In order to
obtain high resolution measurements for both the Task 100 and the Task 150, two
rotameters (of the same flow capacity) have been installed. One rotameter has a fine adjusting
valve and the other has a coarse valve. The meters are mounted on the control
panel and are connected in parallel. The total flow rate of propane is the sum of the flow
rates through both rotameters. The fine valve allows precise control of the propane flow
rate through one of the rotameters, which allows precise fuel flow rates to be set over
the wide range of flow rates needed for both the Task 150 and Task 100 burners.

The fuel flow rate as a function of rotameter scale reading and back pressure on the
rotameter was determined in the following manner. Air flow calibration data for both flow
meters, as shown in Table 1, were provided by the manufacturer. Equation 5, used for flow in a
variable area flow meter (Beckwith, et al., p. 497), was used to correct the factory
air flow calibration to propane:

$$ Q_{C3} = Q_{air} \sqrt{\frac{(\rho_f - \rho_{C3})\,\rho_{air}}{(\rho_f - \rho_{air})\,\rho_{C3}}} \tag{5} $$

where $\rho_f$ is the density of the float, $\rho_{C3}$ is the density of propane at standard
conditions, and $\rho_{air}$ is the density of air at standard conditions.

Figure 6 - BYU (P & W) Test Facility Air Control Calibration. (Horizontal axis: Upstream Pressure of Choked Nozzle, psig.)

Figure 7 - Schematic of Fuel Delivery System for Test Facility.


A high pressure differential is required to achieve high fuel flows through
the restricted Task 150 high swirl injector. Consequently, the differential pressure (ΔP)
must be considered in the flow rate correlation. The density of the propane
flowing through the rotameter was modeled by the ideal gas law. A correlation of flow rate
to scale reading at various upstream pressure levels was made, and the effect of upstream
pressure was determined by correlating the linear curve fit coefficients (A and B in
Equation 6) with pressure level. The correlations (Equations 7 and 8) for A and B were based
on ambient temperature and pressure of 70 °F and 12.6 psia, respectively. The correlation
equations used for the propane flow rates are given in Equations 6 through 8.


Table 1. Manufacturer Flow Meter Calibration Data for Air @ 70 °F and 1 atm.

Scale Reading      Flow (slpm)

150.0              59.494
140.0              55.744
130.0              51.772
120.0              48.038
110.0              43.862
100.0              39.806
 90.0              35.649
 80.0              31.626
 70.0              27.520
 60.0              23.314
 50.0              19.211
 40.0              15.382
 30.0              11.492
 20.0               7.660
 10.0               3.728

$$ \text{Fuel Flow (slpm, 70 °F)} = A + B \cdot (\text{Scale Reading}) \tag{6} $$

Where:

$$ A = -0.37013 + (1.1978 \times 10^{-2})\,\Delta P - (3.31319 \times 10^{-4})\,\Delta P^2 + (3.3030 \times 10^{-6})\,\Delta P^3 \tag{7} $$

$$ B = 0.34926 \;[?]\; (1.1303 \times 10^{-2})\,\Delta P + (2.9557 \times 10^{-4})\,\Delta P^2 - (3.1174 \times 10^{-6})\,\Delta P^3 \tag{8} $$

ΔP is in psig.

The flow rate of the propane is based on a factory air calibration table converted to propane
with an assumed temperature of 70 °F. The flow rate of propane is determined from the rotameter
scale reading and the downstream pressure. These correlations will eventually be checked
against a calibrated flow cell.
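
The air to propane correction of Equation 5 can be illustrated with a few entries from the manufacturer's air calibration in Table 1. This is a minimal sketch under stated assumptions: the gas densities are ideal gas values at 70 °F and 1 atm, and the float density of 8000 kg/m^3 (roughly that of stainless steel) is a hypothetical value, since the report does not give the float material. The pressure dependence captured by Equations 6 through 8 is not included.

```python
import math

# Sketch: converting a rotameter air calibration to propane with Equation 5.
# Assumed values (not from the report):
M_AIR = 28.965          # g/mol, molar mass of air
M_PROPANE = 44.097      # g/mol, molar mass of propane
RHO_AIR_STD = 1.1996    # kg/m^3, ideal gas air at 70 °F and 1 atm
RHO_PROPANE_STD = RHO_AIR_STD * M_PROPANE / M_AIR   # ideal gas scaling
RHO_FLOAT = 8000.0      # kg/m^3, hypothetical stainless steel float


def air_to_propane_factor(rho_float, rho_air=RHO_AIR_STD, rho_c3=RHO_PROPANE_STD):
    """Ratio Q_C3 / Q_air for the same float position (Equation 5)."""
    return math.sqrt((rho_float - rho_c3) * rho_air / ((rho_float - rho_air) * rho_c3))


# A few (scale reading, air flow in slpm) pairs taken from Table 1:
AIR_CALIBRATION_SLPM = {150.0: 59.494, 100.0: 39.806, 50.0: 19.211}

factor = air_to_propane_factor(RHO_FLOAT)
propane_calibration_slpm = {
    reading: flow * factor for reading, flow in AIR_CALIBRATION_SLPM.items()
}

for reading, flow in propane_calibration_slpm.items():
    print(f"scale {reading:5.1f}: about {flow:.2f} slpm propane")
```

Because the float is far denser than either gas, the factor is close to the square root of the air to propane density ratio, so the result is not very sensitive to the assumed float density.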

Test Facility Operational Limits. The stoichiometric air to fuel ratio (A/F)st can be
determined from the following stoichiometric chemical equation:

C3H8 + 5 O2 + 18.590 N2 + 0.224 Ar + 0.007 CO2 ->

3 CO2 + 4 H2O + 18.590 N2 + 0.224 Ar + 0.007 CO2     (9)

The stoichiometric balance gives an (A/F)st equal to 23.821 (mol air / mol propane).

The value of the fuel equivalence ratio (φ) is calculated from the following relation:

φ = 23.821 • Fuel Flow / Air Flow     (10)

Where:

Fuel Flow = the measured flow rate of propane, slpm (70 °F)

Air Flow = the measured flow rate of air, slpm (70 °F).

The operational envelope in terms of air flow and fuel flow for the BYU test facility is
displayed in Figure 8. Lines of constant φ are shown between the rich and lean flammability
limits of propane and air.
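
The relations above are simple enough to sketch in a few lines. The example below rebuilds the stoichiometric air to fuel ratio from the air composition in Equation 9 and applies Equation 10; the function names are illustrative, and treating slpm ratios as mole ratios assumes ideal gas behavior for both streams.

```python
# Sketch: fuel equivalence ratio from measured propane and air flows.
# Moles of each air component per mole of propane, from Equation 9.
STOICH_AIR_MOLES = {"O2": 5.0, "N2": 18.590, "Ar": 0.224, "CO2": 0.007}
STOICH_AIR_PER_FUEL = sum(STOICH_AIR_MOLES.values())   # 23.821 mol air / mol propane


def equivalence_ratio(fuel_slpm: float, air_slpm: float) -> float:
    """Equation 10: phi = (A/F)st * fuel flow / air flow."""
    return STOICH_AIR_PER_FUEL * fuel_slpm / air_slpm


def fuel_flow_for_phi(phi: float, air_slpm: float) -> float:
    """Propane flow (slpm) needed to reach a target phi at a given air flow."""
    return phi * air_slpm / STOICH_AIR_PER_FUEL


# Maximum design flows of the facility: 104 slpm propane, 4000 slpm air.
phi_max_flows = equivalence_ratio(104.0, 4000.0)
print(f"phi at maximum design flows: {phi_max_flows:.3f}")
```

At the maximum design flows of 104 slpm propane and 4000 slpm air, the equivalence ratio is about 0.62, so richer mixtures at the highest air flows are outside the facility's fuel flow capacity.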

Laser Diagnostic Design. The CARS laser system is contained in a room adjacent to
the burner facility. Initially, this laser system will be used to make CARS temperature
measurements in both the Task 100 and Task 150 burners. The laser diagnostic system
uses a Nd:YAG laser as the pump laser. This pump laser provides the two pump
beams needed in a folded BOXCARS phase matching configuration. The pump laser is also
used to pump a dye laser that is tuned to the Stokes Raman resonance of nitrogen. The dye
laser beam, combined with the two pump beams in a proper phase matching configuration,
generates a CARS signal at the nitrogen anti-Stokes Raman frequency that contains
information regarding the temperature and species concentration of the nitrogen gas. An
alternate scheme that is also being considered is a USED CARS phase matching
configuration. Once the CARS signal has been generated, it is returned to the
spectrometer on a fiber optic cable for subsequent collection with the optical
multichannel analyzer and analysis with the Micro-VAX II computer.

The optical design for the USED CARS laser setup is presented in Figure 9. On
either side of the burner are optical breadboards that are elevated to a height of 63 1/2"
(1.61 m) from the ground. The third axis of movement is achieved through translation
of the field lenses located near the burner windows. The diagnostic volume needs to
travel only a maximum of 6" (150 mm) across the burner. This travel is perpendicular to,
yet in the same plane as, the movement of the x-axis translator described above.

Other laser diagnostic techniques are being considered to investigate the
combustion and flame characteristics in the burner. These techniques include LDA
measurements of gas velocities and laser sheet lighting with TiCl4 seeding for flow
patterns, as well as laser induced fluorescence of OH or NO using dye laser sheet lighting
and film or electronic CCD cameras. The equipment for laser sheet lighting using the TiCl4
seed and a film camera is readily available within the BYU Combustion Laboratory. The other
laser techniques will require additional equipment that is not currently available.

Figure 8 - Operating Limits for BYU (P & W) Test Facility. (Horizontal axis: Air Flow Rate, slpm (70 °F).)

EXPERIMENTAL RESULTS

After all of the facility systems were assembled and installed, flow tests were
performed on the air, water, and fuel systems. After correcting some minor problems, a number of
operational checkout tests were performed with the Task 150 burner configuration. The burner
was found to operate well. No operational problems were found. The air flow system controlled
well, and the exhaust hood cooling system proved to be more than adequate. Finally, three sets
of lean blow out data were collected with the Task 150 and the Task 100 burners. This permitted
preliminary comparisons between the two burner configurations at the reduced ambient pressure
in the BYU Combustion Laboratory, and with previous LBO work done at WPAFB on both the
Task 100 and the Task 150 burners. A brief summary of the checkout tests and the LBO
experimental results follows.

Check Out Tests. Checkout tests were conducted on each of the
different test facility systems to ensure they operated as designed. The translating test stand
positioned the burner along the x and z axes as designed. When the water cooled hood was checked
out, the inner jacket experienced some deformation due to unexpectedly high pressure from the
main water line. The hood was repaired and a water pressure regulator set at 5 psig was installed
to prevent future overpressurization. A thermocouple placed in the center of the hood exit
showed that the burner exhaust was being quenched to about 200 °F (93 °C) or less.
Furthermore, the exhaust ducting leading from the hood to the exit fan was only warm to the touch,
which indicated excellent cooling of the burner exhaust.

During the checkout test of the air system, the closed feedback control loop could be
stabilized down to a plus or minus 1 psi oscillation at 50 psig. With the open loop
control, by contrast, once a voltage was set no instability was noted in the pressure, although the
pressure did decrease slightly over long periods of time due to pressure blow down of the supply
tanks. No problems were discovered when the fuel system was checked out. Although the
hardware needed to interface the laser diagnostic techniques to the facility has been designed
and purchased, the laser diagnostic system has not yet been set up on the burner facility.

Figure 9 - Schematic of Optical Train Configuration for USED CARS

When the Task 150 burner was checked out, it was ignited and operated through a series of
air and fuel flow rates. No problems were encountered during these checkout experiments.

Lean Blow Out Test Results. A series of lean blow out tests was conducted with both the
Task 150 and the Task 100 burners. Three sets of LBO data were obtained, one with the Task
150 burner and two with the Task 100 burner. The Task 150 burner was configured with the 0
inch extension and 0% orifice plate for Test Set 1. For Test Set 2, the Task 100 burner was
configured the same as for Test Set 1. In the last test set, Test Set 3, the Task 100 burner still had
no extension, but a 62 % orifice plate was installed on the exit of the burner.

The lean blow out limit, φ_LBO, was determined by setting the air flow in the combustor and
slowly reducing the fuel flow rate (gaseous propane) until the flame extinguished. An LBO test
matrix was created for each test set. The test matrices for Test Sets 1 and 2 were identical, whereas
the matrix for Test Set 3 was similar but had fewer data points. The matrix for Test Sets 1 and 2
started at an air flow rate of 250 slpm and increased in increments of about 50 slpm until an air
flow rate of about 1000 slpm was reached. Test Set 3 began at an air flow rate of about 900 slpm and
decreased in increments of about 200 slpm down to an air flow rate of about 300 slpm. Four LBO
limits were collected at each air flow rate setting for all three test sets. For each test set, the fuel
equivalence ratio at LBO was plotted against the corresponding air flow rate. Differences in
barometric pressure between the several days of testing were taken into account in the
air flow rate. The results for each burner are presented below and compared with each other and
with LBO data from previous work at WPAFB (Hedman, 1990; Hedman and Warren, 1991).
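
The reduction of a blow out point to a fuel equivalence ratio can be sketched as follows. This is
a minimal illustration, not the data reduction actually used at BYU. It assumes that both flows are
expressed in slpm at the same reference conditions (so the volume ratio equals the mole ratio), that
the fuel is pure propane, and that air is 21% oxygen by volume. The function names and the sample
flow readings are hypothetical.

```python
# Stoichiometry: C3H8 + 5 O2 -> 3 CO2 + 4 H2O
O2_PER_MOLE_PROPANE = 5.0
# Assumption: dry air is 21% O2 by volume.
O2_MOLE_FRACTION_IN_AIR = 0.21
# Moles of air needed per mole of propane at stoichiometric conditions (about 23.8).
STOICH_AIR_PER_FUEL = O2_PER_MOLE_PROPANE / O2_MOLE_FRACTION_IN_AIR


def equivalence_ratio(fuel_slpm, air_slpm):
    """Return the propane/air fuel equivalence ratio from two volumetric flows.

    Both flows must refer to the same standard conditions so that the
    volume ratio equals the mole ratio.
    """
    if fuel_slpm < 0 or air_slpm <= 0:
        raise ValueError("fuel flow must be >= 0 and air flow must be > 0")
    return (fuel_slpm / air_slpm) * STOICH_AIR_PER_FUEL


def mean_lbo_limit(fuel_slpm_at_blowout, air_slpm):
    """Average phi_LBO over repeated blow out points at one air flow setting."""
    if not fuel_slpm_at_blowout:
        raise ValueError("at least one blow out point is required")
    phis = [equivalence_ratio(f, air_slpm) for f in fuel_slpm_at_blowout]
    return sum(phis) / len(phis)


if __name__ == "__main__":
    # Hypothetical readings: four repeat blow out points at 500 slpm of air.
    repeats = [10.4, 10.6, 10.5, 10.5]
    print(f"phi_LBO = {mean_lbo_limit(repeats, 500.0):.3f}")
```

Averaging the four repeat points taken at one air flow setting gives a single value of φ_LBO for
that setting.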

BYU Task 150 LBO Limit Results. Figure 10 presents data from the Task 150 burner. At
lower air flow rates, this burner would extinguish completely from a flame that was attached to the
injector. At higher air flow rates, greater than about 400 slpm, the flame would not extinguish, but
would lift and stabilize on a downstream recirculation zone. Further reduction in fuel flow rate
would cause the flame to eventually extinguish completely. At air flow rates above about 740
slpm, it was never possible to get the flame to attach to the injector. Consequently, lean blow out
at these higher air flow rates would always occur from a lifted flame mode. It is important to note
that the attached LBO mode and the separated LBO mode overlapped in the air flow range from
about 400 to about 740 slpm. Further investigation into the operational character of this burner is
needed before an adequate understanding of the LBO phenomena can be obtained.

Figure 10 - Comparison of LBO Data between WPAFB (Summer 1991)
and BYU (Fall 1991) for P&W Task 150 HS Burner
(0 in. Extension, 0% Orifice). [Axis label: Fuel Equivalence Ratio at LBO.]

Figure 11 - Comparison of BYU LBO Data between P&W Task 150
HS Burner and P&W Task 100 Burner (0 in. Extension).

Comparison of BYU and WPAFB Task 150 LBO Data. In Figure 10, the results for the
BYU Task 150 burner have been compared with results from the Task 150 burner collected at
WPAFB during the summer of 1991. Both sets of data were collected under the same
burner configuration and operating conditions, except for a difference in barometric pressure as
indicated in the figure. Since a limited amount of data was collected for this configuration and
operating condition at WPAFB last summer, only a partial comparison can be made. Both sets of
results are within the same range and follow the same general trend (i.e., φ_LBO started low for
both, reached a maximum, and then decreased). Also, both sets of results exhibited some
variation of φ_LBO at the very low air flow rates (150 to 300 slpm). This variation in φ_LBO for both
burners at BYU and WPAFB was thought to be caused by the very unstable flame structure that
exists at those low air flow rates (i.e., low Reynolds number). It appeared that there was reasonable
agreement between the BYU Task 150 burner results and the Task 150 burner results at WPAFB.
Additional data are needed in order to draw a more accurate correlation between operating the Task
150 burner at BYU and at WPAFB.

BYU Task 100 LBO Results. No Task 100 configuration from the 1990 Summer Faculty
Research Program (Hedman, 1990) exactly matched the BYU Task 100 combustor configuration
or operating conditions of Test Set 2. LBO tests at WPAFB were conducted primarily for the Task
100 burner configuration with the 10 inch extension and 0% exhaust orifice plate and with air flow
rates of 1000 slpm (70 °F) or greater. For these reasons, a direct comparison cannot be made
between the recent preliminary BYU data and the data taken at WPAFB. Nevertheless, a
qualitative comparison was made between the BYU Task 100 burner results and the WPAFB Task
100 burner results.

Figure 11 compares the values of fuel equivalence ratio at LBO (φ_LBO) for the Task 100
burner (Test Sets 2 and 3) to the values of φ_LBO obtained for the Task 150 burner (Test Set 1). The
Task 100 values of φ_LBO exhibit a very different trend than do the BYU Task 150 results. The values
of φ_LBO for the Task 100 burner are considerably higher than the values obtained for the BYU Task
150 burner. The Task 100 and Task 150 burners were both expected to have values of φ_LBO of
about 0.50, near the lean flammability limit, similar to well-stirred reactor performance (Sturgess et
al., 1990). The Task 150 results seem to be about what would be expected, but the Task 100
results are much higher than expected, and much higher than similar results at WPAFB (Hedman,
1990). The values of φ_LBO for the WPAFB summer 1990 work were generally in the range of 0.50
to 0.55, whereas the values of φ_LBO for the BYU Task 100 results fell within a range of 0.72 to 0.83.
The higher values of φ_LBO for the BYU Task 100 burner compared to those from WPAFB could be
attributed to the following: 1) differences in the purity level of the propane fuel; 2)
incorrect calibration of the air and/or fuel flow meters; 3) the different burner configuration and air
flow rates; and 4) the difference in barometric pressure at WPAFB compared to BYU.

There are several types of impurities found in propane that could contribute to the observed
differences in φ_LBO. Inert gases like CO2 or N2 occupy volume and cause a lesser volume of
propane to flow through the flow meters than is being metered. This could cause an apparently
higher value of φ_LBO. This is thought to be an unlikely cause of the observed differences, since
the values of φ_LBO for the Task 150 burners at BYU and WPAFB are in relatively good agreement.
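
The size of the impurity that would be needed can be estimated with a short calculation. The
sketch below assumes the simplest possible picture, which is not stated in the report: the metered
stream is read as pure propane, an inert mole fraction x of that stream carries no fuel, and any
effect of the impurity on the meter response is ignored. Under that assumption the true
equivalence ratio is (1 - x) times the apparent one. The function names are hypothetical, and the
sample values are the mid-points of the ranges quoted above (0.72 to 0.83 at BYU, 0.50 to 0.55 at
WPAFB).

```python
def true_equivalence_ratio(apparent_phi, inert_mole_fraction):
    """Correct an apparent phi for an inert mole fraction in the metered fuel."""
    if not 0.0 <= inert_mole_fraction < 1.0:
        raise ValueError("inert mole fraction must be in [0, 1)")
    return apparent_phi * (1.0 - inert_mole_fraction)


def inert_fraction_to_explain(apparent_phi, expected_phi):
    """Inert mole fraction that would reduce apparent_phi to expected_phi."""
    if apparent_phi <= 0 or expected_phi < 0 or expected_phi > apparent_phi:
        raise ValueError("need 0 <= expected_phi <= apparent_phi and apparent_phi > 0")
    return 1.0 - expected_phi / apparent_phi


if __name__ == "__main__":
    # Mid-points of the ranges quoted in the text (illustrative only).
    x = inert_fraction_to_explain(apparent_phi=0.775, expected_phi=0.525)
    print(f"required inert mole fraction = {x:.2f}")
```

Under these assumptions an inert fraction of roughly one third would be needed to account for the
whole difference.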

The increased values of φ_LBO for the BYU Task 100 burner compared to those measured at
WPAFB may have been caused by the air flow rate and the fuel flow rate calibrations. The
coefficient of discharge (C_d) for the choked flow nozzle in the air system was assumed to be 0.98
(typical values of C_d lie within the range of 0.97 to 0.99), and the air calibration curve was
determined from the formula for a choked flow nozzle. This formula gives the maximum flow rate
that can possibly pass through a nozzle. Flow rates greater than predicted by this formula would
have been needed in order to explain the observed results. For the fuel flow meters, the factory
calibration for air flow was used and converted to a propane flow rate. This flow correlation could be
in error, and could explain the observed results. However, this is also thought to be unlikely, since
the values of φ_LBO for the Task 150 burners at BYU and WPAFB are in relatively good agreement.
Since neither the air nor the fuel flow calibration has been verified against a known standard (e.g.,
ASTM standards for choked flow nozzles), there exists some possibility of calibration errors.
These flow meters will be calibrated before any final experimental results are taken or reported.
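
For reference, the standard ideal-gas relation for the mass flow through a choked nozzle can be
sketched as below. This is a generic textbook form, not the calibration curve used at BYU. The
throat diameter, supply pressure, and supply temperature in the example are hypothetical, and the
function name is an illustration only.

```python
import math

GAMMA_AIR = 1.4    # ratio of specific heats for air
R_AIR = 287.05     # specific gas constant of air, J/(kg K)


def choked_mass_flow(p0_pa, t0_k, throat_area_m2, cd=0.98,
                     gamma=GAMMA_AIR, r_gas=R_AIR):
    """Mass flow (kg/s) through a choked nozzle for an ideal gas.

    p0_pa and t0_k are the upstream stagnation pressure and temperature.
    cd is the coefficient of discharge (0.98 is the value assumed in the text).
    """
    if p0_pa <= 0 or t0_k <= 0 or throat_area_m2 <= 0:
        raise ValueError("pressure, temperature and area must be positive")
    critical = (2.0 / (gamma + 1.0)) ** ((gamma + 1.0) / (2.0 * (gamma - 1.0)))
    return (cd * throat_area_m2 * p0_pa
            * math.sqrt(gamma / (r_gas * t0_k)) * critical)


if __name__ == "__main__":
    # Hypothetical nozzle: 5 mm throat, 62.6 psia supply, 294 K.
    area = math.pi * (0.005 / 2.0) ** 2
    p0 = 62.6 * 6894.757
    for cd in (0.97, 0.98, 0.99):
        print(f"Cd = {cd:.2f}: {choked_mass_flow(p0, 294.0, area, cd):.5f} kg/s")
```

Because the flow is linear in C_d, the quoted range of 0.97 to 0.99 changes the computed air flow
by only about 1% either side of the assumed value of 0.98.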

There were some differences in the two burner configurations and operating conditions.
The BYU burner did not have the 10 inch extension like the WPAFB burner, and the WPAFB
burner was not run at the lower air flow conditions of the BYU burner. It was not expected,
though, that these differences would produce the dramatic differences observed for φ_LBO in the
BYU Task 100 burner. Experiments will be repeated with identical geometries and air flow rates
before final conclusions about the observed effects are drawn.

Although a comparison at exactly the same Task 100 geometrical configuration has not
been possible (BYU has not yet received the burner extensions from WPAFB), visual
observations have suggested that there are some differences in flame structure between the
flames observed in the Task 100 burners at WPAFB and BYU. The flame at BYU appeared to be
somewhat longer than that observed at WPAFB. The ambient pressure is about 14.2 psia at
WPAFB and about 12.6 psia at BYU, a difference of about 1.6 psi.

In an attempt to understand the cause of the higher values of φ_LBO for the BYU Task 100
burner, the third test set was conducted with the 62% orifice plate installed on the
burner exit. By restricting the exiting flow of the burner with the orifice plate, a slight back
pressure was created in the burner. Twelve LBO limit data points were collected during this
limited test set. As shown in Figure 11, the values of φ_LBO were slightly lower when compared to
the values from Test Set 2. This observation suggests that the reduced back pressure in the
combustion chamber when conducting experiments at BYU may have an impact on the φ_LBO
results. There was no apparent effect of ambient pressure on the Task 150 experimental results.
This preliminary evaluation suggests that the differences in flow patterns and
φ_LBO between the BYU and WPAFB experiments may be attributed to the differences in ambient
pressure (12.6 psia at BYU versus 14.2 psia at WPAFB). Further experiments at both BYU and
WPAFB with identical burner configurations are needed to fully resolve this difference.
Simulation of a reduced ambient pressure with the addition of a nitrogen diluent will also be done.

ACCOMPLISHMENTS, CONCLUSIONS, AND RECOMMENDATIONS
ACCOMPLISHMENTS

The following have been completed during the twelve month period of the Research Initiation
Grant.

1. The burner test facility was completed, operation of the Task 100 and the Task 150 burners has
been demonstrated, and a limited series of lean blow out tests was made to check out the valid
operation of the Task 100 and Task 150 combustors.

2. Lean blow-out measurements with the Task 150 configuration have been compared
directly to similar measurements made at the Aero Propulsion and Power Laboratory,
Wright-Patterson AFB, Dayton, Ohio.

3. A limited number of lean blow-out measurements with the Task 100 configuration have
been collected and compared to similar measurements made at the Aero Propulsion and Power
Laboratory, Wright-Patterson AFB, Dayton, Ohio. Also, LBO measurements with the Task 150
configuration have been collected and compared directly to the measurements from the Task 100
configuration to determine the influence of the swirling jet streams on the lean blow-out limit.

4. A design for the implementation of the existing BYU laser based combustion diagnostic
instrument with the burner test facility has been completed.

CONCLUSIONS

1. The BYU LBO results for the Task 100 burner showed significant differences in φ_LBO
and in the flame structures as an apparent function of ambient pressure. The values of φ_LBO
obtained at an ambient pressure of about 12.6 psia were much higher than those obtained in
similar work at WPAFB at an ambient pressure of about 14.2 psia. The reduced barometric
pressure in the burner also caused the flame in the BYU Task 100 burner to appear longer in the
combustion chamber near LBO than was observed in previous work.

2. It appeared that the high swirl injector used in the Task 150 burner prevented large
changes in the values of φ_LBO (such as those caused in the Task 100 by changes in barometric
pressure) when compared to the results of similar work done at WPAFB on an identical burner.
However, more LBO testing with varying back pressure is needed to substantiate this claim.

3. Two different modes of φ_LBO occur in the Task 150 in an air flow range between about
400 and 740 slpm: a φ_LBO associated with the transition of the flame from an attached mode to a
lifted mode, and a φ_LBO associated with the final extinguishment of the flame from the lifted
mode.

RECOMMENDATIONS

1. It is recommended that further investigations of the complex flow field and flow
patterns inside the Task 150 be conducted. Flow visualization techniques could be used to
examine the complex flow structures that are known to exist in the swirling and recirculating flows.
CARS temperature measurements would also be important in better understanding the complex
flow fields and chemical reactions.

2. It is recommended that the effects of major changes in back pressure on φ_LBO in both
the Task 100 and the Task 150 be studied.

3. It is recommended that a comprehensive study be made of the transition range of the
attached LBO and separated LBO limits in the Task 150. Understanding the mechanisms that
govern the transition between these two operating modes is important to the design of the
swirl fuel injector and combustion chamber. A better understanding of this phenomenon may lead
to improved combustion stability margins for jet engines.

4. It is recommended that the air and the fuel flow meters be calibrated against a known
calibration standard and that the actual composition of the propane being used at both BYU and
WPAFB be determined.

ABSTRACT


Analysis and modeling of aircraft 270V dc electrical power
systems is the main topic of the research project. HVDC at 270
volts has many advantages over the three phase electrical power
systems currently used in aircraft. There are some
problems, particularly instability in the presence of a constant
power load, which are addressed in this research. Computer
models have been developed for the various components of the HVDC
system and for the constant power load. The analysis using EMTP
is included in this report.

1  INTRODUCTION:


Aircraft electrical power systems at present use three
phase ac systems, and research and development into the possible
use of 270V dc as an alternative is underway. The Electrical
Laboratory of the Aero Propulsion and Power Laboratory at
Wright-Patterson Air Force Base is very much concerned with computer
modeling of HVDC systems. One of the main problems foreseen is
instability when constant power loads are applied to the HVDC
distribution system.


2  OBJECTIVES: 

Electrical  load  on  an  aircraft  power  system  usually  consists 
of  a  combination  of  lighting  and  motor  loads.  In  addition  there 
is  a  considerable  amount  of  dc  load  in  an  aircraft  for  flight 
critical  computers  etc.  The  main  power  supply  in  an  aircraft  at 
present  is  a  three  phase  supply  at  400  Hz.  This  frequency  must 
remain  constant  in  the  presence  of  variable  speed  turbines  which 
supply  mechanical  power  to  the  electrical  generators.  There  are 
two  methods  generally  used  for  obtaining  constant  frequency. 

1.  Constant Speed Constant Frequency Systems (CSCF): A
Constant Speed Drive (CSD) is used to keep the
generator running at a constant speed even while the
input to the constant speed drive varies considerably.
The advantage is that the electrical output from the
alternator is at a constant frequency with very little
harmonic content. The main disadvantage of this system
is the very high maintenance required of the CSD.

2.  Variable Speed Constant Frequency Systems (VSCF): In
this system a variable speed alternator supplies
variable frequency three phase power. This output is
rectified and inverted to produce constant frequency
three phase power. The main advantage of this system
is low maintenance cost. The major disadvantage is the
presence of harmonics in the output due to the inverter
operation.

As can be seen above, the three phase ac distribution system
has some drawbacks, and research and development is under way on
using a 270V dc supply as the main electrical power supply. In
this system, the inverter portion of the VSCF system can be
eliminated and all of the dc loads can be connected directly to it.
The low voltage dc loads at 28V can be supplied by using dc-dc
converters. As far as motor loads are concerned, inverter fed
brushless dc motors, inverter fed induction motors, or inverter
fed switched reluctance motors can be used.

The Electromagnetic Transients Program (EMTP), which was
developed in the early seventies for Bonneville Power
Administration (BPA), is an excellent tool for analyzing the
transient behavior of a power system. This package, used in the
present research, is constantly being revised and modified. It
has been found that constant power loads cause instability in
HVDC systems and that the use of a large capacitor and harmonic
filters eliminates the instability and reduces the ripple current
drawn from the power system.

A more detailed analysis of the HVDC system when motor loads
are present should be undertaken. Since EMTP can be used to
model various types of motors, it is possible to analyze the
system under motor loads using EMTP.

3  EMTP 

The  Electromagnetic  Transients  Program  was  developed  in  the 
early  seventies  by  Dr.  W.  Scott  Meyer  of  Bonneville  Power 
Administration.  This  program  is  being  constantly  updated  and  has 
numerous  users  all  over  the  world.  The  program  is  very  extensive 
and  has  all  the  features  necessary  for  the  analysis  of  Aircraft 
Power  Systems.  Some  of  the  highlights  of  the  program  are  as 
follows: 

The various kinds of elements that can be represented are:

1.  RLC branches

2.  Distributed lines

3.  Switches, including diodes, thyristors, power
transistors, and time controlled switches

4.  Sources

(a)  Voltage sources, both ac and dc

(b)  Current sources, both ac and dc

(c)  Voltage and current sources controlled by any
other variable

(d)  Alternators with their full representation

(e)  DC generators and motors with their full
representation

(f)  Induction motors with their full representation

(g)  Analytical sources

5.  Control system blocks. These blocks can be linear or
nonlinear. They can be represented by transfer
function blocks for linear components. The inputs and
outputs of the control system blocks can be interfaced
with the voltages and currents of the electrical
network.

The output results are conveniently obtained in
graphical form on a CRT or on any type of plotter or
printer. In addition, there is a tabular output which
provides highlights of all the necessary results.


4  CONSTANT  POWER  LOAD  ON  AN  IDEAL  HVDC  SYSTEM 

As a starting point for this research, an ideal 270V dc
source is chosen as the power system. The series impedance with
the ideal source is chosen as a small inductance in series with a
parallel combination of an inductance and a resistance. The
series inductance represents the subtransient reactance of the
alternator in the actual HVDC generating system. The parallel
combination of resistance and inductance represents the time
constant and the open loop error. The rectifier filter is
represented by a series RC circuit across the load. The constant
power load is represented by a voltage controlled nonlinear
current source such that the product of the current and the
voltage equals the power. The constant power load is a cyclic
load occurring 100 times a second with a duty cycle of 0.5. The
circuit diagram is shown in Fig. 1. The simulation results are
shown in Fig. 2. Some of the main points of the results are the
following.

1.  There  is  considerable  ringing  in  the  voltage  across  the 
load. 

2.  The  current  drawn  from  the  source  has  a  very  high 
ripple  content  with  a  fundamental  frequency  of  100  Hz. 

An analysis of the circuit shows that the system is unstable
for constant power loads. The system can be stabilized by
including a large capacitor across the constant power load. The
size of the capacitor depends on the magnitude of the load, the
value of the inductance in series with the source and the
resistance in the circuit. The simulation results, with a 3000
microfarad capacitor in parallel with the load, are given in
Fig. 3. The ringing in the voltage is now absent but the ripple
content of the source current is still high.
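
The dependence of the capacitor size on load, inductance, and resistance can be illustrated
with a small-signal sketch. This is a simplified stand-in, not the EMTP model of this report:
it assumes an ideal dc source behind a single series resistance R and inductance L, a capacitor
C directly across the load, and a constant power load linearized as a negative resistance of
magnitude V^2/P. For that second-order circuit the characteristic polynomial is
LC s^2 + (RC - L P/V^2) s + (1 - R P/V^2), which is stable only if both lower coefficients are
positive. All component values and function names below are hypothetical.

```python
def min_stabilizing_capacitance(power_w, v_load, l_source_h, r_source_ohm):
    """Smallest C (farads) for which the damping term R*C - L*P/V^2 is positive."""
    if min(power_w, v_load, l_source_h, r_source_ohm) <= 0:
        raise ValueError("all arguments must be positive")
    return l_source_h * power_w / (r_source_ohm * v_load ** 2)


def is_small_signal_stable(c_f, power_w, v_load, l_source_h, r_source_ohm):
    """Check the two coefficient conditions of the linearized R-L-C circuit."""
    g_load = power_w / v_load ** 2          # magnitude of the negative conductance
    damping = r_source_ohm * c_f - l_source_h * g_load
    stiffness = 1.0 - r_source_ohm * g_load
    return damping > 0 and stiffness > 0


if __name__ == "__main__":
    # Hypothetical values: 25 kW load on a 270 V bus, 100 uH and 0.05 ohm source.
    p, v, l, r = 25e3, 270.0, 100e-6, 0.05
    print(f"C_min = {min_stabilizing_capacitance(p, v, l, r) * 1e6:.0f} uF")
    for c in (100e-6, 3000e-6):
        print(f"C = {c * 1e6:.0f} uF stable: {is_small_signal_stable(c, p, v, l, r)}")
```

In this simplified picture the required capacitance grows with load power and source inductance
and shrinks with source resistance, which matches the qualitative dependence described above.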

The  ripple  current  can  be  reduced  by  inserting  series  LC 
circuits  with  resonant  frequencies  equal  to  100  Hz  and  its  odd 
harmonics,  in  parallel  with  the  load.  Fig.  4  shows  the  effect  of 
inserting  these  harmonic  filters. 
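
The tuning of such filter branches follows from the series resonance condition
f = 1/(2 pi sqrt(LC)). The sketch below sizes the capacitor of each branch for a chosen
inductor. The 1 mH inductor value and the function names are hypothetical and are not taken
from the simulated circuit. An ideal 50% duty cycle pulse train contains only odd harmonics,
which is consistent with tuning the branches to 100 Hz, 300 Hz, 500 Hz, and so on.

```python
import math


def series_lc_capacitance(f_hz, l_h):
    """Capacitance (farads) that makes a series LC branch resonate at f_hz."""
    if f_hz <= 0 or l_h <= 0:
        raise ValueError("frequency and inductance must be positive")
    return 1.0 / ((2.0 * math.pi * f_hz) ** 2 * l_h)


def resonant_frequency(l_h, c_f):
    """Resonant frequency (Hz) of a series LC branch."""
    return 1.0 / (2.0 * math.pi * math.sqrt(l_h * c_f))


def odd_harmonic_filter_bank(fundamental_hz, l_h, n_branches):
    """Return (frequency, capacitance) pairs for the first n odd harmonics."""
    bank = []
    for k in range(n_branches):
        f = (2 * k + 1) * fundamental_hz
        bank.append((f, series_lc_capacitance(f, l_h)))
    return bank


if __name__ == "__main__":
    # Hypothetical design: the same 1 mH inductor in every branch.
    for f, c in odd_harmonic_filter_bank(100.0, 1e-3, 3):
        print(f"{f:5.0f} Hz branch: C = {c * 1e6:8.1f} uF")
```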

5  SIMULATION  OF  A  RECTIFIER  WITH  A  CONSTANT  POWER  LOAD 

The  second  phase  of  the  research  consisted  of  simulating  an 
ideal  three  phase  source  rectified  and  feeding  a  constant  power 
load  cycling  100  times  per  second.  The  circuit  diagram  for  this 
system  is  shown  in  Fig.  5  and  the  results  are  shown  in  Fig.  6. 

The following conclusions can be drawn from the results.

1.  The constant power load causes instability and this
instability can be eliminated by inserting a large
capacitor in parallel with the load. The ripple
content in the load voltage is still high. The current
drawn has a high ripple content of 100 Hz and its
harmonics.

2.  Introduction  of  harmonic  filters  as  suggested  in  the 
earlier  section  reduces  the  ripple  content  of  the 
voltage  and  the  current  drawn  from  the  supply. 

In the next phase of the simulation, the three phase voltage
source magnitude is controlled so that the voltage across
the load is held at 270 volts. The overall control system is of
type one.

Results show that the instability can be eliminated by
inserting a large capacitor.

6  BRUSHLESS  DC  GENERATOR  WITH  A  CONSTANT  POWER  LOAD 

In the next phase of the research, the stability of a
brushless DC generator in the presence of a constant power load is
studied. As in the earlier cases, the overall system is unstable
under a constant power load. The instability is due to the fact
that the constant power load appears as a negative resistance and
the overall impedance as seen by the generator has a negative
real part. Introduction of a large capacitor in parallel with
the load gives the overall impedance a positive real part, and the
overall system is then stable. The value of the capacitor needed to
make the system stable depends on the level of constant power that is
applied as the load. The circuit diagram of the system is shown
in Fig. 7 and the results are shown in Figs. 8 and 9.
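
The negative resistance follows directly from the constant power characteristic. As a brief
worked derivation (the symbols P, V, and r_inc are introduced here for illustration), a load
drawing constant power P at voltage v has, about an operating voltage V:

```latex
\begin{align}
  i &= \frac{P}{v}, \\
  \left.\frac{di}{dv}\right|_{v=V} &= -\frac{P}{V^{2}}, \\
  r_{\mathrm{inc}} &= \left(\left.\frac{di}{dv}\right|_{v=V}\right)^{-1} = -\frac{V^{2}}{P}.
\end{align}
```

For example, a 25 kW load at 270 V (values chosen for illustration) has an incremental
resistance of about -2.9 ohms, and the magnitude falls as the load power rises.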

7  CONSTANT  POWER  LOAD 

In all earlier phases of the research it was assumed that
the load has the characteristic of demanding constant power
irrespective of the voltage across it. This load is simulated as
a current source whose magnitude is such that the product of the
source current and the voltage across it is the negative of the
power demand.

In actual practice, the constant power load, as assumed here, is a
power conditioner whose output is maintained constant
irrespective of the input voltage. When this power conditioner
is connected to a constant resistive load, the output
power is constant. In this phase of the research a buck
type switching regulator is simulated and used to replace the
current sources. The overall system stability is studied. The
circuit diagram of the regulator is shown in Fig. 10 and the
results are shown in Fig. 11.
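
Why a regulated converter with a fixed resistive load looks like a constant power load from the
bus can be shown with a steady-state sketch. It assumes an ideal, lossless buck regulator in
continuous conduction that holds its output voltage at a set point. The 28 V set point, the load
resistance, and the function name are illustrative assumptions and are not the parameters of the
simulated regulator.

```python
def buck_operating_point(v_in, v_out_ref, r_load):
    """Steady-state duty cycle, power and input current of an ideal buck regulator."""
    if r_load <= 0 or not 0 < v_out_ref <= v_in:
        raise ValueError("need r_load > 0 and 0 < v_out_ref <= v_in")
    duty = v_out_ref / v_in               # ideal buck: V_out = D * V_in
    power = v_out_ref ** 2 / r_load       # set by the regulated output only
    i_in_avg = power / v_in               # lossless: P_in = P_out
    return duty, power, i_in_avg


if __name__ == "__main__":
    # Hypothetical case: 28 V regulated output into 7.84 ohms (100 W).
    for v_in in (200.0, 270.0, 320.0):
        d, p, i = buck_operating_point(v_in, 28.0, 7.84)
        print(f"V_in = {v_in:5.1f} V  D = {d:.3f}  P = {p:.1f} W  I_in = {i:.3f} A")
```

The power drawn is the same at every input voltage while the average input current falls as the
input voltage rises, which is the hyperbolic characteristic assumed for the ideal constant power
load.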

An examination of the results shows that a constant power
load derived from a switching regulator does not pose
any problems when the input voltage is derived from ideal
sources. The output voltage of the regulator remains constant
while the input voltage is varied within wide limits. The
settling time of the transient response, when the load resistance
is changed, is of the order of a tenth of a millisecond. This
makes the circuit fully capable of operating a pulse load at 100
cycles per second.

8  MOTOR  LOADS 

The  preliminary  study  shows  that  a  constant  power  load,  when 
viewed  as  a  resistor  with  a  hyperbolic  v-i  characteristic,  poses 
a  stability  problem  for  dc  systems.  However,  when  the  constant 
power  load  is  considered  as  a  constant  resistive  load  on  a  power 
conditioner,  the  dc  system  seems  to  behave  normally. 

The  power  conditioner  circuit  mentioned  in  the  earlier 
section  is  simulated  with  constant  power  load  and  with  additional 
motor  loads.  Two  types  of  motor  load  have  been  simulated: 
Brushless  dc  motors  and  inverter  fed  induction  motors.  The 
results  are  satisfactory  and  the  dc  voltage  at  the  terminals  is 
maintained  fairly  constant. 

The  simulations  using  both  brushless  dc  motors  and  inverter 
fed  induction  motors  show  that  as  the  motors'  loads  are 
increased,  the  effect  of  constant  power  load  is  minimized. 

This research shows that a 270V dc power supply is a
feasible alternative for aircraft power supply. The instability
that is associated with constant power loads in the preliminary
analysis and simulation is in fact not present when the constant
power load is derived through the use of a power conditioner. In
addition, the other loads on the dc power supply help
ensure that no stability problem is present.

There are some additional problems associated with a dc
power supply. One of the main problem areas is corona. Since the
voltage never crosses zero, any corona discharge once started
cannot be extinguished.

9  RECOMMENDATIONS

The Electromagnetic Transients Program (EMTP), a very
powerful simulation tool for power system engineers, has been used
for the simulation and analysis of a 270V dc system when subjected
to constant power loads. It is recommended that, as a follow-up,
the circuits which have been simulated be built using MCTs and
tested in the laboratory, and that the deviation of the
laboratory results from the simulation be examined. The
principal investigator will be submitting a proposal for doing
the experimental analysis at North Dakota State University.

[Figures: EMTP simulation plots for the 270V dc system. Recoverable
plot titles: dc source without large capacitor or harmonic filters;
synchronous machine, 100 kW dc and 25 kW constant power load;
three phase source controlled by TACS; constant power load through
a power conditioner. Recoverable axis labels: load voltage, load
current, power supply current, phase A generator voltage, input
voltage, output voltage.]


DYNAMIC  TEMPERATURE  MEASUREMENTS  IN  REACTING  FLOWS 


When  studying  combustion  systems  and  reacting  flows,  it  is 
important  to  be  able  to  measure  spatially  and  temporally  resolved 
temperatures. These data, along with corresponding velocity data,
aid in the understanding of various aspects of fluid flow such as
stability  characterization  and  control,  interpretation  of  velocity  data, 
and  the  understanding  of  turbulence  correlations  and  compressibility 
effects.  Since  these  fluid  phenomena  are  of  a  rapidly  changing 
nature,  it  is  important  to  have  a  temperature  measurement  rate 
sufficient  to  keep  up  with  such  fluctuations  in  the  flowfield.  In  this 
case,  a  thermal  data  rate  of  about  1  kHz  would  be  required  for  the 
anticipated  application  to  research  ramjet  combustors. 

Current  optical  techniques  based  on  the  quantum-mechanical 
effects  of  light  are  useful  in  their  ability  to  provide  temperature 
measurements  in  flows  due  to  their  non-intrusiveness,  their 
accuracy,  and  their  speed.  They  can,  however,  be  prohibitively 
expensive  and  very  complicated,  and  none  can  yet  provide  combined 
spatial  and  temporal  resolution  at  a  satisfactory  sampling  rate. 
Thermocouples,  on  the  other  hand,  are  relatively  inexpensive  and 
uncomplicated  and  a  great  deal  of  past  experience  provides  a  base  of 
knowledge  upon  which  to  rely.  There  still  remain  problems 
involving  probe  survivability  and  disturbance,  as  well  as  the  usual 
worries  over  data  validity,  but  the  current  research  is  attempting  to 
provide  insight  into  these  areas  of  concern. 

Thermocouples are able to provide very accurate information
about the temperature of the probe junction, but there is a complex
relationship between the junction temperature and the instantaneous
gas temperature. Since the physical size of any thermocouple
prevents it from responding directly to temperature fluctuations at
frequencies above about 200 Hz, compensation is necessary. For the
case of minimal radiation and conduction losses, it may be shown
that:

$$T_c = T_j + \tau \, \frac{dT_j}{dt}$$

where $T_c$ is the actual gas temperature, $T_j$ is the measured
junction temperature, and $\tau$ is referred to as the time constant,
which represents a ratio between the junction thermal capacity and
the convective heat flux. Since $\tau$ is a function of the
instantaneous gas properties, calculating the time constant directly
would not normally be possible, and most researchers have used some
sort of average value. This has proven to be insufficient and an
alternate method of determining $\tau$, and thus the instantaneous gas
temperature, is needed.
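
As a minimal sketch of this first-order correction, the Python below
applies $T_c = T_j + \tau\,(dT_j/dt)$ to a sampled junction signal using
a single, known time constant. The function name `compensate`, the
central-difference gradient, and every numerical value (a 10 msec time
constant, a 200 Hz fluctuation, 100 kHz sampling) are illustrative
assumptions and are not taken from the report.

```python
import math

def compensate(tj, dt, tau):
    """Return Tj + tau * dTj/dt for each sample.

    The gradient is a central difference in the interior and a
    one-sided difference at the two end points.
    """
    n = len(tj)
    tc = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        grad = (tj[hi] - tj[lo]) / ((hi - lo) * dt)
        tc.append(tj[i] + tau * grad)
    return tc

# Hypothetical test signal: gas temperature oscillating at 200 Hz.
tau = 0.010                      # assumed time constant, s
omega = 2.0 * math.pi * 200.0    # assumed fluctuation frequency, rad/s
dt = 1.0e-5                      # assumed sampling interval, s
times = [i * dt for i in range(2000)]
gas = [1500.0 + 100.0 * math.sin(omega * t) for t in times]

# Steady-state response of a first-order lag to that sinusoid:
# attenuated by 1/sqrt(1 + (omega*tau)^2) and delayed by atan(omega*tau).
gain = 1.0 / math.sqrt(1.0 + (omega * tau) ** 2)
phase = math.atan(omega * tau)
junction = [1500.0 + 100.0 * gain * math.sin(omega * t - phase) for t in times]

recovered = compensate(junction, dt, tau)
# The end points use a less accurate one-sided gradient, so compare the interior.
max_err = max(abs(r - g) for r, g in zip(recovered[1:-1], gas[1:-1]))
```

With these assumed values the junction sees less than a tenth of the
gas fluctuation amplitude, which is why an accurate time constant
matters so much to the corrected result.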

One possible solution is to use a two-wire thermocouple probe
in  which  two  junctions  of  differing  diameters  are  placed  close 
together.  This  technique,  theoretically,  allows  the  determination  of 
an  instantaneous  time  constant  based  on  the  temporal  temperature 
gradients  of  each  junction.  The  focus  of  this  research  is  to  prove  this 
technique  and  apply  it  to  an  actual  flow  with  possible  future 
correlation  to  a  CARS  system  and  then  to  an  LDV  for  simultaneous 
velocity  measurements. 


LITERATURE  REVIEW 

An  extensive  review  of  thermocouple  techniques  has  been 
conducted  and  an  abbreviated  version  of  this  review,  limited  to  those 
studies  relevant  to  the  proposed  program,  is  presented.  The  primary 
items  of  interest  in  this  review  are:  flowfield  type,  size  of  junction  or 
wire,  coating,  radiation  and  conduction  corrections,  temporal 
resolution,  and  response  compensation.  Virtually  all  the  referenced 
experimental  studies  used  wires  of  Pt  and/or  Pt-Rh. 

Fristrom  and  Westenberg  (1965)  provide  detailed  instructions 
on  the  fabrication  of  small  thermocouples  and  generating  silica 
coatings  for  the  elimination  of  catalysis  effects.  The  coating 
technique  involves  passing  the  junction  through  a  flame  containing 
silicon  dioxide  generated  from  the  combustion  of  a  compound  such  as 
dimethyl  siloxane.  Thermocouples  prepared  using  this  method  were 
evaluated in hydrogen-air flames by Cookson et al. (1964), with
significant errors in the mean measured temperature for the larger
thermocouples attributed to recombination reactions in the boundary
layer.  Radiation  corrections  were  implemented,  but  transient 
response  effects  were  not  considered.  Observed  differences  between 
coated  and  uncoated  thermocouples  due  to  surface  catalysis  were  on 
the  order  of  400C,  for  mean  temperatures  of  1100C. 

Kent  (1970)  reported  that  silica  is  not  appropriate  for 
temperatures  above  about  1100C,  as  free  silicon  diffuses  into  the 
junction,  altering  the  properties  of  the  thermocouple.  A  yttria- 
beryllia  coating  was  suggested  as  an  alternate,  tested,  and  found  to 
give  good  results. 

Yule  et  al.  (1978)  offer  a  good  discussion  of  transient 
compensation  concepts,  and  provide  a  correction  for  non-cylindrical 
junctions.  (Heat  transfer  to  a  cylinder  has  often  been  assumed  in 
predicting  thermal  response.)  Average  (not  instantaneous)  time 
constants  for  the  actual  junctions  were  determined  by  observing  the 
response  to  a  step-change  in  temperature  by  measuring  the  decay  in 
measured  temperature  after  a  heating  current  was  switched  off.  It 
was  stated  that  this  technique  will  not  work  unless  the  thermocouple 
wires  to  the  junction  are  much  smaller  than  the  probe  lead  wires. 
Wire  size  and  junction  size  were  25  and  70  microns,  respectively;  a 
typical  time  constant  was  30  msec.  Some  reasonable  agreement  was 
achieved  between  predicted  average  time  constants  based  on  Nusselt 
number  relations  and  measured  data.  In  a  propane-air  diffusion 
flame,  average  time  constants  were  determined  at  different  locations 
in  the  flowfield  by  pulsed  overheating  (again,  a  response  to  a  known 
step  change  in  temperature)  and  used  to  correct  the  observed 
temperatures.  These  thermocouples  were  apparently  not  coated. 
Independent  confirmation  of  the  observed  temperatures  was  not 
provided.  A  two-wire  technique,  with  one  wire  constantly  heated  to 
provide  heat  transfer  data  for  response  correction,  was  suggested  as 
a  future  development. 
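
The overheat-decay idea can be illustrated with a short Python sketch
that recovers an average time constant from the decay of the junction
temperature after the heating current is switched off. The
log-linear least-squares fit, the function name `fit_time_constant`,
and the synthetic data are assumptions made for illustration; the
reviewed studies do not state how the decay curves were reduced.

```python
import math

def fit_time_constant(times, temps, t_final):
    """Estimate tau from a decay T(t) = t_final + dT * exp(-t / tau).

    A straight line is fitted by least squares to ln(T - t_final)
    versus t; its slope is -1 / tau. Every sample must lie above
    t_final for the logarithm to be defined.
    """
    logs = [math.log(temp - t_final) for temp in temps]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    return -sxx / sxy

# Synthetic decay with hypothetical values: 150 degrees of overheat
# relaxing toward 1200 degrees with a 30 msec time constant.
true_tau = 0.030
times = [i * 1.0e-3 for i in range(61)]
temps = [1200.0 + 150.0 * math.exp(-t / true_tau) for t in times]
estimated_tau = fit_time_constant(times, temps, t_final=1200.0)
```

A fit of this kind yields one value per probe location, which is
exactly the average (not instantaneous) time constant discussed above.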

Lockwood and Odidi (1973) measured mean temperatures in a
turbulent diffusion methane flame with a 40 micron thermocouple.
No corrections for radiation or conduction were made and the
thermocouple was not coated. An average time constant for each
location in the flame was determined by an AC excitation method; a
typical value was 10 msec. This time constant was then applied to
an on-line compensation circuit to obtain a corrected voltage. The
use of an average, rather than instantaneous, time constant was
estimated to generate errors of up to 10 percent in the mean
temperature and 20 percent in the instantaneous temperature.

Lockwood and Moneib (1980) measured the temperature in a
nonreacting, electrically heated free jet with a 12.7 micron
thermocouple compensated for transient response. (They also
concluded that the results of the previous work were not as good as
originally claimed.) An on-line, digital compensation network was
developed;  the  appropriate  time  constant  was  still  an  average  value 
determined  by  a  pulsed  overheat  method.  The  time  constant  was  on 
the  order  of  15  msec;  noise  limited  the  max  frequency  response  to  5 
kHz.  Prodigious  quantities  of  results  were  plotted,  including  mean 
profiles,  PDFs,  flatness,  and  skewness,  and  were  described  as 
"physically  realistic." 

El  Banhawy  et  al.  (1983)  measured  mean  values  of 
temperature,  velocity,  and  concentrations  in  step-stabilized, 
premixed,  methane-air  flames.  The  thermocouples  were  15,  40  or  80 
micron  diameter,  with  the  maximum  indicated  temperature 
difference  between  the  three  being  40C.  Conduction  effects  were 
claimed  unimportant  as  a  result  of  the  probe  design,  radiation  effects 
were  not  addressed,  the  junctions  were  apparently  not  coated,  and 
no  correction  for  response  was  implemented.  A  frequency  response 
to  200  Hz  was  claimed. 

Heitor  et  al.  (1985)  measured  temperature  simultaneously  with 
LDV  data  in  premixed,  disc-stabilized,  natural  gas  flames.  The 
maximum  error  due  to  radiation  was  estimated  to  be  100K;  a  specific 
correction  was  not  made.  The  thermocouple  output  was  digitized  and 
stored,  with  the  compensation  being  done  off-line  by  determining 
the gradient dTj/dt from the stored data. An average time constant
was not used; rather, an instantaneous time constant was determined
based on the measured velocity, physical characteristics of the
junction, and a heat transfer relation for fine wires based on the
results of Collis and Williams (1959), with some revisions. This was a
good idea, but it did not work very well in the reacting flows, as the
gas properties are not uniquely determined by the temperature because
of large variations in chemistry. Plots of temperature PDFs near the
edge of the flame showed extreme overcompensation by this method,
so the "instantaneous" time constants were all multiplied by 0.65
(a trial-and-error value) to make the PDFs look better. The junctions
were left uncoated to keep the time constants as low as possible.
Significant  build-up  of  the  titanium  dioxide  seeding  particles  on  the 
thermocouple  surface  caused  the  response  to  degrade  rapidly  with 
run  time,  with  LDV  data  rates  of  100/sec  being  high  enough  to 
render  the  compensation  technique  essentially  useless.  The  wire 
diameters  were  again  15,  40,  or  80  microns. 

As  part  of  a  modeling  verification,  Abdalla  et  al.  (1982)  used 
silica-coated,  100  micron  thermocouples  to  measure  mean 
temperature  in  premixed,  methane-air  flames.  Uncoated,  50  micron 
wires  were  used  for  temporally  resolved  temperatures.  An  average 
time  constant  was  determined  at  each  location  in  the  flowfield  by  an 
overheat method and dialed into an electronic compensation RC
network, responsive to 5 kHz. The accuracy of these results was not
discussed  to  any  significant  extent. 

Masri and Bilger (1984) measured average temperatures in
hydrocarbon turbulent diffusion flames using an uncoated,
uncompensated  thermocouple  with  a  130  micron  wire  diameter  and 
270  micron  bead  diameter.  Radiation  errors  were  estimated  to  be 
30C  maximum.  Masri  and  Bilger  (1986)  used  this  same  system  to 
determine  average  temperatures  in  a  different  burner.  Starner  and 
Bilger  (1986)  measured  mean  temperatures  in  a  swirling  hydrogen 
diffusion  flame  with  a  200  micron  (bead  size)  thermocouple, 
corrected  for  radiation  losses.  Transient  compensation  was  again 
ignored.  Neither  the  absolute  accuracies  nor  effect  of  the  probe  on 
the  flow  was  discussed. 

Yoshida  and  Tsuji  (1978)  evaluated  the  temperature  and 
velocity  distributions  in  a  premixed  propane  flame,  but  not 
simultaneously.  The  thermocouple  was  50  micron  diameter  and 
uncoated.  Transient  compensation  was  accomplished  in  a 
conventional  manner  (initially)  by  determining  the  time  constant 
from  an  overheat-response  method,  then  incorporating  this  value  in 
a 5 kHz, RC compensation circuit. This gave maximum instantaneous
temperatures higher than adiabatic flame temperatures, and
minimum temperatures lower than ambient. To correct this
discrepancy,  a  series  of  different  time  constants  was  used  in  the 
compensation  circuit  until  a  value  was  found  which  normalized  the 
output  between  the  ambient  and  adiabatic  flame  temperatures.  This 
value  (40  msec)  was  then  used  throughout  the  flowfield.  Some 
"mismatching"  was  expected  by  the  authors  from  this  procedure. 
Temperature  fluctuations  of  400C,  at  frequencies  above  1  kHz,  were 
found. 

Katsuki  et  al.  (1987)  developed  a  linearization  technique  for 
time-response  compensation  and  evaluated  the  procedure  by  rapidly 
vibrating the thermocouple junction across the flame front of a
laminar  diffusion  flame.  Both  coated  and  uncoated  wires  were  used. 
The  technique  utilized  for  compensation  essentially  determines  a 
film  temperature  for  the  wire  and  evaluates  the  instantaneous  time 
constant  based  on  that  temperature.  Variations  in  chemistry  are 
apparently  not  accounted  for.  Differences  in  measured  values 
between  coated  and  uncoated  junctions  were  ascribed  to  radiative 
effects  in  laminar  diffusion  flames  and  catalysis  in  turbulent 
premixed  flames.  Results  from  the  new  technique  were  compared  to 
results  using  a  standard,  average  time  constant  compensation 
technique.  Neither  procedure  was  found  to  give  especially 
impressive  measurements;  the  authors  concluded  that  simultaneous 
velocity  data  were  necessary  to  properly  account  for  variations  in 
the  time  constant.  It  was  also  concluded  that  coating  did  not 
adversely  affect  the  determination  of  real-time  data;  although  the 
time  constant  was  increased  due  to  the  larger  diameter, 
compensation  produced  the  same  temperature  PDF's  as  for  the 
compensated,  uncoated  thermocouple. 

Lenz  and  Gunther  (1980)  used  an  uncoated,  50  micron, 
frequency  compensated  thermocouple  to  determine  time-resolved 
temperature  in  a  free-jet  diffusion  flame.  Compensation  was 
accomplished  by  determining  the  average  time  constant  at  each 
position  in  the  flow  by  an  overheat-response  method,  then  using  the 
resulting  value  in  an  on-line  electrical  compensation  network. 
Conduction, radiation, and catalytic effects were not corrected.
Response  to  8  kHz  was  obtained,  limited  by  noise.  Several  apparent 
discrepancies  in  the  temperature  data  were  explained  away  by 
physical  arguments. 

Brum  et  al.  (1983)  used  a  25  micron,  uncoated,  compensated 
thermocouple  to  acquire  temperature  data  simultaneously  with  LDV 
data  in  a  swirling,  reactive  flow.  The  compensation  was  similar  to 
that  of  Lockwood  and  Moneib,  using  an  in-situ  overheat  method  to 
determine  an  average  time  constant  and  on-line  electronic 
compensation  of  the  signal.  The  authors  indicated  that  this  average 
time-constant  approach  was  not  optimum,  and  attributed  some 
apparent  discrepancies  in  the  data  to  the  instrumentation.  It  was 
determined  that  a  variation  of  10  percent  in  the  time  constant  could 
lead to errors as high as 50 percent in local heat flux. An alternative
technique, utilizing the instantaneous velocity to modify the time
constant, was suggested. (This approach was later used by Heitor et
al. with minimal success.) Probe perturbation effects were also
studied,  with  large  variations  in  velocity  discovered  when  the  probe 
was  inserted,  due  to  both  local  and  large-scale  effects.  This  is  to  be 
expected  in  elliptic  flows. 

Chandran  et  al.  (1984)  measured  temperature  with  a  coated,  25 
micron  thermocouple,  coincident  with  LDV  data,  in  a  premixed 
turbulent  flame.  An  average  time  constant  was  determined  by  a 
cross-spectral  analysis  technique,  using  the  response  from  two 
closely  spaced  thermocouples  to  determine  the  time  constants  for 
both.  It  was  claimed  that  the  use  of  an  average  time  constant  does 
not  introduce  significant  error  in  the  temperature  results.  Probe 
interference  was  minimized  by  adjusting  the  probe  configuration 
until  the  local  velocity  closely  matched  the  velocity  measured  when 
no  probe  was  in  the  flow. 

Elmore et al. (1986) conducted a specific program for dynamic
temperature measurement using a dual-junction method with off-line
compensation. The dual junctions were used to determine the
response characteristics of the smaller thermocouple, which was then
corrected. Correction was not done on a point-by-point basis, but
rather in the frequency domain. Essentially, the technique
determined the convective heat transfer coefficient as a function of
instantaneous frequency, corrected the measured temperature for
convective response in the frequency domain via extensive use of
FFTs, then inverse-transformed (when appropriate) to get temporal
waveforms. Frequency response was limited to about 250 Hz. The
development program was not intended to provide temperatures in
reacting regions, so the assumption of air as the working fluid was
justified.

In  this  case,  the  compensation  may  very  well  be  a  function  of 
frequency  only.  Changes  in  local  chemistry,  and  catalytic  effects, 
were  not  addressed.  For  reacting  flows,  this  technique  would 
probably  suffer  from  the  same  problems  observed  by  Heitor  et  al.  A 
finite-difference  conduction  correction  was  applied  to  all  data.  A 
Fortran  program  to  provide  for  compensation  and  conduction 
corrections  was  developed.  The  design  and  fabrication  details  for  the 
probe,  and  initial  testing,  were  described  in  an  earlier  report  (Elmore 
et  al.,  1983). 

In  a  theoretical  investigation,  Chandran  et  al.  (1985)  used  a  two 
time  constant  model  for  a  premixed  flame  with  an  assumed  bimodal 
temperature  PDF.  A  linear  response  was  assumed  for  each  of  the  two 
parts  of  the  flow  and  time  constants  calculated  using  assumed  flow 
properties.  For  an  assumed  square  wave  input,  the  response  of  the 
system  was  modeled  and  the  calculated  mean  temperature  was 
found.  Using  only  a  single  time  constant  generated  errors  on  the 
order  of  10  percent. 

Experimental evaluation of these results in a premixed
methane-air flame was conducted with yttria-beryllia coated
thermocouples of 25 and 75 micron wire, operated simultaneously.
The junction separation was 1 mm. A cross-spectral analysis
technique was used to determine the average time constant for each
junction, assuming the same environment for both. An RC circuit was
tuned to this mean value to accomplish on-line compensation.
Comparison to coincident Rayleigh scattering results (with about 1
kHz resolution) showed reasonably good agreement for
temporally-resolved temperature, and errors on the order of 20
percent for the RMS fluctuation and 10 percent for the mean
temperature. These errors were predicted by the response model.

Ozem  and  Gouldin  (1989)  used  an  uncoated,  25  micron 
thermocouple  to  determine  temperature  statistics  in  a  turbulent, 
premixed,  methane-air  flame.  Radiation  and  conduction  were 
neglected,  and  response  compensation  was  conducted  off-line.  The 
thermocouple  output  was  amplified,  low-pass  filtered  at  20  kHz,  and 
digitized at 40 kHz for transfer to a MicroVAX II. An average time
constant  was  used,  calculated  from  a  Nusselt  number  relation  rather 
than  an  experimentally  determined  response. 

Cambray  et  al.  (1986)  attempted  to  find  instantaneous  time 
constants  using  two  chromel-alumel  thermocouples  of  differing 
diameters  in  a  buoyant  turbulent  propane  diffusion  flame.  The 
calculations  were  performed  off-line  and  were  based  on  the 
differences  in  the  response  gradients  of  the  two  thermocouples.  In 
determining  these  gradients,  however,  the  signal  had  been 
"interpolated  with  straight  segments,  and  the  second  derivative  of 
the  new  signal  has  been  interpolated  and  twice  integrated."  This 
manipulation  makes  it  uncertain  as  to  whether  the  processed  signal 
resembled  that  of  the  original.  Results  were  compared  with  those 
found  using  an  average  time  constant  and  there  was  found  to  be 
little  difference. 

The  bulk  of  the  evidence  indicates  that  time-response 
compensation  is  necessary,  at  least  in  turbulent  flows,  if  accurate, 
temporally  resolved  temperature  data  are  to  be  obtained.  The  use 
of  an  average  time  constant  does  not  appear  to  be  sufficient,  and 
using  velocity  alone  as  an  instantaneous  "corrector"  to  the  time 
constant  does  not  provide  much  improvement  as  the  response  is  a 
function  of  temperature,  velocity,  and  chemical  composition.  A  two- 
wire  technique  is  definitely  indicated,  but  frequency  domain 
compensation  loses  the  temporal  information  required  for 
coincidence  with  LDV  data  and  still  does  not  account  for  chemistry. 
An  instantaneous,  real-time  correction,  based  on  the  differential 
response  between  two  thermocouples  of  different  sizes,  is  the  most 
promising  approach. 
THEORY


Performing an energy balance on a spherical thermocouple
junction and neglecting radiation and conduction effects gives the
standard equation for response:

$$T_c = T_j + \tau \, \frac{dT_j}{dt}$$

where $T_c$ is the instantaneous gas temperature, $T_j$ is the junction
temperature, and the time constant is defined as

$$\tau = \frac{\rho_j C_j d_j^2}{6 \, k \, \mathrm{Nu}}$$

where $\rho_j$ is the density of the junction wire, $C_j$ is the
specific heat of the junction wire, $d_j$ is the junction diameter,
$k$ is the gas conductivity, and Nu is the Nusselt number. For
turbulent flows, the Nusselt correlation is taken as

$$\mathrm{Nu} = C_1 \, \mathrm{Re}^{C_2} \, \mathrm{Pr}^{C_3}$$

where Re is the Reynolds number based on wire diameter and Pr is
the gas Prandtl number. Substituting relations for Nu, Re, and Pr into
the relation for $\tau$, with $\mu$, $V$, and $\rho$ the gas viscosity,
velocity, and density appearing in Re:

$$\tau = \left[ \frac{\rho_j C_j d_j^{\,2-C_2}}{6} \right] \left[ \frac{\mu^{C_2}}{V^{C_2} \rho^{C_2} \mathrm{Pr}^{C_3} C_1 k} \right]$$

which may be grouped as

$$\tau = C_4 \, G$$

where $C_4 = C_4$(wire properties) and $G = G$(gas properties). Now,

$$T_c = T_j + C_4 \, G \, \frac{dT_j}{dt}$$

Clearly, $G$ is the only unknown in solving for $T_c$. However, for two
thermocouples in the same environment, we can let $T_{c1} = T_{c2}$ and
solve for $G$ in terms of known or measurable parameters:

$$G = \frac{T_{j2} - T_{j1}}{\left( C_4 \, \frac{dT_j}{dt} \right)_1 - \left( C_4 \, \frac{dT_j}{dt} \right)_2}$$

With a known value for $G$, the time constants can now be determined
and an instantaneous gas temperature obtained. For junction 1 this
expression would reduce to:

$$T_{c1} = T_{j1} + \frac{(T_{j2} - T_{j1}) \, (dT_j/dt)_1}{(dT_j/dt)_1 - (d_2/d_1)^{2-C_2} \, (dT_j/dt)_2}$$

when both junctions are of the same material, leaving the corrected
gas temperature a function only of the diameter ratio between the
two thermocouples, the measured temperatures and the temperature
gradients.

This is a dynamic calibration and depends upon the responses
of the two different junctions in the same environment. The only
required information is the value of the coefficient $C_2$, which has
been shown to be about 0.5 but can be experimentally verified.
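
The point-wise form of this two-junction correction can be sketched
in Python as follows. The function names, the step-response test case,
and all numerical values are hypothetical; the sketch assumes both
junctions are of the same material, so that the ratio of the two $C_4$
values reduces to $(d_2/d_1)^{2-C_2}$.

```python
import math

def junction1_time_constant(tj1, tj2, dtj1, dtj2, d2_over_d1, c2=0.5):
    """Instantaneous time constant of junction 1, tau_1 = C4_1 * G."""
    weight = d2_over_d1 ** (2.0 - c2)          # C4_2 / C4_1 for equal materials
    return (tj2 - tj1) / (dtj1 - weight * dtj2)

def gas_temperature(tj1, tj2, dtj1, dtj2, d2_over_d1, c2=0.5):
    """Gas temperature from junction 1: Tc1 = Tj1 + tau_1 * (dTj/dt)_1."""
    tau1 = junction1_time_constant(tj1, tj2, dtj1, dtj2, d2_over_d1, c2)
    return tj1 + tau1 * dtj1

def step(t, tau, t_start, t_gas):
    """Exact first-order step response and its time derivative."""
    decay = math.exp(-t / tau)
    return t_gas + (t_start - t_gas) * decay, -(t_start - t_gas) / tau * decay

# Hypothetical check: both junctions start at 300 and are exposed to gas at 1500.
t_start, t_gas = 300.0, 1500.0
tau1_true = 0.005                               # assumed, s
d2_over_d1 = 2.0                                # junction 2 twice the diameter
tau2_true = tau1_true * d2_over_d1 ** 1.5       # follows from C2 = 0.5

tj1, dtj1 = step(0.004, tau1_true, t_start, t_gas)
tj2, dtj2 = step(0.004, tau2_true, t_start, t_gas)
gas_estimate = gas_temperature(tj1, tj2, dtj1, dtj2, d2_over_d1)
tau1_estimate = junction1_time_constant(tj1, tj2, dtj1, dtj2, d2_over_d1)
```

Note that no wire density, specific heat, or gas property enters the
calculation, only the two signals, their gradients, the diameter ratio,
and $C_2$.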


MODELED  DATA 

In order to show that it is possible to obtain compensated
temperatures based on the technique just outlined, a theoretical
response curve was generated to test this technique. By letting
$T_c = T_0 \sin \omega t$ and substituting back into the relation
$T_c = T_j + \tau \, (dT_j/dt)$, an expression for the response of each
junction $T_j$ could be obtained:

$$T_j(t) = \frac{T_0 \, \tau \left[ \dfrac{\sin(\omega t)}{\tau} - \omega \cos(\omega t) + \omega \, e^{-t/\tau} \right]}{1 + (\omega \tau)^2}$$

By assuming a junction diameter ratio of 0.5 (1:2) and assigning
arbitrary values to $T_0$, $\omega$, and the other physical variables
contained within $\tau$, the response curve for each junction was
calculated. This can be seen in Fig. (1), where the curve of largest
amplitude is the actual gas temperature $T_c$ and the intermediate and
smallest curves are the response curves of junctions having diameters
in the ratio 1:2, respectively. These response curves were then fed
into a Fortran program (Appendix 2) which calculated the gradients at
each point and then found the corrected temperatures for both curves.
The results, seen in Fig. (2), showed that the corrected temperatures
for both response curves were identical and that they matched the
actual gas temperature to within 0.1 percent. Moreover, it was found
that the time constant, $\tau$, could be calculated using only values
for the response gradients, the response difference
$(T_{j2} - T_{j1})$, the diameter ratio, and the value $C_2$. The
physical properties of the junctions were not a factor as long as both
junctions were of the same material. Similar calculations were
performed for various diameter ratios giving similar positive results,
thus showing that this technique works well in theory for any given
diameter ratio.

Figure 1 - Theoretical Sinusoidal Gas Temperature With
Corresponding Response Signals.

Figure 2 - Compensated Gas Temperatures Based On
Sinusoidal Model.
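
The sinusoidal test can be reproduced in outline with the Python
sketch below, which is not the Fortran program of Appendix 2. It
evaluates the closed-form junction response for two junctions,
estimates the gradients by central differences, and applies the
two-junction correction. The amplitude, frequency, time constants, and
sampling interval are arbitrary assumed values. Samples where the two
junction temperatures nearly coincide are skipped in this sketch,
because there the numerator and denominator of the correction both
approach zero.

```python
import math

def junction_response(t, t0, omega, tau):
    """Closed-form response to Tc = t0*sin(omega*t), starting from Tj(0) = 0."""
    wt = omega * tau
    return t0 * (math.sin(omega * t) - wt * math.cos(omega * t)
                 + wt * math.exp(-t / tau)) / (1.0 + wt * wt)

T0 = 100.0                        # assumed amplitude
OMEGA = 2.0 * math.pi * 50.0      # assumed 50 Hz fluctuation, rad/s
C2 = 0.5
D2_OVER_D1 = 0.5                  # diameter ratio used in the text
WEIGHT = D2_OVER_D1 ** (2.0 - C2)
TAU1 = 0.006                      # assumed time constant of junction 1, s
TAU2 = TAU1 * WEIGHT              # same material, so tau scales as d**(2 - C2)
DT = 1.0e-5                       # assumed sampling interval, s
N = 4001

times = [i * DT for i in range(N)]
tj1 = [junction_response(t, T0, OMEGA, TAU1) for t in times]
tj2 = [junction_response(t, T0, OMEGA, TAU2) for t in times]

max_error = 0.0
checked = 0
for i in range(1, N - 1):
    if abs(tj2[i] - tj1[i]) < 1.0:
        continue                  # correction is ill-conditioned where curves cross
    g1 = (tj1[i + 1] - tj1[i - 1]) / (2.0 * DT)
    g2 = (tj2[i + 1] - tj2[i - 1]) / (2.0 * DT)
    tc = tj1[i] + (tj2[i] - tj1[i]) * g1 / (g1 - WEIGHT * g2)
    max_error = max(max_error, abs(tc - T0 * math.sin(OMEGA * times[i])))
    checked += 1
```

The recovered values track the imposed gas temperature closely for
noise-free input, in line with the behavior described above, although
the exact error level depends on the assumed sampling interval.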

A second theoretical model was established to simulate an
actual experiment in which two thermocouples of differing
diameters at some initial temperature $T_1$ were suddenly exposed to
a constant temperature environment $T_2$. The response for each
junction to such a step change looks like:

$$T_j(t) = T_2 + (T_1 - T_2) \, e^{-t/\tau}$$

For the case where $T_1 = 0.2$, $T_2 = 2.5$, and the diameter ratio is
0.5 for both the generation of theoretical responses and the
regeneration of the temperature $T_2$, such a compensation was indeed
successful. However, in order to test the sensitivity of the
compensation technique to the diameter ratio, it is possible to try
compensating with ratios somewhat different from 0.5. When this was
attempted, the technique was found to be highly sensitive to the
diameter ratio. Being off by even a small amount (e.g., 0.49 instead
of 0.5) produced numbers dramatically different from the actual gas
temperature, $T_2$. In fact, a pattern was seen in the compensation
curves as the diameter ratio was changed. When the compensation ratio
began below the correct ratio (e.g., 0.25) and was increased, the
compensation curve slowly increased and approached the correct gas
temperature in an asymptotic fashion, as seen in Fig. (3). When the
compensation ratio equaled the correct ratio, the compensation curve
gave the actual gas temperature, $T_2$. When the ratio surpassed the
correct ratio (e.g., 0.75), the compensation curve jumped from
positive to negative values in such a fashion that it began to
approach the lower response curve, as shown in Fig. (4). All this
suggests a means of calibrating a probe to the correct diameter ratio
by looking for some sort of convergence to an established gas
temperature.

Figure 3 - Results Of Compensation Using Diameter Ratios
Less Than Actual Ratio Of 0.5 (Ratios Used: .43 to .49).

Figure 4 - Results Of Compensation Using Diameter Ratios
Above Actual Ratio Of 0.5 (Ratios Used: .70 to .80).
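
The sensitivity to the assumed diameter ratio can be explored with a
short Python sketch built on the exact step responses and their exact
gradients, so that the ratio is the only source of error. The values
$T_1 = 0.2$, $T_2 = 2.5$, and the true ratio of 0.5 come from the text;
the unit time constant for junction 1, the sample times, and the
function names are assumptions chosen for illustration.

```python
import math

T1, T2 = 0.2, 2.5            # initial and final temperatures from the text
TRUE_RATIO = 0.5             # actual d2/d1
C2 = 0.5
TAU1 = 1.0                   # assumed time constant of junction 1 (arbitrary units)
TAU2 = TAU1 * TRUE_RATIO ** (2.0 - C2)

def step_response(t, tau):
    """Exact step response and its time derivative."""
    decay = math.exp(-t / tau)
    return T2 + (T1 - T2) * decay, -(T1 - T2) / tau * decay

def compensated(t, assumed_ratio):
    """Two-junction estimate of the gas temperature for an assumed d2/d1."""
    tj1, dtj1 = step_response(t, TAU1)
    tj2, dtj2 = step_response(t, TAU2)
    weight = assumed_ratio ** (2.0 - C2)
    return tj1 + (tj2 - tj1) * dtj1 / (dtj1 - weight * dtj2)

exact = compensated(0.01, 0.50)        # correct ratio
low_early = compensated(0.01, 0.49)    # ratio slightly low, early in the response
low_late = compensated(1.00, 0.49)     # same ratio, one time constant later
high_before = compensated(0.30, 0.75)  # ratio too high, just before the singular time
high_after = compensated(0.37, 0.75)   # ratio too high, just after the singular time

for label, value in [("exact", exact), ("0.49 early", low_early),
                     ("0.49 late", low_late), ("0.75 before", high_before),
                     ("0.75 after", high_after)]:
    print(f"{label:12s} {value:10.4f}")
```

Under these assumptions an underestimated ratio gives a result that
starts well below $T_2$ and converges toward it, while an overestimated
ratio makes the denominator pass through zero, so the result changes
sign abruptly. Both behaviors are consistent with the patterns
described for Figs. (3) and (4).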

Sensitivity  of  this  method  to  fluctuations  in  the  response  signal 
was  also  examined.  This  was  done  by  adding  a  small  amount  of 
random  noise  to  the  theoretical  response  curves.  Incorporating  noise 
on  the  order  of  0.01%  of  the  magnitude  of  the  signal  resulted  in 
fluctuations  in  the  response  curve  on  the  order  of  plus  or  minus  50 
percent of the actual temperature, while noise on the order of 0.1% of
the signal created very great disturbances which produced
nonphysical (negative or extremely large) values for the corrected
temperature. This suggested that in an experimental situation the
signal should have a high signal-to-noise ratio for correct
compensation to occur. It also demonstrated the sensitivity of
compensation  to  the  calculated  gradients.  If  the  gradients  used  in 
compensation  deviate  even  a  small  amount  from  their  true  value, 
large  errors  in  corrected  temperature  may  result. 
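
A rough Python sketch of this noise test is given below. It adds
uniformly distributed noise of 0.01% of the signal to the two
theoretical step responses, differentiates numerically, and compares
the RMS error of the compensated temperature with the noise-free case.
The time constants, sampling rate, noise distribution, random seed, and
evaluation window are all assumptions, so the magnitudes it produces
are not expected to match those quoted above; only the amplification
of small signal errors through the gradients is illustrated.

```python
import math
import random

T1, T2 = 0.2, 2.5
C2 = 0.5
RATIO = 0.5                        # d2/d1
WEIGHT = RATIO ** (2.0 - C2)
TAU1 = 0.020                       # assumed time constant of junction 1, s
TAU2 = TAU1 * WEIGHT
DT = 1.0e-4                        # assumed 10 kHz sampling
N = 601

def step(t, tau):
    return T2 + (T1 - T2) * math.exp(-t / tau)

def compensate(tj1, tj2, i):
    """Two-junction correction at interior sample i using central differences."""
    g1 = (tj1[i + 1] - tj1[i - 1]) / (2.0 * DT)
    g2 = (tj2[i + 1] - tj2[i - 1]) / (2.0 * DT)
    return tj1[i] + (tj2[i] - tj1[i]) * g1 / (g1 - WEIGHT * g2)

def rms_error(tj1, tj2, first, last):
    """RMS deviation of the compensated temperature from T2 over [first, last)."""
    errors = [compensate(tj1, tj2, i) - T2 for i in range(first, last)]
    return math.sqrt(sum(e * e for e in errors) / len(errors))

rng = random.Random(0)             # fixed seed so the sketch is repeatable
clean1 = [step(i * DT, TAU1) for i in range(N)]
clean2 = [step(i * DT, TAU2) for i in range(N)]
noisy1 = [x * (1.0 + 1.0e-4 * rng.uniform(-1.0, 1.0)) for x in clean1]
noisy2 = [x * (1.0 + 1.0e-4 * rng.uniform(-1.0, 1.0)) for x in clean2]

# Evaluate between one and three time constants of junction 1.
clean_rms = rms_error(clean1, clean2, 200, 600)
noisy_rms = rms_error(noisy1, noisy2, 200, 600)
```

Even in this mild window the compensated error grows by orders of
magnitude once noise is present, because the central difference divides
sample-to-sample noise by a very small time step.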


EXPERIMENTAL 

Data  for  all  experiments  was  collected  using  a  386-based 
computer  containing  a  DT2831-G  A/D  board  with  a  maximum 
throughput  rate  of  250  kHz.  The  typical  data  collection  rate  per 
channel  was  10  kHz.  Thermocouple  outputs  were  amplified  and 
filtered  using  two  Preston  Model  8300XWB  amplifiers  with  filter 
cutoffs  of  10  Hz,  500  Hz,  1  kHz,  5  kHz,  and  10  kHz.  A  gain  of  1000 
was  typically  needed  to  produce  outputs  on  the  order  of  1  volt  at 
design  temperatures. 

The thermocouple wires were made of Pt/Pt-13%Rh and were
of 1, 2, or 3 mil diameter. Smaller wires would have higher frequency
responses but have previously been shown to have unacceptable
survivability in the high temperature, high velocity flows that the
two-wire probe is intended to measure. The lead wire was 15 mils in
diameter. Probes using 1:2 and 1:3 diameter ratio thermocouples
were  constructed.  The  junctions  were  flame  welded  using  a  small 
hydrogen  torch  that  created  a  bead  which  was  typically  four  times 
the  wire  diameter  and  was  usually  not  a  perfect  sphere.  The 
smallest  beads  that  most  resembled  spheres  were  chosen  for  the 
construction  of  each  probe.  The  junction  wires  were  connected  to  the 
lead  wires  using  an  electrical  welder.  This  proved,  through  trial  and 
error,  to  give  sufficient  strength  to  withstand  flows  of  approximately 
150  ft/s.  An  attempt  was  made  to  place  the  junctions  as  close 
together  as  possible,  but  a  spatial  separation  of  up  to  several 
millimeters  often  could  not  be  avoided. 

Initial attempts at data acquisition sought to move the probe in
and out of a methane diffusion flame. This set-up, shown in Fig. (5),
used  a  200  Watt  speaker  driver  with  a  60  oz.  magnet  to  rotate  the 
probe  back  and  forth  around  a  pivot  point,  thus  exposing  the 
junctions  to  the  hot  environment  at  a  known  frequency.  This  was 
moderately  successful  at  exposure  frequencies  smaller  than  20  Hz. 
However,  anything  higher  would  either  not  provide  enough 
amplitude  to  move  the  junction  in  and  out  of  the  flame,  or  it  would 
simply  destroy  the  probe  itself  by  shaking  apart  the  junctions  along 
with  the  ceramic  insulation.  Also,  the  use  of  a  methane  diffusion 
flame  required  that  catalytic  effects  be  considered,  along  with 
possible  radiative  losses. 

The  solution  to  these  problems  involved  placing  a  dual-junction 
probe  in  a  stream  of  pulsed  hot  air  of  known  temperature  in  order  to 
provide  a  series  of  response  curves  which  could  be  used  to  find  the 
instantaneous  gas  temperature.  Figs.  (6)  and  (7)  show  details  of  this 
experiment.  Hot  air  was  provided  by  passing  compressed  air 
through  an  electrical  resistance  heater  which  typically  supplied 
values  between  300°F  and  600°F.  The  hot  flow  was  then  cycled 
using  a  rotating  steel  plate  with  a  hole  drilled  through  it.  The  test 
probe  and  the  air  supply  were  then  placed  on  either  side  of  the 
spinning  chopper  wheel  and  a  laser  and  photomultiplier  tube  (PMT) 
were used to mark the onset of exposure to the hot air each time the
hole passed in front of the probe. Typical wheel speeds, measured
with a strobe, ranged between 600 rpm and 3000 rpm.

A minimum of three channels was required for this
experiment: one for each thermocouple and one for the PMT signal.
In the future, the PMT input channel may be replaced by an LDV
input signal. Data was collected at a rate of 10 kHz, amplified
at a gain of 1000, and low-pass filtered at 10 kHz.

Fig.  (8)  shows  a  typical  response  curve  for  a  wheel  speed  of 
600  rpm  at  an  air  temperature  of  460°F.  Similarly,  Fig.  (9)  shows 
another  response  curve  at  the  same  air  temperature  for  a  wheel 
speed  of  1200  rpm.  It  should  be  noted  that  wheel  speeds  of  600 
rpm  and  1200  rpm  correspond  to  actual  driving  frequencies 
(frequencies  to  which  the  junctions  actually  respond)  of  about  220  Hz 
and  350  Hz  respectively.  These  numbers  are  based  on  an  analysis  of 
experimental  data  to  be  discussed  in  the  following  section. 
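As a quick check on the timing, the sketch below converts wheel speed into pulse repetition rate and samples per pulse period. It assumes the single hole in the chopper wheel gives one hot-air pulse per revolution; the function names are hypothetical. The 220 Hz and 350 Hz driving frequencies quoted above come from analysis of the measured responses and are not reproduced by this arithmetic.

```python
def pulse_rate_hz(rpm, holes=1):
    """Hot-air pulses per second for a wheel with the given number of holes."""
    return rpm / 60.0 * holes


def samples_per_pulse_period(rpm, sample_rate_hz=10_000.0, holes=1):
    """Samples recorded between the starts of two consecutive pulses."""
    return sample_rate_hz / pulse_rate_hz(rpm, holes)


if __name__ == "__main__":
    for rpm in (600, 1200, 3000):
        print(rpm, "rpm:", pulse_rate_hz(rpm), "pulses/s,",
              samples_per_pulse_period(rpm), "samples per period")
```

At 600 rpm and a 10 kHz sampling rate this gives 10 pulses per second and 1000 samples per pulse period.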

ANALYSIS 

The  experimental  data  used  for  compensation  testing  comes 
from  a  probe  with  a  measured  diameter  ratio  (assuming  spherical 
junctions) of approximately 0.540 (4.3:8.0 mils) which was held in a
pulsed air stream of 300°F (1 mV), 460°F (1.8 mV), and 600°F
(2.6 mV) at wheel speeds of 600 rpm and 1200 rpm.

Initially,  compensation  was  attempted  on  numerous  curves  at 
various  gas  temperatures  and  wheel  speeds  using  the  measured 
diameter  ratio  and  a  linear  approximation  technique  in  which  the 
gradients  of  the  response  curves  were  determined  by  finding  the 
slopes  between  the  points  behind  and  in  front  of  the  points  being 
compensated.  This  simple  attempt  produced  results,  shown  in  Fig. 
(13),  that  fluctuated  excessively  and  did  not  approach  any  sort  of 
physically  realistic  profile  for  the  corrected  gas  temperature.  Also, 
the  calculated  time  constant,  o,  often  was  negative  using  this  method. 
This  appears  to  be  due  to  the  inability  of  the  simple  linear  slope 
approximation  to  produce  voltage  gradients  that  are  accurate  enough 
to overcome the sensitivity of the method to such values. Another
problem was that each junction was not a perfect sphere, as the
theory assumes, so that the measured diameter ratio may have
differed from the true "effective" diameter ratio.

Figure 8 - Typical Response Curves At 600 rpm, 460F
(gain 1000, sample frequency 10 kHz).

Figure 9 - Typical Response Curves At 1200 rpm, 460F.
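The linear slope approach and the point-by-point compensation can be sketched as follows. The algebra follows the statements for G, C41 and C42 in the Appendix 2 listing, and the numerical values of RHO, CP, C2 and the diameters are copied from that listing (their units are not stated there). The Python function names and the synthetic test values are assumptions made for illustration only.

```python
def geometry_factor(rho, cp, dia, m=0.5):
    """rho*cp*D**(2-m)/4, the C41/C42 grouping used in the Appendix 2 listing."""
    return rho * cp * dia ** (2.0 - m) / 4.0


def central_slopes(v, dt):
    """Slope at each interior point from the points behind and in front of it."""
    return [(v[k + 1] - v[k - 1]) / (2.0 * dt) for k in range(1, len(v) - 1)]


def compensate_point(v1, v2, dv1, dv2, c41, c42):
    """Return (time constant 1, time constant 2, corrected value)."""
    g = (v2 - v1) / (c41 * dv1 - c42 * dv2)
    return g * c41, g * c42, v1 + g * c41 * dv1


C41 = geometry_factor(19.6, 0.6, 2.72)   # values as printed in Appendix 2
C42 = geometry_factor(19.6, 0.6, 8.0)

# Synthetic check: build gradients from known time constants, then recover them.
tau1 = 0.006
tau2 = tau1 * C42 / C41
t_gas, v1, v2 = 1.8, 1.2, 0.6
dv1, dv2 = (t_gas - v1) / tau1, (t_gas - v2) / tau2
exact = compensate_point(v1, v2, dv1, dv2, C41, C42)

# Overestimating one gradient by a factor of 2.5 flips the sign of g,
# giving negative time constants and a non-physical corrected value.
perturbed = compensate_point(v1, v2, 2.5 * dv1, dv2, C41, C42)

if __name__ == "__main__":
    print("exact gradients:    ", exact)
    print("perturbed gradient: ", perturbed)
```

With exact gradients the known gas value and time constants are recovered. With one gradient in error the denominator changes sign, which is one way a negative time constant can arise.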

An  attempted  solution  to  this  problem  was  to  take  an  ensemble 
average  of  many  curves  (50  for  a  wheel  speed  of  600  rpm  and  100 
for  1200  rpm)  in  order  to  smooth  out  the  variations  in  the  response 
signals,  and  then  to  try  using  various  diameter  ratio  values  so  that  a 
range  of  ratios  giving  feasible  solutions  could  be  developed  for  each 
temperature  and  wheel  speed.  If  these  ranges  overlapped,  a  region 
of  effective  diameter  ratios  would  be  obtained.  This  "calibration"  of 
the  thermocouple  probe  is  necessary  because  of  an  unknown  actual 
gas  temperature  profile  and  an  unknown  effective  diameter  ratio. 

The smoothing that occurs in the ensemble averaging is done to try
to isolate the 200 Hz to 400 Hz driving frequency to which the
probe  actually  responds.  The  smaller,  high-frequency  components 
are  not  of  interest  for  these  calibration  purposes  and  must  be 
removed  to  get  proper  differentiation  of  the  200  Hz  to  400  Hz 
component  when  using  a  linear  slope  approximation. 
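A minimal sketch of the ensemble averaging step is given below. It assumes that each response curve is cut from a long record starting where the PMT signal rises through a threshold, and that all curves have the same length. The report states only that the PMT marks the onset of exposure, so the threshold detection and the function names are assumptions.

```python
def rising_edges(trigger, threshold):
    """Indices where the trigger signal crosses the threshold upward."""
    return [k for k in range(1, len(trigger))
            if trigger[k - 1] < threshold <= trigger[k]]


def ensemble_average(signal, starts, length):
    """Point-by-point mean of the segments signal[s:s+length]."""
    segments = [signal[s:s + length] for s in starts
                if s + length <= len(signal)]
    if not segments:
        raise ValueError("no complete segments to average")
    return [sum(column) / len(segments) for column in zip(*segments)]


if __name__ == "__main__":
    trigger = [0, 5, 0, 0, 5, 0, 0]
    signal = [0, 1, 2, 3, 3, 4, 5]
    starts = rising_edges(trigger, 2.5)
    print(starts, ensemble_average(signal, starts, 3))
```

For uncorrelated noise, averaging N curves reduces the noise amplitude by roughly the square root of N, while the repeatable response to the driving pulse is preserved.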

A  trial-and-error  approach  of  this  type  was  tried  on  the 
ensemble  averaged  data  at  460°F  (1.8mV)  and  600  rpm.  The  results, 
as  seen  in  Fig.  (10),  seem  to  indicate  a  possible  ratio  range  of  about 
0.331  to  0.350  with  fluctuations  in  the  compensated  temperature  on 
the  order  of  10%  of  the  measured  gas  temperature.  Likewise,  at 
460°F  and  1200  rpm,  an  approximate  range  of  about  0.344  to  0.363 
was found, as seen in Fig. (11). With these results in mind,
compensation of the averaged curves for 600°F (2.6 mV) and 300°F
(1 mV) at 600 rpm was performed, producing a ratio range in both
cases of about 0.344 to 0.356. The time constants in all cases also
appeared  reasonable,  showing  values  on  the  order  of  5  to  10  msec 
for  the  smaller  junction  and  20  to  40  msec  for  the  larger  junction.  A 
calculation  for  the  time  constant  using  estimated  physical  parameters 
gives  a  value  of  about  6  msec  for  a  4  mil  bead  and  17  msec  for  an  8 
mil  bead. 
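The two estimated time constants are consistent with a power-law dependence on bead diameter. The sketch below assumes the time constant scales as D**(2 - m) with m = 0.5, the exponent that appears as C2 in the Appendix 2 listing. The function names are hypothetical, and the physical parameters behind the 6 msec estimate are not reproduced here.

```python
def scale_time_constant(tau_ref, dia_ref, dia, m=0.5):
    """Time constant at diameter dia, given tau_ref at dia_ref."""
    return tau_ref * (dia / dia_ref) ** (2.0 - m)


def time_constant_ratio(diameter_ratio, m=0.5):
    """Ratio tau_small / tau_large implied by a diameter ratio."""
    return diameter_ratio ** (2.0 - m)


def diameter_ratio_from_taus(tau_small, tau_large, m=0.5):
    """Diameter ratio implied by two time constants."""
    return (tau_small / tau_large) ** (1.0 / (2.0 - m))


if __name__ == "__main__":
    print(scale_time_constant(6.0, 4.0, 8.0))    # msec for an 8 mil bead
    print(time_constant_ratio(0.34))             # for an effective ratio of 0.34
    print(diameter_ratio_from_taus(6.0, 17.0))   # ratio implied by 6 and 17 msec
```

Scaling 6 msec from a 4 mil bead to an 8 mil bead gives about 17 msec, in line with the estimate quoted above.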

It  was  noted  in  performing  these  manipulations  that  there  was 
an  overall  trend  regarding  the  compensation  curves  and  their 
behavior when going from a low diameter ratio to a high ratio. This
trend was similar to that seen in the theoretical model in that the
compensated curve grew larger in value with increasing ratio and
eventually "flipped over" and approached the lower response curve
at high ratios. However, there was never a point of definite
convergence on one value as was seen in the model. Rather, there
was a gradual shift in the response curve as the ratio changed, thus
necessitating a range of possible solutions.

Figure 10 - Compensation On Ensemble Average Curves At
460F, 600 rpm, Diameter Ratios 0.331 to 0.350.

Figure 11 - Compensation On Ensemble Average Curves At
460F, 1200 rpm, Diameter Ratios 0.344 to 0.363.

Using  the  resulting  diameter  ratio  range,  compensation  was 
attempted  on  single,  non-averaged  response  curves.  This,  once  again, 
produced  fluctuating  and  unrealistic  results.  It  took  an  ensemble 
average  of  about  20  or  more  curves  to  suppress  the  high-frequency 
signal  fluctuations  enough  that  the  derivatives  were  sufficiently 
accurate  and  useful  results  could  be  obtained. 

Since  the  compensation  method  is  clearly  so  dependent  on  the 
calculated  gradients,  the  likely  approach  would  seem  to  be  to  use  a 
curve fit over the response curve. Second order, third order, and
high (10th) order curve fits were tried, but all were initially
unsuccessful and gave unrealistic answers. However, it was found
that a high order curve fit over the low curvature response portion
of the curve (i.e., the period of time between exposure of the junction
to the hot air and when the hot air flow is stopped) produced results
that  were  physically  reasonable  and  that  did  away  with  much  of  the 
resulting  fluctuation. 
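Obtaining gradients from a curve fit can be sketched as a least-squares polynomial fit followed by analytic differentiation of the fitted polynomial, which is what the DVDT1 and DVDT2 statements in Appendix 2 do with stored coefficients. The fitting routine below is an assumption (the report does not say how its coefficients were produced), and the function names are hypothetical. Solving the normal equations directly becomes ill-conditioned at high order, so for anything like a 10th order fit the time axis should first be rescaled to about -1 to 1.

```python
def polyfit(x, y, order):
    """Least-squares coefficients a[0..order] of y ~ sum(a[j] * x**j)."""
    n = order + 1
    # Normal equations A a = b.
    A = [[sum(xk ** (i + j) for xk in x) for j in range(n)] for i in range(n)]
    b = [sum(yk * xk ** i for xk, yk in zip(x, y)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        b[col], b[piv] = b[piv], b[col]
        for r in range(col + 1, n):
            f = A[r][col] / A[col][col]
            for c in range(col, n):
                A[r][c] -= f * A[col][c]
            b[r] -= f * b[col]
    # Back substitution.
    a = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = b[i] - sum(A[i][j] * a[j] for j in range(i + 1, n))
        a[i] = s / A[i][i]
    return a


def polyder_at(a, x):
    """Derivative of sum(a[j] * x**j) at x, evaluated by Horner's rule."""
    d = 0.0
    for j in range(len(a) - 1, 0, -1):
        d = d * x + j * a[j]
    return d


if __name__ == "__main__":
    xs = [k / 10.0 for k in range(-10, 11)]
    ys = [1.0 + 2.0 * x + 3.0 * x * x for x in xs]
    coeffs = polyfit(xs, ys, 2)
    print(coeffs, polyder_at(coeffs, 0.5))
```

The fit acts as a low-pass filter: the gradient is taken from the smooth fitted curve rather than from differences of noisy neighbouring samples.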

A  high  order  curve  fit  was  used  on  600  rpm  averaged  curves  at 
300°F,  460°F,  and  600°F  to  obtain  a  ratio  of  approximately  0.331  to 
0.338.  This  is  a  slightly  smaller  but  narrower  range  of  values  than 
that  found  using  simple  linear  slopes.  Curve  fits  were  also  done  on 
single,  non-averaged  response  curves  using  this  range  of  ratios,  and 
this  gave  smooth,  generally  realistic  results.  Fig.  (12)  shows  a  typical 
set  of  compensation  curves  using  this  range  of  ratios  for  a  single 
response  at  460°F  (1.8mV)  at  a  wheel  speed  of  600  rpm.  This  can  be 
compared  to  Fig.  (13)  which  shows  compensation  curves  for  the  same 
response  curve  without  using  a  high  order  curve  fit  to  determine  the 
gradients.  Fig.  (14)  shows  typical  results  at  a  wheel  speed  of  1200 
rpm and 460°F. Compensated temperatures of single curves at
600°F and 300°F at 600 rpm, shown in Figs. (15) and (16)
respectively, demonstrate a different behavior, but are not totally
unrealistic. These figures show the compensated gas temperature
peaking twice, which may be due to fluid mechanical effects. The
large magnitude of these peaks can be attributed to a slight amount
of high-frequency "noise" in the signal or to small imperfections in
the calculated gradients. All these results correspond to the ratio
range of about 0.331 to 0.338 and all give time constants on the
order of those predicted.

Figure 12 - Compensation On Single, Non-Averaged Curve At
460F, 600 rpm, Diameter Ratios 0.333 to 0.340
Using 10th Order Curve Fit.

Figure 13 - Compensation On Single, Non-Averaged Curve At
460F, 600 rpm, Diameter Ratios 0.333 to 0.340
Using Linear Slopes.

Figure 14 - Compensation On Single, Non-Averaged Curve At
460F, 1200 rpm, Diameter Ratios 0.333 to 0.345
Using 10th Order Curve Fit.

Figure 15 - Compensation On Single, Non-Averaged Curve At
600F, 600 rpm, Diameter Ratios 0.333 to 0.345
Using 10th Order Fit.

Figure 16 - Compensation On Single, Non-Averaged Curve At
300F, 600 rpm, Diameter Ratios 0.333 to 0.345
Using 10th Order Curve Fit.


CONCLUSION 

The  experimental  results  indicate  that  temperature 
compensation  using  a  two-wire  thermocouple  probe  is  possible.  By 
calibrating  the  probe  to  a  range  of  effective  diameter  ratios  at  one 
known  gas  temperature,  reasonable  values  for  compensated 
temperature were achieved at other gas temperatures as well. The
range of effective diameter ratios was generally found to be roughly
60 to 65 percent of the measured ratio of 0.538, that is, about 35 to
40 percent below it.

The  compensated  temperature  curves  showed  a  strong 
dependence  on  the  calculated  temperature  gradients.  When  these 
gradients  were  found  through  the  use  of  linear  approximation,  a 
meaningful  range  of  diameter  ratios,  about  0.344  to  0.356,  could  only 
be  obtained  using  an  ensemble  average  of  at  least  20  curves  which 
served  to  smooth  out  the  response  curve  by  eliminating  unwanted 
high-frequency components. When the resulting ratios were then applied
to single, non-averaged response curves, the compensated
temperature curves showed a large degree of fluctuation and did not
represent a physically realistic solution.

However,  when  the  original  curve  was  filtered  to  eliminate 
high-frequency  components  by  fitting  a  high  (10th)  order  equation, 
much  of  this  fluctuation  disappeared  and  a  slightly  smaller  but 
narrower  range  of  diameter  ratios,  about  0.331  to  0.338,  was  found. 
This  demonstrated  the  significance  of  obtaining  an  accurate 
temperature  gradient,  but  at  the  price  of  making  these  slopes  more 
complicated to find. One solution to this may be to use a lower order
curve fit over small portions of the response curves at a time.
Another possible idea would be to use a differentiating op-amp
circuit to obtain the slopes electronically, although frequency
response and stability of such a circuit would have to be considered.
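One way to realize a lower order fit over small portions of the curve is a sliding least-squares quadratic over five neighbouring samples, which is the Savitzky-Golay first-derivative filter. This is a standard technique named here as a possible option; it was not used in this work, and the window length and function name are assumptions.

```python
def local_quadratic_slope(v, dt):
    """Slope at each interior point from a least-squares quadratic fitted
    to that point and its two neighbours on either side (5 samples)."""
    slopes = []
    for k in range(2, len(v) - 2):
        s = (-2.0 * v[k - 2] - v[k - 1] + v[k + 1] + 2.0 * v[k + 2]) / (10.0 * dt)
        slopes.append(s)
    return slopes


if __name__ == "__main__":
    samples = [float(k * k) for k in range(7)]   # v = t**2 sampled at dt = 1
    print(local_quadratic_slope(samples, 1.0))
```

The weights are exact for any quadratic, and a wider window gives more smoothing at the cost of blurring rapid changes in slope.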

A  significant  problem  remains  concerning  the  unknown  nature 
of  the  actual  gas  temperature  profile.  Since  this  is  not  known,  an 
exact  solution  for  the  effective  diameter  ratio  cannot  be  obtained  and 
a  range  of  values  must  suffice.  To  avoid  this,  it  may  be  possible  to 
place  a  very  fine  wire  temperature  sensor  in  the  flow  as  a  calibration 
device  with  the  two-wire  probe.  Wires  with  sufficient  frequency 
response  to  serve  as  calibration  standards  are  commercially 
available.  While  such  wires  would  not  survive  the  high  temperature, 
high  velocity  applications  intended  for  the  two-wire  probe,  they 
could  be  used  as  references  in  less  severe  conditions  such  as 
produced  in  the  chopper-wheel  calibration  rig.  This  would  provide  a 
means  of  checking  results  with  a  known  solution  in  order  to  obtain  a 
more  precise  value  for  the  effective  diameter  ratio,  and  thus  a  more 
accurate  compensation.  Eventually,  correlation  of  this  method  with  a 
CARS  system  may  also  be  possible. 

To  conclude,  it  has  been  demonstrated  that  temperature 
compensation  based  on  the  response  of  two  different-sized 
thermocouples  appears  possible  both  in  theory  and  in  practice.  A 
method  of  calibrating  thermocouple  probes  to  an  effective  diameter 
ratio  has  been  demonstrated  and  has  been  shown  to  give  realistic 
results  across  several  different  temperatures  and  driving 
frequencies. Future work will concentrate on pinpointing the
effective diameter ratio more accurately.


Appendix 1 - The Effects Of Probe Geometry On Surrounding Flow.

The  effect  of  probe  geometry  on  the  flowfield  surrounding  the 
thermocouple  junctions  was  studied  in  anticipation  of  future 
applications  of  the  probe  to  a  high-speed  reacting  environment  such 
as  that  found  in  a  ramjet  combustor.  This  study  was  carried  out  in  a 
simple  dump,  no-swirl  combustor  using  a  two-component  LDV 
system.  The  probe  geometries  studied,  shown  in  Fig.  (17),  included 
two  pitot-tube-style  configurations  in  which  the  U-shaped  bend  was 
horizontal  (parallel)  with  the  flow  and  also  vertical  (perpendicular) 
to  the  flow.  The  third  configuration  involved  a  straight  probe 
inserted perpendicular to the flow. Results were obtained at
distances of five and ten inches from the combustor outlet, and plots
of these results are given in Figs. (18) and (19). There was no
significant  difference  between  the  three  geometries  tested,  although 
at  five  inches  the  straight  probe  appeared  to  have  a  slight  effect  by 
giving  values  up  to  15  percent  below  those  obtained  with  no  probe 
inserted in the flow.

Figure 17 - Various Probe Geometries Tested.

Figure 18 - Probe Flow Effects At 5 in In Simple Dump
Combustor (velocity in ft/s; X = 5 in, no swirl).

Figure 19 - Probe Flow Effects At 10 in In Simple Dump
Combustor (velocity in ft/s; X = 10 in, no swirl).


Appendix 2 - Fortran Program Used To Perform Compensation
Calculations

C234567
c *** Program TCALC, by Tom Scattergood, 1991
c *** This program performs compensation calculations on
c *** response signals using curve fits, linear slopes, and
c *** various other nifty options.
c
      REAL TIME1A(9000),TIME2A(9000),DVDTA(9000),
     *TIME1B(9000),TIME2B(9000),DVDTB(9000),TSEGA(500),
     *TSEGB(500),TEQL(500),ACOE1(20),ACOE2(20),BCOE1(10),BCOE2(10)
      REAL T1A,T1B,T2A,T2B,RHO1,RHO2,CP1,CP2,C2,DIA1,DIA2,C41,C42,
     *G,DVDT1,DVDT2,V1,V2,D,GNEG,GPOS,XDVDT1,XDVDT2,INC,
     *MAX,MIN,AVG1,AVG2,SIGAVG,XX,VGAS,DUMMY
      INTEGER X,CNTRA,CNTRB,Y,Z,Q,W,FLAGA,FLAGB,E,SHOW,NOPTS
      INTEGER PICK1,PICK2,PICK3,PICK4,DISC,MARK,PTSACT,POWER
      CHARACTER*12 CHAN1M,CHAN2M,TMPDAT,SLPDAT,SETUP,DATE,RANGE,
     *YORNO

c *** Initialize options, counters and running sums
      PICK1=0
      PICK2=0
      PICK3=0
      PICK4=0
      MARK=0
      DISC=0
      MAX=0
      MIN=0
      XX=0
      AVG1=0
      AVG2=0
      SIGAVG=0
      PTSACT=0
c *** Read the 11 curve fit coefficients for each channel
      OPEN(3,FILE='A10HZC3.COE')
      DO 788 POWER=1,11,1
         READ(3,*)ACOE1(POWER)
  788 CONTINUE
      DO 789 POWER=1,11,1
         READ(3,*)ACOE2(POWER)
  789 CONTINUE
      CLOSE(3)
c *** Default setup
      DATE='11/20/91'
      CHAN1M='10AVG1ML.GRA'
      CHAN2M='10AVG2ML.GRA'
      TMPDAT='10HZD.TMP'
      SLPDAT='A10HZC3.SLP'
      RANGE=' '
      YORNO=' '
      NOPTS=22
      INC=.0001
      DIA1=2.72
      DIA2=8.0

      WRITE(*,'(A14,A14,A14,A14,I6,2X,F5.4)')CHAN1M,CHAN2M,TMPDAT,
     *SLPDAT,NOPTS,INC
      WRITE(*,'(A21)')'Change current setup?'
      READ(*,'(A1)')SETUP
      IF (SETUP.EQ.'Y'.OR.SETUP.EQ.'y') GOTO 1
      GOTO 2
    1 WRITE(*,'(A1)')' '
      WRITE(*,'(A21)')'ENTER 1 MIL CHANNEL: '
      READ(*,'(A12)')CHAN1M
      WRITE(*,'(A21)')'ENTER 2 MIL CHANNEL: '
      READ(*,'(A12)')CHAN2M
      WRITE(*,'(A21)')'ENTER DATA FILENAME: '
      READ(*,'(A12)')TMPDAT
      WRITE(*,'(A28)')'ENTER POINT-SLOPE FILENAME: '
      READ(*,'(A12)')SLPDAT
      WRITE(*,'(A18)')'ENTER # DATA PTS: '
      READ(*,'(I6)')NOPTS
      WRITE(*,'(A31)')'ENTER SMALLEST TIME INCREMENT: '
      READ(*,'(F5.4)')INC
c *** NOTE: statement labels 2, 3 and 4 are not legible in the
c *** scanned listing; their positions below are assumed.
    2 WRITE(*,'(A37)')'SELECT ONE'
      WRITE(*,'(A1)')' '
      WRITE(*,'(A28)')'(1) Tact AT EVERY POINT'
      WRITE(*,'(A46)')'(2) Tact ALONG MATCHING TRANSITIONAL INTERVALS'
c *** NOTE: one further WRITE statement here is illegible in the scan.
      READ(*,'(I1)')PICK1
      WRITE(*,'(A1)')' '
      WRITE(*,'(A28)')'(1) LEAVE SIGN OF G AS IS'
      WRITE(*,'(A28)')'(2) TAKE ABSOLUTE VALUE OF G'
      READ(*,'(I1)')PICK2
      WRITE(*,'(A1)')' '
      IF (PICK1.EQ.1) GOTO 3
c *** NOTE: the field widths of the next two formats are illegible.
      WRITE(*,'(A)')'(1) USE AVERAGE SLOPE'
      WRITE(*,'(A)')'(2) USE POINT SLOPE'
      READ(*,'(I1)')PICK3
    3 WRITE(*,'(A1)')' '
      WRITE(*,'(A37)')'(1) DISCARD POINTS OF OPPOSITE SLOPES'
      WRITE(*,'(A22)')'(2) IGNORE SLOPE SIGNS'
      READ(*,'(I1)')PICK4
      WRITE(*,'(A1)')' '
      WRITE(*,'(A14)')'SET RANGE(Y/N)?'
      READ(*,'(A1)')YORNO
      IF (YORNO.EQ.'N'.OR.YORNO.EQ.'n') GOTO 4
      WRITE(*,'(A24)')'GIVE MAX VALUE(XXX.XXX):'
      READ(*,'(F7.3)')MAX
      WRITE(*,'(A24)')'GIVE MIN VALUE(YYY.YYY):'
      READ(*,'(F7.3)')MIN
      WRITE(*,'(A33)')'DISCARD PTS OUTSIDE THESE VALUES?'
      READ(*,'(A1)')RANGE
c *** NOTE: the text written by the (A40) statement is illegible in
c *** the scan; a blank line is written here instead.
    4 WRITE(*,'(A40)')' '
      WRITE(*,'(A13)')'Working...'


c *** Open the work files and write the output file header
      OPEN(1,FILE='TRACE.DAT')
      OPEN(4,FILE=CHAN1M)
      OPEN(7,FILE=CHAN2M)
      OPEN(8,FILE=TMPDAT)
      OPEN(2,FILE=SLPDAT)
      WRITE(8,'(A10,A12)')'FILENAME: ',TMPDAT
      WRITE(8,'(A6,A8)')'DATE: ',DATE
      WRITE(8,'(A14,A14,A14,A14)')'FILES USED: ',CHAN1M,
     *CHAN2M,SLPDAT
      IF (PICK1.EQ.1) WRITE(8,'(A18)')'Every point used'
      IF (PICK1.EQ.2) WRITE(8,'(A18)')'Intervals only'
      IF (PICK2.EQ.1) WRITE(8,'(A18)')'ABS(G) not used'
      IF (PICK2.EQ.2) WRITE(8,'(A18)')'ABS(G) used'
      IF (PICK3.EQ.1) WRITE(8,'(A18)')'Average slopes'
      IF (PICK3.NE.1) WRITE(8,'(A18)')'Point slopes'
      IF (PICK4.EQ.1) WRITE(8,'(A23)')'Discarded opp. slopes'
      IF (PICK4.EQ.2) WRITE(8,'(A23)')'Ignored opp. slope'
      IF (YORNO.EQ.'Y') WRITE(8,'(A13,F7.3,A13,F7.3)')'RangeMAX: ',
     *MAX,'RangeMIN: ',MIN
      IF (RANGE.EQ.'Y') WRITE(8,'(A17)')'Points discarded'
      WRITE(8,'(A6,F4.2,2X,A6,F4.2)')'DIA1: ',DIA1,'DIA2: ',DIA2
      WRITE(8,'(A25)')' '
      WRITE(8,7)'Time','a1','a2','Vcorr1','Vcorr2'
    7 FORMAT(2X,A4,9X,A2,9X,A2,6X,A6,5X,A6)
      REWIND(4)
      REWIND(7)
      REWIND(2)

c *** Physical constants and geometry factors for each junction
      RHO1=19.6
      RHO2=RHO1
      CP1=.6
      CP2=CP1
      C2=.5
      C41=(RHO1*CP1*DIA1**(2-C2))/4
      C42=(RHO2*CP2*DIA2**(2-C2))/4
      GNEG=0
      GPOS=0
      CNTRA=0
      CNTRB=0
      X=1
      IF (PICK1.EQ.1) W=NOPTS-1
      IF (PICK1.EQ.1) CNTRA=2
      IF (PICK1.EQ.1) GOTO 50
c *** Read the interval tables; a line of zeros ends each table
   10 READ(4,*) TIME1A(X),TIME2A(X),DVDTA(X)
      IF (TIME1A(X).EQ.0.AND.TIME2A(X).EQ.0) GOTO 30
      X=X+1
      CNTRA=CNTRA+1
      SHOW=CNTRA
      GOTO 10
   30 X=1
   40 READ(7,*) TIME1B(X),TIME2B(X),DVDTB(X)
      IF (TIME1B(X).EQ.0.AND.TIME2B(X).EQ.0) GOTO 50
      X=X+1
      CNTRB=CNTRB+1
      GOTO 40

c *** Main loop over the intervals of the first channel
   50 DO 100 X=1,CNTRA,1
      IF (PICK1.EQ.1) GOTO 238
      WRITE(*,59) SHOW
   59 FORMAT(I4)
      SHOW=SHOW-1
      Y=1
c *** Look for a matching interval in the second channel
   60 T1A=TIME1A(X)
      T1B=TIME1B(Y)
      IF (T1B.LT.(T1A-.001).OR.T1B.GT.(T1A+.001)) GOTO 70
      T2A=TIME2A(X)
      T2B=TIME2B(Y)
      IF (T2B.LT.(T2A-.005).OR.T2B.GT.(T2A+.005)) GOTO 70
      IF (T1A.GT.T2A) GOTO 70
      IF (T1B.GT.T2B) GOTO 70
      XDVDT1=DVDTA(X)
      XDVDT2=DVDTB(Y)
      IF (PICK4.EQ.2) GOTO 62
      IF (XDVDT1.LT.0.AND.XDVDT2.GT.0) GOTO 70
      IF (XDVDT1.GT.0.AND.XDVDT2.LT.0) GOTO 70
c *** Build the time grids of both intervals and keep common times
   62 Z=1
      DO 333 D=T1A,T2A,INC
         TSEGA(Z)=D
         FLAGA=Z
         Z=Z+1
  333 CONTINUE
      Z=1
      DO 334 D=T1B,T2B,INC
         TSEGB(Z)=D
         FLAGB=Z
         Z=Z+1
  334 CONTINUE
      W=0
      DO 337 Z=1,FLAGA,1
         DO 335 Q=1,FLAGB,1
            DIFF=ABS(TSEGA(Z)-TSEGB(Q))
            IF (DIFF.LT..0001) GOTO 336
  335    CONTINUE
         GOTO 337
  336    W=W+1
         TEQL(W)=TSEGA(Z)
  337 CONTINUE
c *** Compensate each selected point
  238 DO 340 E=1,W,1
      WRITE(*,'(I4)') E
  338 READ(2,*) XTIME,V1,V2,DVDT1,DVDT2
      XT=XTIME
      IF (PICK3.EQ.1) DVDT1=XDVDT1
      IF (PICK3.EQ.1) DVDT2=XDVDT2
      IF (PICK1.EQ.1) GOTO 339
      DIFF=ABS(TEQL(E)-XTIME)
      IF (DIFF.LT..0001) GOTO 339
      GOTO 338
  339 IF (DVDT1.EQ.0.AND.DVDT2.EQ.0) GOTO 340
      DVDT1=0
      DVDT2=0
c *** Gradients from the derivative of the 10th order curve fit
c *** (the loop repeats the same evaluation, as in the listing)
      DO 434 POWER=1,11,1
      DVDT1=ACOE1(2)+(2*ACOE1(3)*XT)+(3*ACOE1(4)*XT**2)+
     *(4*ACOE1(5)*XT**3)+(5*ACOE1(6)*XT**4)+(6*ACOE1(7)*XT**5)+
     *(7*ACOE1(8)*XT**6)+(8*ACOE1(9)*XT**7)+(9*ACOE1(10)*XT**8)+
     *(10*ACOE1(11)*XT**9)
      DVDT2=ACOE2(2)+(2*ACOE2(3)*XT)+(3*ACOE2(4)*XT**2)+
     *(4*ACOE2(5)*XT**3)+(5*ACOE2(6)*XT**4)+(6*ACOE2(7)*XT**5)+
     *(7*ACOE2(8)*XT**6)+(8*ACOE2(9)*XT**7)+(9*ACOE2(10)*XT**8)+
     *(10*ACOE2(11)*XT**9)
  434 CONTINUE
      WRITE(1,'(F6.4,2X,F9.5,3X,F9.5)') XT,DVDT1,DVDT2
c *** Common factor G; the time constants are G*C41 and G*C42
      G=(V2-V1)/((C41*DVDT1)-(C42*DVDT2))
      IF (PICK2.EQ.2) G=ABS(G)
      CONST1=C41*G*DVDT1
      CONST2=C42*G*DVDT2
      TCORR1=CONST1+V1
      TCORR2=CONST2+V2
      IF (DVDT1.LT.0.AND.DVDT2.GT.0.AND.PICK4.EQ.1) GOTO 340
      IF (DVDT1.GT.0.AND.DVDT2.LT.0.AND.PICK4.EQ.1) GOTO 340
      MARK=0
      IF (TCORR1.GT.MAX.OR.TCORR1.LT.MIN) MARK=1
      IF (TCORR2.GT.MAX.OR.TCORR2.LT.MIN) MARK=1
      IF (MARK.EQ.1.AND.YORNO.EQ.'Y') DISC=DISC+1
      IF (MARK.EQ.1.AND.RANGE.EQ.'Y') GOTO 340
      IF (G.LT.0) GNEG=GNEG+1
      IF (G.GT.0) GPOS=GPOS+1
      WRITE(8,777) XTIME,(G*C41),(G*C42),TCORR1,TCORR2
  777 FORMAT(F7.4,3X,F9.5,2X,F9.5,2X,F9.5,2X,F9.5)
      PTSACT=PTSACT+1
      AVG1=AVG1+TCORR1
      AVG2=AVG2+TCORR2
  340 CONTINUE
      IF (PICK1.EQ.1) GOTO 887
      GOTO 100
   70 Y=Y+1
      IF (Y.GT.CNTRB) GOTO 100
      GOTO 60
  100 CONTINUE

c *** Summary of signs of G, discarded points and averages
  887 WRITE(*,888)'-',GNEG,'+',GPOS
  888 FORMAT(A1,F6.1,2X,A1,F6.1)
      WRITE(*,889)'There were ',DISC,' points out of range.'
      WRITE(8,889)'There were ',DISC,' points out of range.'
  889 FORMAT(A11,I4,A21)
      AVG1=AVG1/PTSACT
      AVG2=AVG2/PTSACT
      SIGAVG=SIGAVG/PTSACT
      WRITE(*,890)'Signal 1 average was ',AVG1
      WRITE(8,890)'Signal 1 average was ',AVG1
      WRITE(*,890)'Signal 2 average was ',AVG2
      WRITE(8,890)'Signal 2 average was ',AVG2
      WRITE(*,890)'Gas average was ',SIGAVG
      WRITE(8,890)'Gas average was ',SIGAVG
  890 FORMAT(A21,F7.3)

      END


Abstract

A phase/Doppler particle analyzer is used to measure the size and
velocity distributions of the droplets generated by the disintegration of a
cylindrical liquid jet. This type of liquid jet breakup is commonly called
Rayleigh breakup. Metered liquid flow rates agree with the rates computed
from the droplet measurements made with the phase/Doppler particle
analyzer.  The  maximum  entropy  principle  is  used  to  predict  the  droplet  size 
and  velocity  distributions.  The  constraints  imposed  in  this  model  involve 
conservation  of  mass,  momentum,  surface  energy,  and  kinetic  energy. 
Agreement  between  measurements  and  predictions  is  very  good. 

Introduction

The  process  of  producing  droplets  by  the  breakup  of  a  cylindrical  liquid 
jet  is  commonly  called  Rayleigh  Breakup  (1879,1882).  The  use  of  a  stimulated 
mechanism  (forcing  function)  has  been  widely  employed  by  many  researchers 
to  obtain  uniform  droplets.  Such  droplets  are  essential  in  calibrating  spray 
sizing  instruments  and  in  studying  the  fundamental  aspects  of  droplet 
combustion. For a stream of droplets with uniform size and velocity, the only
mathematical representation of droplet distribution is a coupled delta
function δ(D-D0)δ(U-U0), where D0 denotes the uniform droplet size and U0
denotes the uniform droplet velocity. This distribution is not generally
measured, even when experiments are carefully conducted in a vibration-isolated
environment. Unforced disturbances and complicated nonlinear breakup
mechanisms normally produce a stream of droplets deviating slightly from
the ideal distribution. At times such a jet, as a result of nonlinear instability, also has a
propensity  to  produce  relatively  small  "satellite"  droplets  interspersed  among 
the  main  droplets  (Bogy,  1979).  The  existence  and  behavior  of  these  satellite 
droplets  depends  on  the  initial  flow  conditions  and  the  physical  properties  of 
the  liquid  jet.  The  stream  of  droplets  with  satellites  must  be  described  by  a 
bimodal  distribution  function.  In  this  study,  we  will  not  consider  flows  where 
satellite droplets exist. To control the production of satellite droplets, which is
important for jet printer operation, several studies have been made to
understand the formation mechanism at various stimulating disturbances
(Goedde & Yuen, 1970; Pimbley & Lee, 1977; Chaudhary & Maxworthy, 1980;
and Bousfield et al., 1990, among others). For example, Pimbley and Lee (1977) obtained a
map  showing  the  behavior  of  satellite  droplets  under  different  stimulating 
conditions.  There  is  a  region  in  this  map  where  the  satellite  droplet  can  be 
totally eliminated. In this study, the measurements recorded by a
phase/Doppler particle analyzer (PDPA) of an unforced cylindrical jet (as well
as  photographs)  show  that  there  are  no  satellite  droplets  produced  over  the 
range  of  operation  under  consideration. 

With  a  PDPA,  it  is  possible  to  measure  both  the  droplet  velocity  and  size. 
The  PDPA  is  a  single  point  scattering  technique,  making  a  measurement  of 
each  droplet  as  it  passes  through  a  small  probe  volume.  In  these 
measurements  the  attempts  and  validations  are  recorded.  It  is  important  that 
the acceptance ratio (validations/attempts) be close to one; otherwise, the
cause  of  the  rejections  should  be  determined.  In  many  complicated  sprays 
(hollow  cone,  swirl  atomizers),  an  acceptance  rate  of  60%  or  less,  over 
portions  of  the  spray,  is  often  obtained.  The  advantages  of  using  a  small 
diameter  cylindrical  jet,  from  an  experimentalist's  point  of  view,  are  twofold: 
(1) the acceptance rate in the PDPA measurements is very high,
approximately 100%; and (2) the probe area is not a variable in the problem.

The  concept  of  information  entropy  was  developed  by  Claude  Shannon 
(1948), and Jaynes (1957) later extended this concept into the now well-known
method of maximum entropy formalism. This formalism can be applied to
problems  which  involve  probability,  i.e.,  where  insufficient  information  is 
available  to  obtain  exact  solutions.  Tribus  (1961)  used  the  principle  of  this 
formalism  in  thermodynamics  and  showed  that  the  concepts  of  heat  and 
temperature  in  thermodynamics  could  be  defined  through  the  formalism  of 
maximum  entropy.  Haken's  book  on  synergetics  (1978)  established  links 
between  dynamical  systems  theory  and  statistical  theory  with  information 
theory  as  its  basis.  This  maximum  entropy  formalism  allows  one  to  determine 
the  probability  distribution  functions  for  complex  systems  in  physics, 
chemistry,  biology,  as  well  as  in  many  other  disciplines  by  measuring  relatively 
few  average  (macroscopic)  quantities.  In  the  present  study,  we  will  confine  our 
attention  to  its  application  to  liquid  sprays  in  order  to  predict  the  droplet  size 
and  velocity  distributions  in  sprays.  Since  the  application  to  these  kinds  of 
problems  has  been  adequately  discussed  by  several  researchers  -  Kelly  (1976), 
Sellens  &  Brzustowski  (1985, 1986)  and  Li  &  Tankin  (1987, 1988,  1989),  it  will  not 
be  necessary  to  develop  the  background  materials  once  again. 

The primary purpose of this study is to compare the theoretical
predictions based on the maximum entropy principle concerning droplet size and
velocity distributions with the experimental results. In the process of making
these comparisons, estimates will be made of the source terms that appear in
the conservation equations. From a computational point of view, there is an
advantage in using the cylindrical liquid jet instead of a hollow or solid cone
spray: reasonably accurate estimates can be made of the source terms associated
with these constraints. The pattern of droplets formed after breakup in the
Rayleigh problem is different from the regular hollow cone or solid cone
spray system. Therefore, the problem is formulated differently than in
previous analyses.

II. Experiment


A  critical  component  in  this  experiment  is  the  hole  that  forms  the  orifice 
plate.  In  these  experiments  the  hole  is  formed  by  an  electroforming  process 
that  is  commonly  used  to  make  bimetal  masks  for  photolithography.  With 
this  process  precise  openings  of  various  shapes  (in  this  case,  circular)  and  sizes 
(in  this  case,  50  microns)  can  be  made  in  a  thin  metal  plate  such  as  nickel.  The 
nickel  metal  is  bonded  to  a  layer  of  another  metal  for  strength.  This  technique 
is  used  to  construct  contact  exposure  masks  in  the  production  of  printed 
circuit boards (see Dressler & Kraemer, 1990).

The  liquid  used  in  this  study  is  distilled  water  with  the  following 
additives  to  prevent  corrosion  at  the  orifice:  ethylene  glycol  1%,  sodium 
nitrite  0.1%  ,  borax  0.1%  and  traces  of  bacterial  inhibitors.  The  liquid  is  forced 
through  the  orifice  by  applying  a  constant  air  pressure  to  a  liquid  reservoir 
(see  Figure  1).  By  changing  the  air  pressure,  the  flow  rate  is  changed.  The 
water  exiting  the  orifice  forms  a  cylindrical  jet  that  is  approximately  50 
microns in diameter. This cylindrical jet breaks up into droplets. It is desirable
to measure the droplets as close as possible to the breakup region, thus
reducing aerodynamic effects (drag), but beyond the point where the droplets
are still oscillating (non-spherical). Figure 2 is a typical photograph that was taken
through a microscope using a strobe as a light source (100 microsecond flash).
In this photograph, the droplets as well as a fringe pattern are seen. The fringe
pattern, which marks where the measurements were made, is due to the laser beams used to
determine simultaneously the droplet velocity and size. It can be seen from
this photograph that the droplets are spherical at the point where the
measurements were taken.

Droplets are sized with an Aerometrics Inc. two-color, four-beam
PDPA, although only one color (green, 5145 Å) was used in these
experiments. This instrument has a probe volume of 0.002 mm³. It sizes a
droplet by measuring its radius of curvature. The instrument is configured
like a standard laser Doppler anemometer; the beam pair yields a
measurement of the axial component of the droplet velocity. 5,000 droplets
are measured in each test run. Collection times for this many samples are of
the order of four seconds. Sample sizes as large as 10,000 droplets were taken
before determining that 5,000 samples are more than sufficient to yield stable
mean and RMS statistics.

One  could  have  used  microscopic  photography  as  shown  in  Figure  2  and 
with  double  exposures  and  multiple  pictures  obtained  the  size  and  velocity 
distributions.  There  are  two  reasons  for  using  the  PDPA  unit.  First,  in  an 
earlier  study  we  examined  a  complex  spray  from  a  pressure  atomizer  and  found 
the  metered  flow  rate  was  an  order  of  magnitude  greater  than  the  flow  rate 
obtained  from  the  PDPA  measurements.  Thus  we  decided  to  use  a  simple  spray 
system  -  a  jet  that  breaks  up  into  a  single  column  of  drops  -  to  test  the  accuracy 
of  our  system.  The  PDPA  measurements  agreed  with  the  metered  flow  rate  as 
well  as  with  the  sizes  estimated  from  photographs.  This  will  be  discussed  later. 
Second,  when  the  distributions  are  relatively  broad  (as  they  were  in  some 
cases),  analysis  of  many  such  photographs  (in  the  hundreds  or  thousands) 
would  be  required  to  obtain  accurate  velocity  and  size  distributions.  This  would 
have  been  tedious  and  time  consuming. 

Several  constraints  inherent  in  the  PDPA  must  be  addressed.  First,  the 
measured  droplets  must  be  spherical.  This  requires  that  the  measurement 
station  be  sufficiently  far  from  the  break  up  region  so  that  droplets  are  not 
oscillating  (non  spherical).  Second,  the  PDPA  has  a  dead  time  of  16 
microseconds  associated  with  each  measured  droplet.  Another  droplet 
entering  the  probe  volume  during  this  dead  time  will  not  be  measured  and 
may  prevent  the  measurement  of  the  first  droplet.  In  these  experiments,  the 
stream  of  droplets  is  not  dense;  thus,  there  is  little  probability  of  having  a 
droplet  entering  the  probe  volume  during  this  dead  time.  These  two 
considerations influence the percentage of valid signals versus the total
attempts. That is, the PDPA attempts to process all Doppler signals. It
performs checks on the quality of each signal and rejects those which exceed
certain limits. In these experiments, seven different flow rates were
analyzed; the valid signals are almost 100% of the signals collected. For
example, in one test there were 5022 attempts and 5022 validations, yielding an
acceptance ratio of 100%; in the experiment having the worst acceptance ratio
there were 5492 attempts and 5004 validations, yielding an acceptance ratio of
91%.
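The acceptance ratios quoted above are plain quotients of validations over attempts. A minimal Python sketch reproduces the two quoted runs; the function name `acceptance_ratio` is an illustrative assumption and is not part of any PDPA software.

```python
def acceptance_ratio(validations: int, attempts: int) -> float:
    """Return validations/attempts for one PDPA run (illustrative helper)."""
    if attempts <= 0:
        raise ValueError("attempts must be positive")
    if not 0 <= validations <= attempts:
        raise ValueError("validations must lie between 0 and attempts")
    return validations / attempts


# The two runs quoted in the text: best and worst acceptance.
best = acceptance_ratio(5022, 5022)
worst = acceptance_ratio(5004, 5492)
print(f"best = {best:.3f}, worst = {worst:.3f}")
```

The worst case evaluates to roughly 0.911, which is the 91% figure reported.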


The  experimental  data  for  size  distribution  is  normalized  by  the  mass 
mean  diameter,  D30.  This  value  is  determined  from  PDPA  measurements. 
The  droplet  velocity  is  normalized  by  the  axial  velocity  at  the  orifice,  Ujet. 
This  value  is  determined  by  dividing  the  measured  liquid  flow  rate  by  the 
cross sectional area of the cylindrical jet issuing from the nozzle (50 microns
diameter). The diameter of the cylindrical jet was verified by magnifying the
jet  with  a  microscope  and  projecting  an  image  onto  a  screen  -  from  which 
measurements  could  be  made.  Also,  direct  examination  of  the  cylindrical  jet 
with  a  microscope  with  a  built  in  measuring  reticle  yielded  the  same  result. 
The  cylindrical  jet  diameter  is  50  microns. 
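As a rough illustration of how Ujet follows from the flow rate and the jet cross-section, the sketch below assumes the flow rate is available as a volumetric rate in SI units (a mass rate would first be divided by the liquid density). The function name and the example flow rate are hypothetical; only the 50 micron diameter comes from the text.

```python
import math


def jet_velocity(volume_flow_m3_s: float, jet_diameter_m: float) -> float:
    """Mean axial velocity at the orifice: flow rate / jet cross-sectional area."""
    area = math.pi * jet_diameter_m ** 2 / 4.0
    return volume_flow_m3_s / area


D_JET = 50e-6        # jet diameter from the text, in metres
q_example = 5.0e-9   # m^3/s, assumed value for illustration only
u_jet = jet_velocity(q_example, D_JET)
print(f"Ujet = {u_jet:.2f} m/s")
```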

III. Theory

As  stated,  detailed  formulation  and  derivation  of  the  maximum  entropy 
principle  have  been  published  (Li  and  Tankin,  1989),  and  will  not  be 
reiterated here. However, there are two significant modifications: (1) Instead
of combining the kinetic energy and the surface energy into one energy
constraint, these two energy constraints are separated. The need for this will
be described later. (2) The control volume chosen here is the region of liquid
jet between the orifice plate (nozzle) and the location downstream where the
droplets were measured by the PDPA. This means that the distribution obtained
from this formulation is the droplet distribution at the measurement location,
not at the breakup region of the liquid jet normally used by Li and
Tankin (1989). The reason we chose the measuring position as the
downstream boundary is that, later, we will make use of some measured
mean values to estimate the source terms. In addition, the aerodynamic
effects on the droplet velocities are negligible.

A schematic drawing of the breakup of a liquid jet and its control
volume is shown in Figure 3. The control volume begins at the outlet of the
orifice (nozzle) and extends far enough downstream so that all of the droplet
oscillations have damped out. The downstream boundary is the location
where the measurements were made. The liquid jet which enters the control
volume through surface 1 has a diameter Djet, velocity Ujet, density ρ, and
surface tension σ. The droplets which emerge from the control volume
through surface 2 have diameter D, velocity U, and arrive at a frequency ṅ
(number/time). The surface tension and density are assumed to be
unchanged by the breakup.

By applying the maximum entropy formulation, the most probable droplet
distribution due to the breakup of a cylindrical liquid jet is obtained based on
the well known conservation laws involving a few physical quantities. In this
study, these physical quantities include mass flux, axial momentum flux,
kinetic energy flux and surface energy flux, which are conserved in the control
volume. Due to the existence of individual droplets crossing surface 2, the
mass, momentum and energy leaving the control volume are not constant
but occur in a temporally oscillating manner whose frequency depends on ṅ.
Thus, these quantities (mass, momentum, kinetic energy,
and surface energy) within the control volume change with time. To maintain a
steady state formulation, the conservation equations employed here are
derived on a time-averaged basis.

IIIa. Conservation of mass flux, momentum flux and energy fluxes

If the droplets are produced in a saturated air environment, there is no
mass flux loss or gain during the breakup process, and the mass flow rate at
surface 1 and surface 2 should be equal on a time-averaged basis. That is,

$$\sum_i \sum_j P_{ij}\,\frac{\pi}{6}\,\rho D_i^3\,\dot{n} = \dot{m} = \frac{\pi}{4}\,\rho D_{jet}^2 U_{jet}$$   Eq. (1)

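Similarly, a balance in the momentum and energy fluxes of the liquid jet and
droplets yields

momentum: $$\frac{\pi}{6}\,\rho \sum_i \sum_j P_{ij} D_i^3 U_j\,\dot{n} = \dot{m}\,U_{jet} + S_{mv} = \frac{\pi}{4}\,\rho D_{jet}^2 U_{jet}^2 + S_{mv}$$   Eq. (2)

kinetic energy: $$\frac{\pi}{12}\,\rho \sum_i \sum_j P_{ij} D_i^3 U_j^2\,\dot{n} = \frac{1}{2}\,\dot{m}\,U_{jet}^2 + S_{ke} = \frac{1}{2}\,\frac{\pi}{4}\,\rho D_{jet}^2 U_{jet}^3 + S_{ke}$$   Eq. (3)

surface energy: $$\pi\,\sigma \sum_i \sum_j P_{ij} D_i^2\,\dot{n} = \pi\,\sigma D_{jet} U_{jet} + S_{se}$$   Eq. (4)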

where Pij is the joint probability of finding a droplet with diameter Di and
velocity Uj. Smv, Ske and Sse are the momentum, kinetic energy and surface
energy source terms, respectively. They include the effects of transport
between the two phases (liquid and air) within the control volume. These three
source terms can be positive, zero or negative. In addition, there is the
requirement that the sum of the joint probabilities equal unity. That is,


normalization: $$\sum_i \sum_j P_{ij} = 1$$   Eq. (5)

The  droplet  size  and  the  droplet  velocity  are  nondimensionalized  by  D30 
(mass  mean  diameter)  and  Ujet  (initial  jet  velocity)  respectively.  Thus,  the 
five  constraints  expressed  in  integral  form  are  as  follows  : 

Normalization: $$\iint f\,dD\,dU = 1$$   Eq. (6)

Mass conservation: $$\iint f D^3\,dD\,dU = 1$$   Eq. (7)

Momentum conservation: $$\iint f D^3 U\,dD\,dU = 1 + S_{mv}$$   Eq. (8)

Kinetic energy conservation: $$\iint f D^3 U^2\,dD\,dU = 1 + S_{ke}$$   Eq. (9)

Surface energy conservation: $$B \iint f D^2\,dD\,dU = \frac{2B}{3 D_{jet}} + S_{se}$$   Eq. (10)

where f, the continuous joint probability density function (PDF) corresponding to Pij, can be
obtained by maximizing Shannon's entropy subject to the above five
constraints. Thus, f will have the following form:

$$f = 3 D^2 \exp\{-\alpha_0 - \alpha_1 D^3 - \alpha_2 D^3 U - \alpha_3 D^3 U^2 - \alpha_4 B D^2\}$$   Eq. (11)

where

$$B = \frac{12}{We}, \qquad We = \frac{\rho\,U_{jet}^2\,D_{30}}{\sigma}$$
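A minimal Python sketch of how the Weber number, the coefficient B = 12/We, and the joint PDF of Eq. (11) could be evaluated is given below. The function names, the water properties, and the D30 value are assumptions chosen for illustration; only Ujet = 2.76 m/sec is a value quoted in this report, and the multipliers passed to `joint_pdf` are placeholders rather than fitted values.

```python
import math


def weber_number(rho: float, u_jet: float, d30: float, sigma: float) -> float:
    """We = rho * Ujet^2 * D30 / sigma (SI units)."""
    return rho * u_jet ** 2 * d30 / sigma


def joint_pdf(d: float, u: float, alphas, b: float) -> float:
    """Eq. (11): f = 3 D^2 exp(-a0 - a1 D^3 - a2 D^3 U - a3 D^3 U^2 - a4 B D^2).

    d and u are dimensionless (D/D30 and U/Ujet); alphas = (a0, a1, a2, a3, a4).
    """
    a0, a1, a2, a3, a4 = alphas
    d3 = d ** 3
    exponent = -a0 - a1 * d3 - a2 * d3 * u - a3 * d3 * u ** 2 - a4 * b * d ** 2
    return 3.0 * d ** 2 * math.exp(exponent)


# Assumed water properties and an assumed D30 (not values from the report).
we = weber_number(rho=998.0, u_jet=2.76, d30=95e-6, sigma=0.072)
b = 12.0 / we
print(f"We = {we:.2f}, B = {b:.3f}")
```

In practice the five multipliers are found by solving the constraints of Eqs. (6) to (10) simultaneously; that nonlinear solve is not shown here.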

The formulation in the present study is slightly different from that of Li
and Tankin (1989). The previous formulation conserves the combined energy
flux (kinetic energy flux plus surface energy flux) as one single constraint
instead of two separate constraints. The shortcoming of the previous formulation
is that no information is provided as to how the total energy source is
distributed between the kinetic energy and the surface energy: any
combination with the same total energy source will result in the same probability
density function. However, the kinetic energy source term primarily affects
the droplet velocity distribution and the surface energy source term primarily
affects the droplet size distribution. The difference between the distributions
based on these two formulations will be clearer when the calculated results
are presented later.

IIIb. Estimates of source terms

Due to the complexity of the breakup mechanism for a spray system,
normally the source terms (Smv, Ske, Sse) can be roughly estimated
according to some simple physical phenomena (e.g. Li et al., 1990). In the
present study, the disintegration of the liquid cylinder results in a spray of
droplets of nearly uniform size and uniform velocity. This implies that the
distributions are very steep and narrow. Under such circumstances, the results
obtained from this formulation are extremely sensitive to the source terms
assigned. A minor variation of a source term (less than 1%) could change the
droplet distributions dramatically. This situation requires an accurate
estimate of the source terms for this problem to yield reliable results.

Estimating these source terms accurately requires the use of some
average quantities that are obtained from experimental measurements, such
as mass mean diameter, mean velocity, etc. The relationships between source
terms and experimental data are derived as follows:

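$$S_{mv} = \iint f D^3 U\,dD\,dU - 1 = D_{30}^3 \iint f U\,dD\,dU - 1 = U_m D_{30}^3 - 1 = U_m - 1$$   Eq. (12)

$$S_{ke} = \iint f D^3 U^2\,dD\,dU - 1 = D_{30}^3 \iint f U^2\,dD\,dU - 1$$
$$\quad = D_{30}^3 \iint f (U - U_m)^2\,dD\,dU - 1 + D_{30}^3 \iint f U_m^2\,dD\,dU$$
$$\quad = D_{30}^3 (U_{rms}^2 + U_m^2) - 1 = (U_{rms}^2 + U_m^2) - 1$$   Eq. (13)

$$S_{se} = B \iint f D^2\,dD\,dU - \frac{2B}{3 D_{jet}} = \frac{B\,D_{30}^3}{D_{32}} - \frac{2B}{3 D_{jet}} = \frac{B}{D_{32}} - \frac{2B}{3 D_{jet}}$$   Eq. (14)

where D30 is the mass mean diameter (equal to one) from experiments,
Um is the mean droplet velocity from experiments,
Urms is the root mean square of droplet velocity fluctuations from
experiments,
D32 is the Sauter mean diameter from experiments,
Djet is the diameter of the cylindrical jet at the orifice, and
2B/(3 Djet) is the surface energy flux of the cylindrical jet at the orifice.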

In making these estimates of Smv and Ske, it was assumed that all the
droplets have the same size, so that D30 can be extracted from the integrals.
That is, the velocity and size are uncoupled. This will be seen to be a
reasonable assumption for the sprays under consideration.
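The estimates above reduce to a few arithmetic steps on the measured mean quantities. The sketch below is one hedged way to code them; the function name is hypothetical, all inputs are assumed to be dimensionless (velocities divided by Ujet, diameters divided by D30), and the surface energy term is obtained by rearranging Eq. (10) with the identity that the integral of f D^2 equals D30^3/D32, which is an inference from the listed definitions.

```python
def source_terms(u_m: float, u_rms: float, d32: float, d_jet: float, b: float):
    """Estimate (Smv, Ske, Sse) from dimensionless mean quantities.

    u_m, u_rms : mean and RMS droplet velocity, normalized by Ujet
    d32, d_jet : Sauter mean diameter and jet diameter, normalized by D30
    b          : 12 / We
    """
    s_mv = u_m - 1.0                           # Eq. (12)
    s_ke = (u_rms ** 2 + u_m ** 2) - 1.0       # Eq. (13)
    s_se = b / d32 - 2.0 * b / (3.0 * d_jet)   # Eq. (10) rearranged, D20^2 = D30^3 / D32
    return s_mv, s_ke, s_se


# Placeholder inputs for illustration only (not measured values).
print(source_terms(u_m=0.95, u_rms=0.02, d32=1.0, d_jet=0.53, b=1.2))
```

With a jet diameter smaller than two thirds of D30, the surface energy term comes out negative, reflecting the reduction in surface area when the cylinder breaks into drops.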

IV. Results and Discussion

In this section, two sets of experimental data are chosen to illustrate
the corresponding distribution functions for droplet size and droplet
velocity. In one set of data, the distributions are very narrow, almost delta
functions; in the other set of data, the distributions are broad. We have
selected these two sets of data because they represent the two extremes of
the seven sets of data collected. Figures 4 and 5 show these two sets of
experimental data; one set has an acceptance ratio of 1.0 (highest) and the
other an acceptance ratio of 0.91 (lowest). The narrow distributions, as in
Figure 4, occur over a very restricted range of nozzle exit velocities, Ujet. It
was only with great care (remember these were not externally stimulated jets)
that these data were obtained. The nozzle exit velocity (Ujet), based on a jet
diameter of 50 μm, equals 2.76 m/sec in Figure 4. Changing Ujet (either
increasing or decreasing) results in broader distributions. For example, the
distributions at Ujet = 2.32 m/sec are shown in Figure 5. Similar results (but
not quite as broad as those in Figure 5) occurred when Ujet = 3.64 m/sec.
Intermediate widths of the distributions occur when Ujet lies between 2.76 and
3.64 m/sec or between 2.32 and 2.76 m/sec.

For each set of experiments there are two distribution curves: one is
number of droplets vs. droplet diameter (4a and 5a) and the other is
number of droplets vs. droplet velocity (4b and 5b). Accompanying these two
plots are the experimental parameters and measured mean values of the droplets.
D30, D32, Um, and Urms are used in estimating the source terms. It was
assumed in these initial estimates (Equations 12, 13 and 14) that the droplet
size is independent of droplet velocity. This assumption needs justification.

The assumption used in estimating Smv and Ske can be verified from the
experimental data recorded by the PDPA. If one examines the raw data (for
example, for the test shown in Figure 5b), one can plot the mean diameter
(D30) and its RMS value for each velocity bin. This is shown in Figure 6a for
velocity bins having more than 10 drops/bin. The mean diameter is nearly
independent of velocity, and the RMS of the drop diameter in each velocity
bin is small compared to the mean diameter (1). The largest deviation of D30
from 1 occurs at velocity bins (U/Ujet) greater than 0.35; but even in these
bins D/D30 is greater than 0.97 and the RMS is less than 0.15. In addition,
the number of drops in the velocity bins greater than 0.35 amounts to only
10% of the total drops measured. Thus the assumption that the droplet
diameter is independent of velocity is valid. Therefore one would expect the
estimated source terms to be reasonably accurate.
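The per-bin check described above can be sketched as follows. This is an illustrative reconstruction, not the PDPA software's own procedure: the function name, the fixed bin width, and the input layout (two parallel lists of normalized diameters and velocities) are assumptions.

```python
import math
from collections import defaultdict


def diameter_stats_by_velocity_bin(diameters, velocities, bin_width, min_count=10):
    """Group droplets into velocity bins; return {bin_index: (count, D30_bin, rms_D)}.

    Only bins holding more than `min_count` droplets are kept, as in the text.
    """
    bins = defaultdict(list)
    for d, u in zip(diameters, velocities):
        bins[math.floor(u / bin_width)].append(d)

    stats = {}
    for index, ds in sorted(bins.items()):
        if len(ds) <= min_count:
            continue
        mean_d = sum(ds) / len(ds)
        # Volume (mass) mean diameter of the droplets in this bin.
        d30 = (sum(d ** 3 for d in ds) / len(ds)) ** (1.0 / 3.0)
        # RMS deviation of the diameter about its arithmetic mean.
        rms = math.sqrt(sum((d - mean_d) ** 2 for d in ds) / len(ds))
        stats[index] = (len(ds), d30, rms)
    return stats
```

Swapping the roles of the two input lists gives the complementary check of velocity statistics within each size bin.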


As an alternate approach, one could assume a uniform velocity (Um)
instead of a uniform droplet size when making estimates for Smv and Ske in
Equations 12 and 13. By employing this assumption, the only difference from
the previous approach is that Ske = Um² - 1. The term Urms² disappears due to the
assumption that the velocity is constant. It is interesting to note that the
influence of the droplet size distribution on the estimates of the source terms
is the same in these two approaches. The assumption of uniform velocity is
verified experimentally by plotting the mean and RMS values of the droplet
velocity in each size bin as shown in Figure 6b. The RMS values for the
velocity in each bin are less than 0.02 (6%).

A third approach can be used without making any assumption about the
droplet size or droplet velocity distributions. This approach is based on two
additional mean variables which are not directly measured by the PDPA. If one
defines the mean velocity and RMS values of the velocity in terms of droplet
volume instead of droplet number, then the source terms Smv and Ske can be
derived by simply replacing the number-based mean velocity and the RMS of
the velocity in Equations 12 and 13 with volume-based variables. Although
these two mean variables are not directly available from the standard PDPA
output, they could be obtained from the raw data.
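A short sketch of this third approach, assuming the raw data are available as parallel lists of droplet diameters and of velocities already normalized by Ujet (function names are hypothetical). Each droplet is weighted by D^3, so the results equal the integrals of f D^3 U and f D^3 U^2 divided by the integral of f D^3, with no assumption about coupling between size and velocity.

```python
import math


def volume_weighted_velocity_stats(diameters, velocities):
    """Return (Um_v, Urms_v): mean and RMS fluctuation of velocity weighted by D^3."""
    weights = [d ** 3 for d in diameters]
    total = sum(weights)
    u_mean = sum(w * u for w, u in zip(weights, velocities)) / total
    variance = sum(w * (u - u_mean) ** 2 for w, u in zip(weights, velocities)) / total
    return u_mean, math.sqrt(variance)


def source_terms_volume_based(diameters, velocities):
    """Smv and Ske from Eqs. (12)-(13) using the volume-based mean and RMS velocity."""
    u_m, u_rms = volume_weighted_velocity_stats(diameters, velocities)
    return u_m - 1.0, (u_rms ** 2 + u_m ** 2) - 1.0
```

The diameters may be in any consistent unit, since the D^3 weights are normalized by their sum.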

The  droplet  distributions  obtained  from  these  estimated  source  terms 
only  provide  an  initial  result,  which  will  probably  require  modification  of  the 
source  terms  in  order  to  yield  distributions  that  agree  with  the  experimental 
data.  Therefore  we  will  not  concern  ourselves  with  the  different  possible 
results  obtained  from  these  three  approaches  for  estimating  the  source  terms, 
but  choose  the  first  approach  -  constant  droplet  diameter.  It  is  believed  that 
any  one  of  these  three  approaches  will  provide  sufficiently  accurate  estimates 
of  the  source  terms  needed  to  make  the  initial  calculations.  Once  the  source 
terms are estimated, the corresponding distribution functions of droplet size
and droplet velocity are obtained by integrating Equation (11) over U and D,
respectively. They are:

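$$\frac{dN}{dD} = \frac{3}{2}\sqrt{\frac{\pi D}{\alpha_3}}\,\left[\operatorname{erf}(X_{max}) - \operatorname{erf}(X_{min})\right] \exp\left\{-\alpha_0 - \alpha_4 B D^2 - \left(\alpha_1 - \frac{\alpha_2^2}{4\alpha_3}\right) D^3\right\}$$   Eq. (15)

$$\frac{dN}{dU} = \int f\,dD$$   Eq. (16)

where erf(X) is the error function of X,

$$X_{max} = \left(U_{max} + \frac{\alpha_2}{2\alpha_3}\right)\left(\alpha_3 D^3\right)^{1/2}$$

$$X_{min} = \left(U_{min} + \frac{\alpha_2}{2\alpha_3}\right)\left(\alpha_3 D^3\right)^{1/2}$$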

Since there is no closed form solution for the droplet velocity
distribution, Equation (16) is evaluated by numerical integration. N is the
normalized droplet number. "Estimated" distributions (based on estimated
source terms) for these two sets of data are shown in Figures 7 and 8. The
estimated source terms and the corresponding parameters (α0, α1, α2, α3, α4)
for these predicted distributions are listed in Tables I & II. Agreement between
measurements and estimated correlations is reasonably good. However, there
is a discrepancy between the peaks of the estimated distributions and the
experimental data.
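The two marginal distributions can be cross-checked numerically. The sketch below evaluates the closed form of Eq. (15) with `math.erf` and compares it against a trapezoidal integration of Eq. (11) over U; the same trapezoid helper gives Eq. (16). All function names, the multiplier values, and the integration limits in the usage lines are illustrative assumptions, and the closed form requires a positive α3.

```python
import math


def joint_pdf(d, u, alphas, b):
    """Eq. (11) for dimensionless diameter d and velocity u."""
    a0, a1, a2, a3, a4 = alphas
    d3 = d ** 3
    return 3.0 * d ** 2 * math.exp(
        -a0 - a1 * d3 - a2 * d3 * u - a3 * d3 * u ** 2 - a4 * b * d ** 2
    )


def trapezoid(func, lo, hi, steps=2000):
    """Composite trapezoidal rule for func on [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.5 * (func(lo) + func(hi))
    for k in range(1, steps):
        total += func(lo + k * h)
    return total * h


def dn_dd_closed_form(d, alphas, b, u_min, u_max):
    """Eq. (15): size distribution from integrating Eq. (11) over U analytically."""
    a0, a1, a2, a3, a4 = alphas
    scale = math.sqrt(a3 * d ** 3)
    x_max = (u_max + a2 / (2.0 * a3)) * scale
    x_min = (u_min + a2 / (2.0 * a3)) * scale
    prefactor = 1.5 * math.sqrt(math.pi * d / a3)
    exponent = -a0 - a4 * b * d ** 2 - (a1 - a2 ** 2 / (4.0 * a3)) * d ** 3
    return prefactor * (math.erf(x_max) - math.erf(x_min)) * math.exp(exponent)


def dn_dd_numerical(d, alphas, b, u_min, u_max):
    """Same quantity by numerical integration of Eq. (11) over U."""
    return trapezoid(lambda u: joint_pdf(d, u, alphas, b), u_min, u_max)


def dn_du_numerical(u, alphas, b, d_min, d_max):
    """Eq. (16): velocity distribution by numerical integration over D."""
    return trapezoid(lambda d: joint_pdf(d, u, alphas, b), d_min, d_max)


# Placeholder multipliers (not the values of Tables I and II).
alphas = (0.0, 1.0, -2.0, 1.0, 0.5)
closed = dn_dd_closed_form(1.0, alphas, 0.1, 0.0, 3.0)
numeric = dn_dd_numerical(1.0, alphas, 0.1, 0.0, 3.0)
print(closed, numeric)
```

Agreement between the two printed values is a quick sanity check on the algebra behind Eq. (15).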

The "estimated" distribution function was further modified by adjusting
the source terms, resulting in "fitted" distribution functions. The fitting
technique applied in this case relies on the fact that the We number and the
surface energy source term Sse have a major effect on the droplet size
distribution and a minor effect on the velocity distribution. On the other
hand, the other two source terms, Smv and Ske, have a major effect on the
velocity distribution but a minor effect on the size distribution. The procedure
used for fitting is described briefly as follows:

(1). To fit the droplet size distribution, the We number was fixed and the
surface energy source Sse was adjusted until the maximum amplitude of
dN/dD of the experimental data and the fitted data matched. In this manner,
the size distribution is fitted while the velocity distribution remains
essentially unchanged.

(2). To fit the droplet velocity distribution, the source term Smv was fixed
and the kinetic energy source term Ske was adjusted. Thus the velocity
distribution is fitted while the size distribution remains essentially
unchanged.

The results for the combined changes in Sse and Ske are shown in Figures
7 and 8. The corresponding parameters and source terms are listed in Tables I
and II. The source terms used in the "estimated" and "fitted"
curves differ by only a fraction of a percent. For example, in Figure 7, the
difference in Sse is only 0.0038% and the difference in Ske is only 0.00016%. In
Figure 8 these differences are only 0.0001% and 0.259%. Similar results were
obtained for the other five sets of data collected and examined. Thus, one can
see the importance of "accurately" estimating the source terms.

The distributions obtained using the previous formulation (Li & Tankin, 1989) of
a single energy flux constraint were also examined. Attempts were made to
determine the corresponding droplet distributions for the two sets of data
based on the formulation that there is only one constraint for the energy flux.
The combined energy source term is the sum of the two separate energy source
terms (Sse and Ske) estimated earlier (see Tables I & II). For the data set in Figure 7,
the numerical calculations did not converge. The results for the data set in Figure
8 reveal that the velocity and size distributions disagree with the
experimental data. Thus a single constraint on the combined energy source term
does not provide enough information for this spray system.

Finally  the  computed  flow  rate  using  the  PDPA  measurements  is 
compared  with  the  measured  flow  rate  obtained  by  weighing  a  sample  of  the 
droplets collected over a period of time. The following equation was used to
compute the flow rate:

Eq. (17)

where

Di is the diameter of the droplet in the "i"th bin

ṅi is the number of droplets per unit time in the "i"th bin.

The calculated flow rate using the PDPA differs from the measured flow rate
by 8%. However, since the diameter is raised to the third power, this corresponds
to a measured droplet diameter within 3% of the actual diameter. This implies
consistency between the measured data and the PDPA data. In making the
calculation represented by Equation 17, the bin diameter (Di) is the lower limit
of the ith bin. If one used an average bin diameter for the ith bin, the
difference between the metered flow rate and the calculated flow rate would be
about 3%, resulting in a discrepancy of only
1% in droplet diameter.
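The cube-root relation between flow-rate error and diameter error can be checked in a few lines. The flow-rate function below is only a sketch that assumes a volumetric sum of (π/6) Di³ ṅi over the size bins, built from the two quantities defined for Equation 17; whether the report's equation also carries the liquid density is not stated here, and the function names are hypothetical.

```python
import math


def volume_flow_rate(bin_diameters, bin_rates):
    """Assumed form: sum over size bins of (pi/6) * D_i^3 * n_i (volume per unit time)."""
    return sum(math.pi / 6.0 * d ** 3 * n for d, n in zip(bin_diameters, bin_rates))


def diameter_error_from_flow_error(flow_error: float) -> float:
    """Relative diameter error implied by a relative flow-rate error, since Q ~ D^3."""
    return (1.0 + flow_error) ** (1.0 / 3.0) - 1.0


print(f"{diameter_error_from_flow_error(0.08):.3f}")  # 8% discrepancy in flow rate
print(f"{diameter_error_from_flow_error(0.03):.3f}")  # 3% discrepancy in flow rate
```

The two results, about 2.6% and 1.0%, are consistent with the 3% and 1% diameter figures quoted above.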

V.  Conclusions 

The  agreement  between  theory  and  experiments  is  reasonably  good  for 
the  Rayleigh  problem  under  investigation.  In  order  to  obtain  these  results  it 
was  necessary  to  modify  the  previous  theory  of  Li  and  Tankin  (1987,  1988, 
1989),  by  separating  the  energy  source  term  into  a  kinetic  energy  source  term 
and a surface energy source term. For the breakup of a
cylindrical jet, it was possible to evaluate the source terms to within a fraction of a percent. In
the experiments conducted in this study it was not necessary to involve the
probe area of the PDPA (a possible source of error) since all the droplets passed
through the measuring volume. Also, the acceptance ratio exceeded 0.91 in
these experiments. The measured liquid flow rate agrees to within 8% with that
obtained from the PDPA measurements.

VIII. Figure Captions

Figure  1.  Schematic  drawing  of  the  droplet  generator  setup. 

Figure  2.  Photograph  showing  the  stream  of  droplets  produced  by  the  droplet 
generator.

Figure 3. Control volume used in this study for the liquid jet and droplets.

Figure  4.  Print-out  of  the  measurements  obtained  from  PDPA  for  data  set  #1. 
Figure  4a  is  the  number  of  droplets  versus  droplet  diameter,  and  Figure 
4b  is  the  number  of  droplets  versus  droplet  velocity. 

Figure  5.  Print-out  of  the  measurements  obtained  from  PDPA  for  data  set  #2. 
Figure  5a  is  the  number  of  droplets  versus  droplet  diameter,  and  Figure 
5b  is  the  number  of  droplets  versus  droplet  velocity. 

Figure 6. Plots showing the dependence of the droplet velocity on droplet
diameter. Figure 6a is a plot of the mean diameter and its rms values
versus droplet velocity. Figure 6b is a plot of the mean velocity and its
rms values versus droplet diameter.

Figure  7.  Comparison  between  experimental  data  (  ),  "estimated",  and 
"fitted"  results  for  data  set  #  1.  In  Figure  7a  are  the  droplet  size 
distributions  and  in  Figure  7b  are  the  droplet  velocity  distributions. 

Figure  8.  Comparison  between  experimental  data  (  ),  "estimated",  and 
"fitted"  results  for  data  set  #  2.  In  Figure  8a  are  the  droplet  size 
distributions and in Figure 8b are the droplet velocity distributions.

Figure 1  Schematic drawing of droplet generator
Figure 3  Liquid jet, formed from an orifice, breaking into droplets in control volume
Figure 6(a)  Mean diameter and diameter RMS vs. droplet velocity
Figure 7(a)  Droplet size distributions for data set #1
Figure 7(b)  Droplet velocity distributions for data set #1
Figure 8(a)  Droplet size distributions for data set #2

ABSTRACT

The  concept  of  a  function  is  central  to  the  mission  of  fire  control.  We  want  the  hardware 
to  learn  the  desired  function  -  to  give  the  desired  output  when  presented  with  input.  With 
this  in  mind,  efficiency  can  be  best  achieved  by  finding  the  simplest  form  of  the  desired 
function.  Several  experiments  have  been  done  that  suggest  that  learning  is  best  done  with 
t he  technique  known  as  Function  Extrapolation  by  Recomposing  Decompositions  (FERD). 
We  present  some  results  about  FERD. 
INTRODUCTION


In the field of avionics, complicated tasks must be performed quickly and efficiently. A vast amount of information must be processed while that information is still relevant. A natural conclusion is that this information-processing and decision-making can best be done with the best machine. However, with all other things being equal, simplifying the algorithm that describes the task is far more efficient than boosting the hardware.

With that in mind, the Systems Concept Group has been studying Pattern-Based Machine Learning, especially the idea of function decomposition (the process of receiving information and giving a course of action is the process of a function: input is given to the machine and output is returned). Since there is no way to program all the possible inputs that a machine may receive, we want the machine to learn its intended function. What is the best way for the machine to learn?

Several experiments run by Dr. Ross and Lt. Taylor at WPAFB suggest that FERD offers the best approach to learning. Let us briefly explain how FERD works.

We have a function that we would like the machine to learn. Our functions are of the form $f : \{0,1,\ldots,2^n-1\} \to \{0,1\}$. We may think of $f$ as a collection of ordered pairs $f = \{(a,b) : a \in \{0,1,\ldots,2^n-1\},\ b \in \{0,1\},\ f(a) = b\}$. We randomly choose a sample from $f$ (in its set form), then find the function of least complexity that agrees with $f$ on this sample. That function is the learned function that FERD selects.

The results obtained so far indicate that FERD generally gets the function right. That is, its learned function is the desired function (when the size of the sample is on the order of the complexity of the desired function). FERD performs better than other learning techniques that have been tried, neural nets for example.

We present here some theoretical results about FERD in the hope of determining why FERD works, whether it will always work in the manner described above, and the implications of this for machine learning.
OBJECTIVES


Our primary objective is to investigate function decomposition and its applicability to machine-learning. All the experiments performed to this date suggest that for the functions we are concerned with, function decomposition provides the best learning algorithm. We are investigating why this is so by formulating some of its general properties and investigating how these properties relate to specific cases.

We  have  also  read  many  articles  in  an  attempt  to  find  where  the  function  decomposition 
algorithm  approach  to  learning  fits  into  other  areas  of  complexity  theory  and  learning 


THE  REPORT 


Let us first explain our general situation. We have a function $f : \{0,1,\ldots,2^n-1\} \to \{0,1\}$, often called a binary or Boolean function. The function $f$ can be visualized as a set of ordered pairs $(a,b)$ where $a \in \{0,1,\ldots,2^n-1\}$, $b \in \{0,1\}$, and $f(a) = b$. This is the function that we want the machine to learn. To accomplish this, we select a sample of ordered pairs from $f$ and call this set $A$. Thus, $A \subset f$.

When the machine-learning system is given $A$, it then attempts to reconstruct a function, $g$, from this sample. Perhaps $g$ will equal $f$, perhaps not. The latter case is more likely. In fact, with some machine-learning systems, the learned function, $g$, may not even equal $f$ on the sampled set $A$! We call the number of values at which $f$ and $g$ disagree, $|\{x : f(x) \ne g(x)\}|$, the error $e$. Ideally, we would like $e$ to be zero. This would mean that $g = f$ and the machine has learned the function. Note that we can always do this by letting $A = f$, in which case no learning is necessary. However, as might be expected, it is impractical if not impossible for $A$ to equal $f$. For one thing, $f$ is too large. Its ordered pair listing can contain billions of ordered pairs. For another, often we don't know all of $f$. In practice, $|A|$, the number of elements in $A$, is far less than $|f|$.

The best learning that can be done is done by the algorithm that minimizes $e$ on the smallest $A$ when compared to all other learning algorithms. There is not one algorithm that can do this for every function possible and every sample that we might take. However, the Function Extrapolation by Recomposing Decompositions (FERD) has consistently outperformed other algorithms when presented with many different types of binary functions. Several results on FERD are found in [4].

Let us briefly explain how it works. When the random sample, $A$, has been selected, FERD constructs functions with the same domain and range as $f$ that are equal to $f$ on $A$ and have complexity of at most the complexity of $f$. Our complexity here is the Decomposed Function Cardinality (DFC) complexity defined by Ross in [4]. We write the cost of a function $g$ as $C(g)$.

Let $F = \{g : g : \{0,1,\ldots,2^n-1\} \to \{0,1\}\}$, $S = \{g \in F : g(x) = f(x)\ \forall x \in A\}$, and $C = \{g \in F : C(g) \le C(f)\}$. Then FERD finds a function in $S \cap C$. It finds a function of lowest cost that agrees with $f$ on $A$. We call this function the learned function. Of course, the best result is when the learned function is $f$. In that case, FERD "gets it right".
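The sets $S$, $C$, and $S \cap C$ can be illustrated with a small Python sketch that enumerates all sixteen Boolean functions of two variables. The cost table is an assumption made only for illustration: it takes the values this report quotes for two-variable functions (0 for constants and projections, 2 for complemented projections, 4 for the Boolean sum of the two projections) and assigns 4 to every remaining function. It is not Ross's DFC algorithm, and the names `toy_cost`, `candidates`, and `lowest_cost` are hypothetical.

```python
from itertools import product

N = 2                                  # number of input bits (toy size)
DOMAIN = list(range(2 ** N))           # x = 0..3, read as bits x1 x2

# A function is a tuple of outputs indexed by x.
ALL_FUNCTIONS = list(product((0, 1), repeat=len(DOMAIN)))

CONSTANTS = {(0, 0, 0, 0), (1, 1, 1, 1)}
PROJECTIONS = {(0, 0, 1, 1), (0, 1, 0, 1)}       # x1 and x2
COMPLEMENTED = {(1, 1, 0, 0), (1, 0, 1, 0)}      # not x1 and not x2


def toy_cost(g):
    """Assumed stand-in for the DFC cost on two-variable functions."""
    if g in CONSTANTS or g in PROJECTIONS:
        return 0
    if g in COMPLEMENTED:
        return 2
    return 4


def candidates(f, sample):
    """Return S ∩ C: functions equal to f on the sample with cost <= C(f)."""
    return [g for g in ALL_FUNCTIONS
            if all(g[x] == f[x] for x in sample) and toy_cost(g) <= toy_cost(f)]


def lowest_cost(f, sample):
    """Functions of least cost among the candidates (the possible learned functions)."""
    cands = candidates(f, sample)
    best = min(toy_cost(g) for g in cands)
    return [g for g in cands if toy_cost(g) == best]


zero = (0, 0, 0, 0)
print(candidates(zero, [0]))      # sample contains only x = 00
print(candidates(zero, [0, 3]))   # adding x = 11 rules out both projections
```

Under these assumed costs, the one-point sample leaves three tied candidates (the zero function and both projections), while the two-point sample leaves only the zero function.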

We now present some facts about costs. Let $f$ be given and let $A \subset f$. Define $\overline{C}_A(f)$ = the largest cost of the functions in $S$ and $\underline{C}_A(f)$ = the smallest cost of the functions in $S$. To be precise, in the first definition, there may not be a largest cost, but what we mean is the cost such that no function in the set has a higher cost. The same is true for the second definition.

If $V$ is a finite set of real numbers, let $\min(V)$ denote the number $m \in V$ such that $m \le v$ for all $v \in V$, and let $\max(V)$ denote the number $M \in V$ such that $M \ge v$ for all $v \in V$. Recall that for any finite sets of real numbers, $T$ and $U$, if $T \subset U$, then $\min(T) \ge \min(U)$ and $\max(T) \le \max(U)$.

LEMMA 1. If $A \subset A' \subset f$ then

(1) $\underline{C}_A(f) \le \underline{C}_{A'}(f)$; and

(2) $\overline{C}_A(f) \ge \overline{C}_{A'}(f)$.

PROOF: Note that if $A \subset A'$, then $S_A = \{g \in F : g(x) = f(x)\ \forall x \in A\} \supset \{g \in F : g(x) = f(x)\ \forall x \in A'\} = S_{A'}$. Both statements now follow from the above remarks since the quantities involved are the minimum and maximum, respectively. ■

The Lemma allows us to state and prove the upcoming Theorem. It says that regardless of the sample sizes involved, the smallest costing function resulting from a sample can be no larger than the largest costing function resulting from any other sample.

THEOREM 2. For any $f \in F$ and any $A, B \subset f$, $\underline{C}_A(f) \le \overline{C}_B(f)$.

PROOF: Consider $A \cup B$, which is also contained in $f$. Since $A, B \subset A \cup B$, we have by definition and by the Lemma that

$\underline{C}_A(f) \le \underline{C}_{A \cup B}(f) \le \overline{C}_{A \cup B}(f) \le \overline{C}_B(f)$. ■

This  allows  us  to  visualize  FERD’s  learned  function  eventually  converging  to  f . 

We  now  list  a  result  similar  to  those  above. 

The learned function is obtained by chance. A random sample of $f$ is chosen and FERD constructs a function consistent with $f$ on that sample. We would like the probability that the learned function is $f$ to be one, but this is not usually the case. We strive to know how much of FERD is random and how much is determinable.

Probabilistically, let $f$ be given, let $A \subset f$, and let $P_A(f)$ be the probability that given sample $A$, FERD correctly identifies $f$.

PROPOSITION 3. If $A_1 \subset A_2 \subset f$, then $P_{A_1}(f) \le P_{A_2}(f)$.

PROOF: Let $A_1 \subset A_2$, $S_1 = \{g \in F : g(x) = f(x)\ \forall x \in A_1\}$, $S_2 = \{g \in F : g(x) = f(x)\ \forall x \in A_2\}$. Then, as in the Lemma, $S_2 \subseteq S_1$ so that $S_2 \cap C \subseteq S_1 \cap C$. Hence, FERD has a larger field to search through to correctly learn $f$ when learning on $A_1$ as opposed to when it learns on $A_2$. Therefore, $P_{A_1}(f) \le P_{A_2}(f)$. ■

It still might be the case that $A_1 \ne A_2$ and $P_{A_1}(f) = P_{A_2}(f)$. For example, let $f$ be defined by $f(x) = 0\ \forall x \in X$. Let $A_1 = \{0,1,\ldots,2^n-2\}$, $A_2 = \{0,1,\ldots,2^n-1\} = X$ where $n > 1$. Then $A_1 \ne A_2$ yet $P_{A_1}(f) = P_{A_2}(f)$ because the only function that agrees with $f$ on $A_1$ and has complexity at most that of $f$ is $f$ itself.

This  points  out  the  uncertainty  when  dealing  with  complexity  and  learning. 

For example, it is often helpful to have a "triangle inequality" with a measure (here our measure is complexity $C(f)$). With this measure, we might hope that for all $f$ and $g$ in $F$, $C(f + g) \le C(f) + C(g)$. However, this is not the case. Note that in this case the + used in adding the functions is the + of Boolean addition, where all other sums are as expected except that $1 + 1 = 1$. Consider the following table:

x     f(x)   g(x)   (f+g)(x)
00    0      0      0
01    1      0      1
10    0      1      1
11    1      1      1

Note that $C(f) = C(g) = 0$ since both are projection functions, yet $C(f + g) = 4$. Thus, in this example, $C(f + g) > C(f) + C(g)$.

Perhaps a "reverse triangle inequality" holds. Perhaps it is true that for all $f$ and $g$, $C(f + g) \ge C(f) + C(g)$. Again, no.

Let $f$ be any function whose cost is not zero. Let $g = \bar{f}$, the complement of $f$ (the function that is zero when $f$ is one and is one when $f$ is zero). Then for all $x$, $(f + g)(x) = 1$, so that the cost of $f + g$ is zero. Hence in this case we have $0 = C(f + g) < C(f) < C(f) + C(g)$.

Let $f_1$, $f_2$, $g_1$, and $g_2$ be as defined below.

x     f1(x)   f2(x)   g1(x)   g2(x)
00    0       0       1       1
01    0       0       0       0
10    0       0       1       1
11    0       1       0       1

If we let $d(f,g) = |\{x : f(x) \ne g(x)\}|$, and let $\min(f) = \min(|\{x : f(x) = 1\}|, |\{x : f(x) = 0\}|)$ (sometimes called "the number of minterms of $f$"), then $d(f_1,f_2) = d(g_1,g_2) = 1$, $C(f_1) < C(f_2)$, $C(g_1) < C(g_2)$, $\min(f_1) < \min(f_2)$, and $\min(g_1) > \min(g_2)$. So two functions may be close to one another and have different costs, and we cannot resolve which has the lower cost just by counting the number of minterms.
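The distance and minterm counts quoted above can be checked directly from the truth tables. The following Python sketch is only an illustration; the helper names `distance` and `minterm_count` are hypothetical, and each function is stored as a tuple of outputs in the row order 00, 01, 10, 11.

```python
def distance(f, g):
    """d(f, g): number of inputs on which f and g disagree."""
    return sum(1 for a, b in zip(f, g) if a != b)


def minterm_count(f):
    """min(f): the smaller of the number of ones and the number of zeros."""
    ones = sum(f)
    return min(ones, len(f) - ones)


# Truth tables from the table above, rows ordered 00, 01, 10, 11.
f1 = (0, 0, 0, 0)
f2 = (0, 0, 0, 1)
g1 = (1, 0, 1, 0)
g2 = (1, 0, 1, 1)

print(distance(f1, f2), distance(g1, g2))
print([minterm_count(h) for h in (f1, f2, g1, g2)])
```

Both pairs are at distance 1, yet the minterm count rises from $f_1$ to $f_2$ and falls from $g_1$ to $g_2$, while the cost rises in both cases.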
There is not total despair in this area, however, as several bounds relating these ideas can be found in [4].

We now return to the problem of finding the probability that a sample leads to FERD correctly identifying $f$. Recall that if $A \subset f$, $S \cap C = \{g \in F : g(x) = f(x)\ \forall x \in A \text{ and } C(g) \le C(f)\}$. When we speak of more than one function, we will write $(S \cap C)_f$ to avoid confusion. Write $P(A \to f)$ to mean the probability that given sample $A$, FERD learns $f$.

A projection function is a function of the form $f(x_1, x_2, \ldots, x_i, \ldots, x_n) = x_i$ for all $x$ in the domain $X = \{0,1,\ldots,2^n-1\}$, where $i$ is fixed and $1 \le i \le n$. Here we write $x$ in its binary form so that each coordinate is 0 or 1. Thus, a projection function's value at a number is determined by the $i$th coordinate of the number. When $f$ is a projection function, we take $C(f) = 0$ and $C(\bar{f}) = 2$ by convention.

PROPOSITION 4. If $f$ is not a projection function then given any $A \subset f$, $P(A \to f) = P(\bar{A} \to \bar{f})$ (here $\bar{A}$ denotes the sample of $\bar{f}$ taken at the same inputs as $A$).

PROOF: Since $f$ is not a projection function, $C(f) = C(\bar{f})$. So that $g \in (S \cap C)_f \iff \bar{g} \in (S \cap C)_{\bar{f}}$. That is, the structures of $(S \cap C)_f$ and $(S \cap C)_{\bar{f}}$ are dual. Since FERD's learned function is a function in the respective sets with minimal cost, the statement follows. ■

Now define $C(f,A,i) = \{g \in F : C(g) \le C(f) \text{ and } |\{x \in A : g(x) = f(x)\}| = i\}$.

THEOREM 5. If $f$ is not a projection function and $A \subset f$, then for all $k$, $0 \le k \le a = |A|$, $|C(f,A,k)| = |C(f,A,a-k)|$.

PROOF: Let $k$ be as stated and let $g \in C(f,A,k)$. Then $|\{x \in A : g(x) = f(x)\}| = k$, so $|\{x \in A : g(x) \ne f(x)\}| = a - k$. But this last set is the same as $\{x \in A : \bar{g}(x) = f(x)\}$, so $|\{x \in A : \bar{g}(x) = f(x)\}| = a - k$. However, if $g = \bar{f}$ on a set of size $a - k$ and $C(g) \le C(f)$, then $\bar{g} = f$ on a set of size $a - k$ and $C(\bar{g}) \le C(f)$. Thus, $|C(f,A,k)| = |C(\bar{f},A,a-k)| = |C(f,A,a-k)|$. ■


There is one specific result from this Theorem that is worth mentioning. The number of functions that agree with $f$ on a sample and have complexity at most that of $f$ is the same as the number of functions that agree with $f$ nowhere on that sample and have complexity at most that of $f$, when $f$ is not a projection function. The truth of the previous sentence can be demonstrated by letting $k = a$ in the previous Theorem. We have shown the following.

COROLLARY 6. Let $f$ and $A$ be as above. Then $|C(f,A,a)| = |C(f,A,0)|$.

The Theorem gives us some way of counting the functions that agree with a given function on a fixed sample. We would like to know this number because it lies at the heart of the manner in which FERD operates.

PROPOSITION 7. If $f$ is not a projection function and $A \subset f$, then

$$|S \cap C| = \begin{cases} 2 \sum_{k=0}^{(a-1)/2} |C(f,A,k)| & \text{if } a \text{ is odd}, \\ 2 \sum_{k=0}^{(a-2)/2} |C(f,A,k)| + |C(f,A,a/2)| & \text{if } a \text{ is even}. \end{cases}$$

PROOF: Note that $C(f,A,k) \cap C(f,A,j) \ne \emptyset$ if and only if $k = j$. For if $g$ agrees with $f$ on precisely $k$ numbers in $A$ then it cannot agree with $f$ on precisely $j$ numbers in $A$ unless $k = j$. Therefore,

$$|S \cap C| = |\{g \in F : g(x) = f(x)\ \forall x \in A \text{ and } C(g) \le C(f)\}| = \Big|\bigcup_{0 \le k \le a} C(f,A,k)\Big| = \sum_{k=0}^{a} |C(f,A,k)|.$$

The last equality follows from the argument above. If $a$ is odd, we are summing an even number of terms and

$$\begin{aligned} |S \cap C| &= \sum_{k=0}^{(a-1)/2} |C(f,A,k)| + \sum_{k=(a+1)/2}^{a} |C(f,A,k)| \\ &= \sum_{k=0}^{(a-1)/2} |C(f,A,k)| + \sum_{k=0}^{(a-1)/2} |C(f,A,a-k)| \\ &= \sum_{k=0}^{(a-1)/2} |C(f,A,k)| + \sum_{k=0}^{(a-1)/2} |C(f,A,k)| \\ &= 2 \sum_{k=0}^{(a-1)/2} |C(f,A,k)|. \end{aligned}$$

The next-to-last statement follows from the previous Theorem. If $a$ is even, we have

$$\begin{aligned} |S \cap C| &= \Big( \sum_{k=0}^{(a-2)/2} |C(f,A,k)| \Big) + |C(f,A,a/2)| + \sum_{k=(a+2)/2}^{a} |C(f,A,k)| \\ &= \Big( \sum_{k=0}^{(a-2)/2} |C(f,A,k)| + \sum_{k=0}^{(a-2)/2} |C(f,A,a-k)| \Big) + |C(f,A,a/2)| \\ &= \Big( \sum_{k=0}^{(a-2)/2} |C(f,A,k)| + \sum_{k=0}^{(a-2)/2} |C(f,A,k)| \Big) + |C(f,A,a/2)| \\ &= 2 \sum_{k=0}^{(a-2)/2} |C(f,A,k)| + |C(f,A,a/2)|. \end{aligned}$$ ■

This  gives  us  a  start  on  counting  the  cardinalities  of  the  sets  in  which  FERD  does  its 
learning. 

We can generalize the notion of $C(f,A,k)$ a bit, and investigate the following: $\mathcal{C}(f,A,k) = \{g \in F : C(g) \le C(f) \text{ and } |\{x \in A : g(x) = f(x)\}| \ge k\}$.

So that $\mathcal{C}(f,A,k) = \bigcup_{i=k}^{a} C(f,A,i)$. Now it is not the case that given $f$ and $A$, $\mathcal{C}(f,A,k)$ and $\mathcal{C}(f,A,j)$ are disjoint when $j \ne k$. In fact, from the definition, it can be seen that if $j < k$, then $\mathcal{C}(f,A,j) \supset \mathcal{C}(f,A,k)$.

It can be noted that if $A_1 \subset A_2 \subset f$ and $0 \le k \le a$, then $\mathcal{C}(f,A_1,k) \subset \mathcal{C}(f,A_2,k)$. For if $g \in \mathcal{C}(f,A_1,k)$ then $C(g) \le C(f)$ and $|\{x \in A_1 : g(x) = f(x)\}| \ge k$. Since $A_1 \subset A_2$, $|\{x \in A_2 : g(x) = f(x)\}| \ge k$. Therefore, we can say that $\mathcal{C}$ is what is called homogeneous in $A$ and anti-homogeneous in $k$. No nice inequalities can be found in the $f$-coordinate, and no inequalities can be found for the original $C(f,A,k)$ function.

Note that given $f$ and $A$, the function learned by FERD, $g$, will be in $C(f,A,a)$ and $\mathcal{C}(f,A,k)$ for each $k$.

If FERD identifies $f$, then $f \in S \cap C$. For the moment, let us assume that FERD has identified $f$ and that $S \cap C = \{f\}$. There are two reasons for doing so: 1) this is the more desirable situation since if $\{f\} \ne S \cap C$, then FERD is only finding $f$ by chance and another learning experiment done with the same function might not produce $f$, and 2) in the experiments performed with FERD, when FERD correctly identifies $f$, it appears that $S \cap C = \{f\}$.

Now let $L = \{A \subset f : (S \cap C)_f \to g \text{ where } C(g) < C(f)\}$ and $E = \{A \subset f : (S \cap C)_f \to g \text{ where } C(g) = C(f)\}$. Then under our simplifying assumption, $P(S \cap C = \{f\})$ = P(sample gives no other function as complex or less complex than $f$ in $S \cap C$) = P(sample gives no function less complex than $f$) $-$ P(sample gives no function as complex as $f$) $= P(L) - P(A \cap L) = P(L) - P(A|L)P(L) = P(L)(1 - P(A|L))$. Since $1 - P(A|L) \le 1$, we can deduce that as the complexity of the desired function strictly increases, the probability that a sample will correctly identify $f$ decreases, although perhaps not strictly. Therefore, as might be predicted, FERD has the property that it is harder for it to learn a more complex function than a simpler function.

As an example of FERD-learning, we calculate the chance that FERD identifies $f$ when $f$ is a constant function, that is, either $f(x) = 0$ for all $x$ or $f(x) = 1$ for all $x$. In this case, if we make the simplifying assumption above, there is really nothing to calculate, so we will assume no extra properties of FERD. Also, if we assume that the constant function is the simplest function possible, then FERD will always make the correct identification because there can be no other function that agrees with $f$ and has complexity no more than $f$. However, with the cost-complexity defined by Ross in [4], the projection functions have the same cost as the constant functions, a cost of 0.

Let $f$ be a constant function. Then $\forall A \subset f$, $f \in S \cap C$ because regardless of the sample chosen, it is always the case that $f = f$ on that sample and $C(f) \le C(f)$. If $|S \cap C| = 1$, then $S \cap C = \{f\}$ and FERD will learn $f$. If $|S \cap C| = 2$, then FERD learns $f$ half the time and learns the other function half the time. Similarly, given that $|S \cap C| = m$, P(FERD identifies $f$) $= 1/m$. When $f$ is a constant function, the only functions besides $f$ that could possibly be in $S \cap C$ are the projection functions and the other constant function, $\bar{f}$, since they are the only functions whose cost is no larger than the cost of $f$. However, the other constant function will not be learned by FERD since it does not agree with $f$ on any sample from $f$. So we can find the probability that a sample results in FERD identifying $f$ by finding the probability that a sample of $f$ leads to $k$ projection functions that agree with $f$ on the sample.

Let $k$ be the number of projection functions that equal $f$ on the sample $A$. Then P(FERD learns $f$) $= \sum_{k=0}^{n} \frac{1}{k+1} P(|S \cap C| = k+1) = \sum_{k=0}^{n} \frac{1}{k+1} P(\text{exactly } k \text{ projection functions agree with } f \text{ on } A)$.

Assume that $f(x) = 0\ \forall x \in X$. Write $p_i$ for the $i$th projection function (the function that maps an $x$-value to its $i$th coordinate). Thus, $p_i = f$ on $A$ $\iff$ $x_i = 0\ \forall x \in A$.

So exactly $k$ projection functions agree with $f$ on $A$ exactly when the coordinates corresponding to those projection functions are 0 for each element of $A$ and for each of the other coordinates there is at least one element of $A$ that is 1 in that coordinate.

Write $x_i = 0$ on $A$ to denote that the $i$th coordinate of $x$ is 0 for each $x$ in $A$. What we have determined then is that P(FERD learns $f$) $= \sum_{k=0}^{n} \frac{1}{k+1} P(\text{for exactly } k \text{ coordinates, } x_i = 0 \text{ on } A)$. There are exactly $C(n,k)$ ways to choose the $k$ coordinates (where $C(n,k)$ denotes the combination of $n$ things taken $k$ at a time). Once they are chosen, $A$ must come from the set where each of these coordinates is 0 on the entire set. There are $2^{n-k}$ such numbers, so we have $C(2^{n-k}, a)$ ways to choose $A$ if we want at least $k$ coordinates to be 0. (Note that there are $C(2^n, a)$ ways to choose $A$ with no restrictions on $A$.) We must always assume that $a \le 2^{n-k}$; otherwise no such $A$ is possible and we take $C(2^{n-k}, a)$ to be zero.

To have exactly $k$ coordinates that are 0 on $A$, we must ensure that for each coordinate $i$ not equal to one of those $k$ coordinates, we have at least one $x$ in $A$ where $x_i = 1$. There are $n - k$ coordinates of this type. For convenience in notation, write these coordinates as $i = 1, 2, \ldots, n-k$. We would like to determine P(for the $n-k$ coordinates $i$, $x_i \ne 0$ on $A$) = P($x_1 \ne 0$ on $A$ and $x_2 \ne 0$ on $A$ and ... and $x_{n-k} \ne 0$ on $A$). These calculations would be fairly simple if the events we are concerned with were independent, but they are not. To determine the desired probability, we use the fact that it is equal to $1 -$ P($x_1 = 0$ on $A$ or $x_2 = 0$ on $A$ or ... or $x_{n-k} = 0$ on $A$) =

$$1 - \left[ \sum_{i=1}^{n-k} C(n-k,i)(-1)^{i+1} P(\text{at least } i \text{ of the } n-k \text{ coordinates} = 0 \text{ on } A) \right].$$

The expression inside the sum sign is obtained using basic probability rules, especially the union rule, which can be found in [2], for example, and the fact that $P(x_1 = 0) = P(x_2 = 0) = \ldots = P(x_{n-k} = 0)$.

From earlier work, we know that the probability that at least $i$ of the $n-k$ coordinates are 0 on $A$ is $\frac{C(2^{n-k-i}, a)}{C(2^{n-k}, a)}$. So that for the $n-k$ coordinates, $i = 1, \ldots, n-k$,

$$P(x_i \ne 0 \text{ on } A) = 1 - \frac{\sum_{i=1}^{n-k} (-1)^{i+1} C(n-k,i)\, C(2^{n-k-i}, a)}{C(2^{n-k}, a)}.$$

And the probability that FERD correctly identifies $f$ is

$$\frac{\sum_{k=0}^{n} \frac{1}{k+1} C(n,k) \left[ C(2^{n-k}, a) - \sum_{i=1}^{n-k} (-1)^{i+1} C(n-k,i)\, C(2^{n-k-i}, a) \right]}{C(2^n, a)}.$$


We  summarize  this  in  the  following. 
PROPOSITION 8. If $f$ is a constant function, then the probability that FERD correctly identifies $f$ is

$$\frac{\sum_{k=0}^{n} \frac{1}{k+1} C(n,k) \left[ C(2^{n-k}, a) - \sum_{i=1}^{n-k} (-1)^{i+1} C(n-k,i)\, C(2^{n-k-i}, a) \right]}{C(2^n, a)}$$

when $a \le 2^{n-1}$. If $a > 2^{n-1}$, then the probability that FERD correctly identifies $f$ is 1.
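The formula of Proposition 8 can be evaluated exactly with rational arithmetic. The Python sketch below is an illustration only: it assumes, as the derivation does, that FERD chooses uniformly at random among the zero-cost functions consistent with the sample, and it compares the closed form with a brute-force average over all samples of size $a$. The function names `p_learn_constant` and `p_learn_brute_force` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def p_learn_constant(n, a):
    """Closed form of Proposition 8 for n input bits and sample size a."""
    total = Fraction(0)
    for k in range(n + 1):
        # Samples on which a fixed set of k coordinates is 0 everywhere ...
        at_least = comb(2 ** (n - k), a)
        # ... minus those on which some further coordinate is also 0 everywhere
        # (inclusion-exclusion over the remaining n - k coordinates).
        extra = sum((-1) ** (i + 1) * comb(n - k, i) * comb(2 ** (n - k - i), a)
                    for i in range(1, n - k + 1))
        total += Fraction(comb(n, k) * (at_least - extra), k + 1)
    return total / comb(2 ** n, a)


def p_learn_brute_force(n, a):
    """Average of 1/(k+1) over all samples, k = number of all-zero coordinates."""
    total = Fraction(0)
    for sample in combinations(range(2 ** n), a):
        k = sum(1 for bit in range(n)
                if all((x >> bit) & 1 == 0 for x in sample))
        total += Fraction(1, k + 1)
    return total / comb(2 ** n, a)


for n in (1, 2, 3):
    print(n, [str(p_learn_constant(n, a)) for a in range(1, 2 ** n + 1)])
```

`math.comb` returns 0 when the lower argument exceeds the upper one, which matches the convention that $C(2^{n-k}, a)$ is taken to be zero when $a > 2^{n-k}$.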


We calculate this probability for specific values of $n$ and $a$. We write $P(F)$ for the probability that FERD identifies $f$. When $n = 1$, $f : \{0,1\} \to \{0,1\}$, so $a = 1$ and $P(F) = \frac{1 + 1/2}{2} = 3/4$. When $n = 2$,

a = 1      P(F) = 7/12
a = 2      P(F) = 5/6
a = 3,4    P(F) = 1

When $n = 3$,

a = 1      P(F) = 15/32
a = 2      P(F) = 5/7
a = 3      P(F) = 25/28
a = 4      P(F) = 137/140
a > 4      P(F) = 1

When $n = 4$,

a = 1      P(F) = 31/80
a = 2      P(F) = 5/8
a = 3      P(F) = 109/140
a = 4      P(F) = 841/910
a = 5      P(F) = 38/39
a = 6      P(F) = 143/144
a = 7      P(F) = 714/715
a = 8      P(F) = 6434/6435
a > 8      P(F) = 1

In table form, with each $(n,a)$-entry corresponding to $P(F)$ where $f : \{0,1,\ldots,2^n-1\} \to \{0,1\}$ is a constant function and $a = |A|$,

n \ a   1      2      3      4      5      6      7      8
1       .75    1      --     --     --     --     --     --
2       .583   .833   1      1      --     --     --     --
3       .469   .714   .893   .979   1      1      1      1
4       .388   .625   .779   .924   .974   .993   .999   >.999
The table shows patterns that are true in general. For fixed $f$ and fixed sample size (same column), the larger the domain, the smaller the chance of learning $f$. For fixed $f$ and fixed domain size (same row), the larger the sample, the larger the chance of learning $f$.

During the summer it was found that the ADA function decomposition algorithm (AFD) that is a prime ingredient in FERD was declaring variables/coordinates to be vacuous that really weren't. A variable is called vacuous if its value has no bearing on the function output. More precisely, $x_i$ is a vacuous variable for $f : x_1x_2x_3\ldots x_i\ldots x_n \to \{0,1\}$ if $f(x_1x_2\ldots x_{i-1}0x_{i+1}\ldots x_n) = f(x_1x_2\ldots x_{i-1}1x_{i+1}\ldots x_n)$ for all values of $x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n$. For example, for the function below, $x_1$ and $x_2$ are vacuous variables (where $x = x_1x_2x_3$).

x1   x2   x3   f(x)
0    0    0    0
0    0    1    1
0    1    0    0
0    1    1    1
1    0    0    0
1    0    1    1
1    1    0    0
1    1    1    1

To see if AFD was misidentifying vacuous variables, we counted the number of binary functions that had vacuous variables. This way we know the proportion of functions with vacuous variables to expect in a random sample. Let $V = \{f \in F : f \text{ has at least one vacuous variable}\}$. We seek $|V|$. Now, $|V| = |\{f \in F : f \text{ has at least one vacuous variable}\}| = |\bigcup_{1 \le i \le n} \{f \in F : f \text{ has at least } i \text{ vacuous variables}\}|$. So once again we use the union rule to count the desired set. If $f$ has at least $i$ vacuous variables then in effect $f$ is a function on $\{0,1,\ldots,2^{n-i}-1\}$. There are $2^{2^{n-i}}$ such functions. Therefore,

$$|V| = \sum_{i=1}^{n} C(n,i)(-1)^{i+1}\, 2^{2^{n-i}}.$$
This number is calculated for some values of $n$ below.

n    |V|
1    2
2    6
3    38
4    942

The table shows that $|V|$ does grow quickly but so does $|F|$. In fact, we expect $|V|/|F|$ to be on the order of $n/2^{2^{n-1}}$, a very small proportion.
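The count of functions with vacuous variables can be reproduced with a short Python sketch. It is an illustration only, not the AFD code: the helper names are hypothetical, a function is stored as a tuple of $2^n$ outputs, and bit 0 of the index is taken (by assumption) to be the last variable. The inclusion-exclusion formula is compared with a direct enumeration for small $n$.

```python
from itertools import product
from math import comb


def is_vacuous(f, i, n):
    """True if flipping input bit i never changes the output of f."""
    return all(f[x] == f[x ^ (1 << i)] for x in range(2 ** n))


def count_with_vacuous_brute_force(n):
    """Count truth tables on n bits that have at least one vacuous variable."""
    return sum(1 for f in product((0, 1), repeat=2 ** n)
               if any(is_vacuous(f, i, n) for i in range(n)))


def count_with_vacuous_formula(n):
    """Inclusion-exclusion count: sum over i of (-1)^(i+1) C(n,i) 2^(2^(n-i))."""
    return sum((-1) ** (i + 1) * comb(n, i) * 2 ** (2 ** (n - i))
               for i in range(1, n + 1))


for n in range(1, 5):
    v = count_with_vacuous_formula(n)
    total = 2 ** (2 ** n)              # |F|, the number of all functions
    print(n, v, total, v / total)
```

The printed ratio $|V|/|F|$ shrinks rapidly as $n$ grows, which is the point made in the text about the small proportion to expect in a random sample.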
RECOMMENDATIONS


Machine-learning is still a very new and largely unexplored area. Often an algorithm is presented that allows machine-learning for a specific type of function, for example, letter recognition. The FERD algorithm is different in that it attempts to learn any function regardless of its type. This strength makes it a powerful tool whose workings are still to be found. Progress has been made, but more is still needed.

We would like to try FERD on some larger functions, ones with larger domains, but currently that is impractical.

A  study  has  been  done  on  how  decomposed  function  cardinality  relates  to  Kolmogorov 
complexity  —  a  popular  measure  of  complexity  [3]. 

Recently, the concept of Vapnik-Chervonenkis dimension has come to our attention, especially through [1]. Its title suggests why it is of interest because it is desirable to know how large a sample we need to take in order to guarantee efficient learning.

We would like to relate our work to existing work to 1) indicate to outsiders the framework within which FERD is working and 2) to perhaps use some general theory that may apply to our algorithm. Presently, we have yet to find that link. Either the theory is too specific and so does not apply to FERD or it is too general to be of use.

However, make no mistake about the fact that FERD has outperformed more conventional and more well-known machine-learning systems. We will continue both to try to find out why and to broaden its application.
Abstract

This  report  presents  the  results  of  research  into  the  software  re-engineering  process.  The  demand 
for  successful  software  re-engineering  techniques  is  growing.  Research  into  software  engineering 
tends  to  focus  on  technical  aspects  of  the  problem.  However,  effective  technical  solutions  must 
fit  into  an  overall  framework  that  represents  the  entire  software  re-engineering  process.  While 
technical  solutions  are  needed,  there  is  more  to  the  process  of  re-engineering  software  than  just 
technical  work. 

The  software  re-engineering  process  can  be  divided  into  numerous  tasks.  These  tasks  fall  into 
three  groups  based  on  the  type  of  work  done  in  each  task.  The  three  groups  are:  management, 
technical,  and  support.  Much  of  the  research  into  software  re-engineering  focuses  on  technical 
aspects  of  re-engineering.  While  such  research  is  important,  the  key  to  a  successful  re-engineering 
project  is  planning  and  management. 

The goal of this research is to model the software re-engineering process. This report presents a new model of the re-engineering process that adds several more phases to the traditional re-engineering model. This report examines the Analysis and Planning phase of the re-engineering process and presents a formal specification of the information objects that are manipulated during this phase. The tasks that occur during this phase are also discussed.
Chapter 1


Research  Overview 


There  is  a  growing  interest  and  need  for  successful  software  re-engineering  methodologies  and 
techniques.  Research  into  software  re-engineering  tends  to  focus  on  technical  solutions.  While 
sound technical procedures for re-engineering are needed, this emphasis ignores other aspects of the
software re-engineering problem. One lesson learned from experience in software development is
that sound management and planning are necessary for a project to succeed. This same principle
is true for any software re-engineering project. This report examines the process of software
re-engineering. The planning phase of the re-engineering process is examined in detail.


1.1  Introduction  to  Research 

The  purpose  of  the  current  research  is  to  study  the  process  of  software  re-engineering.  The 
motivation  for  this  research  is  driven  by  a  simple  question:  How  can  a  software  re-engineering 
project  be  accomplished?  If  a  re-engineering  project  was  studied  and  all  the  work  done  within  the 
project  was  noted  and  recorded,  what  tasks  would  be  found?  What  information  would  be  created? 
Manipulated?  How  would  the  tasks  be  ordered?  How  would  the  project  be  organized?  What  skills 
would  be  needed?  What  tools  would  be  useful?  What  problems  would  be  encountered?  How  would 
they  be  solved?  By  asking  one  question,  many  questions  appear. 

To  answer  these  questions,  this  research  seeks  to  produce  a  formal  process  model  that  specifies 
the  overall  re-engineering  process  and  the  subprocess  activities  such  as  software  reverse  engineering 
and re-design. The questions driving this objective are: What information must be produced? and
When can it be produced? The first question focuses attention on the information used within
the  re-engineering  process.  The  properties  and  characteristics  of  this  information  must  be  stated. 
Relationships  between  information  groups  must  be  clarified  and  expressed.  The  second  question 
focuses  attention  on  the  tasks  that  create  and/or  manipulate  this  information.  Properties  of  these 
tasks  must  be  stated  and  the  relationships  between  tasks  must  be  expressed.  It  is  also  necessary  to 
declare the assumptions on which the process model is based, and the re-engineering capabilities
that are modeled.

What is a software process? A software process is the collection of related activities, seen
as a coherent process, used to accomplish some task such as software development, testing, or
maintenance [Dowson86]. What is a software process model? A software process model represents
attributes  for  a  particular  software  process  and  is  specific  enough  to  allow  reasoning  about  the 
process. 

What  is  the  significance  of  a  process  model?  Software  process  models  are  seen  as  important 
vehicles for understanding, evaluating, reasoning about, and improving software processes [ISPW88].
Processes are hard to understand and reason about, while process models, as static objects, are far
easier to comprehend. A process is a vehicle for doing a task, while a process model is a specification
of how the task is to be done. A software process model represents key relationships among a
variety  of  types  of  software  objects.  Typical  objects  that  are  modeled  are  activities,  data  objects, 
tools,  and  user  roles  within  the  process.  Thus,  a  formal  process  model  can  be  used  to  provide  an 
understanding  of  the  essential  properties  of  the  re-engineering  task. 

The  research  reported  here  is  a  continuation  of  research  done  by  Byrne  [Byrne90].  In  his  original 
report,  a  descriptive  process  model  of  the  technical  aspect  of  software  re-engineering  was  presented. 
The  major  technical  tasks  covered  were: 

Design  recovery  The  reconstruction  of  a  system  design  from  source  code. 

Re-design The alteration of the reconstructed design to satisfy new criteria.

Re-implementation The implementation of the altered design.

Re-documentation  The  updating  of  the  system  documentation  to  reflect  differences  between  the 
target  system  and  the  existing  system. 

Strangely,  in  the  first  report  a  fifth  technical  sub-process,  testing,  was  omitted.  The  descriptive 
process  model  focused  on  the  tasks  within  the  re-engineering  process.  The  inputs  and  outputs  for 
each  task  were  identified.  The  ordering  of  tasks  within  a  hierarchy  was  presented.  The  flow  of 
information  between  tasks  was  also  given. 
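
As a rough illustration of this kind of description, the four tasks and the information that flows between them can be sketched in Python. The input and output names below (such as "recovered design" and "target source code") are illustrative assumptions and are not the identifiers used in [Byrne90]. The ordering is not stated by hand; it is derived from which task produces an item that another task consumes.

```python
from graphlib import TopologicalSorter

# Each task lists the information it consumes and produces.
# All item names are hypothetical labels chosen for this sketch.
TASKS = {
    "design recovery": {
        "inputs": {"existing source code"},
        "outputs": {"recovered design"},
    },
    "re-design": {
        "inputs": {"recovered design", "new criteria"},
        "outputs": {"altered design"},
    },
    "re-implementation": {
        "inputs": {"altered design"},
        "outputs": {"target source code"},
    },
    "re-documentation": {
        "inputs": {"existing documentation", "target source code"},
        "outputs": {"target documentation"},
    },
}


def task_order(tasks):
    """Order tasks so that every producer runs before its consumers."""
    # Map each information item to the task that produces it.
    producer = {
        item: name for name, task in tasks.items() for item in task["outputs"]
    }
    # A task depends on the producers of its inputs; inputs with no
    # producer (e.g. the existing source code) come from outside the process.
    graph = {
        name: {producer[item] for item in task["inputs"] if item in producer}
        for name, task in tasks.items()
    }
    return list(TopologicalSorter(graph).static_order())


if __name__ == "__main__":
    for step, name in enumerate(task_order(TASKS), start=1):
        print(step, name)
```

Because each task in this sketch consumes the output of exactly one other task, the derived order is a single chain. A fifth task such as testing could be added as another dictionary entry without changing `task_order`.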

During  the  research  reported  here  it  was  realized  that  the  full  software  re-engineering  process 
has  three  major  aspects  or  areas  of  concern.  These  areas  are  project  management,  technical  work, 
and  project  support.  The  first  report  considered  only  the  technical  aspect  of  the  re-engineering 
process.  By  broadening  the  scope  of  the  re-engineering  process  it  is  possible  to  obtain  a  more 
complete  and  accurate  model  of  the  process.  What  quickly  becomes  almost  overwhelming  is  the 
variety  and  volume  of  information  that  is  generated  during  a  project.  There  are  many  information 
objects and operations on these objects. The relationships between objects are complex. By defining
a  model  of  the  process  it  is  possible  to  clarify  the  properties  of  these  information  objects  and  their 
interrelationships. 

The realization that only one aspect of the re-engineering process had been covered in the first
report  led  to  a  dilemma.  The  original  intent  of  this  stage  of  the  research  was  to  improve  and  build 
on  the  initial  process  model  presented  in  the  first  report.  However,  doing  so  would  mean  that  the 
other two aspects would continue to be ignored. It was decided that since the goal of this research
is  to  model  the  entire  re-engineering  process,  more  of  the  process  should  be  explored.  Therefore, 
the  research  reported  here  examines  the  overall  software  re-engineering  process.  Once  the  overall 
structure of the process was expressed, the work focused on one phase of the process. The analysis
and planning phase, which is the first phase of a re-engineering project, was examined in detail.
The  results  of  that  examination  are  presented  in  this  report. 

The  second  objective  of  this  research  is  to  identify  a  suitable  notation  to  express  the  developed 
process  model.  To  represent  a  formal  process  model  that  can  be  reasoned  about,  a  process  modeling 
language  is  used.  Such  a  language  has  a  formally  defined  syntax  and  semantics  that  permit  models 
expressed  in  the  language  to  be  analyzed  and  properties  of  the  model  to  be  discussed. 

In  the  first  report  each  task  was  carefully  explained  in  English  using  a  template  to  format 
the  information.  The  flow  of  information  between  tasks  was  modeled  using  dataflow  diagrams. 
While  this  conveyed  information  about  the  ordering  of  tasks,  it  gave  little  information  about  the 
characteristics  and  properties  of  the  information  being  manipulated. 

For  this  report  several  process  modeling  languages  were  examined.  Eventually,  it  was  decided 
that the level of representation offered by these languages was too detailed. These process
programming languages were aimed at implementing a process model. The current research is focused
on the re-engineering process at a higher level of abstraction. Finally, it was decided to use the
formal specification language Z [Spivey89]. The Z language was designed to state the properties
of  programs  and  data,  while  avoiding  design  or  implementation  details.  While  not  developed  for 
process  modeling,  it  has  proven  to  be  usable  in  specifying  the  properties  of  information  objects  and 
tasks  within  a  process  model. 


1.2  Software  Re-engineering 

Most  software  organizations  report  a  growing  backlog  of  maintenance  work.  In  many  organizations 
resources  are  being  diverted  from  new  development  projects  to  support  the  maintenance  of  existing 
systems. It is estimated that many software organizations spend between 60% and 75% of their
effort  on  maintenance.  Much  of  this  is  work  on  outdated  code  that  is  difficult  to  maintain,  but  is 
considered  valuable  by  its  users. 

As  a  software  system  is  maintained  over  a  period  of  years  the  logic  and  structure  of  the  system 
deteriorates.  Eventually,  the  system  becomes  difficult  to  maintain  and  requires  guru  personnel  who 
have  accumulated  a  deep  familiarity  with  the  system  and  its  quirks.  At  this  point  re-engineering 
of  the  software  is  often  considered. 

Software re-engineering, also called software re-juvenilization, reclamation, or recycling, is
gaining attention as the base of installed software ages and comes to need replacement. Most
organizations  have  invested  millions  of  dollars  in  their  software  and  are  not  willing  to  simply  throw 
it  out  and  build  completely  new  systems.  Those  organizations  have  a  great  deal  of  experience 
invested  in  their  systems  and  are  more  willing  to  modernize  a  system  than  to  obtain  a  new  one. 
Software  re-engineering  addresses  this  problem. 

Software  re-engineering  is  necessary  when  a  software  system  reaches  the  point  where  it  must  be 
retired or rewritten. Re-engineering is selected when the system is seen as valuable but too difficult
to maintain in its current state. This is a problem faced by software systems that originated 10,
20, or more years ago. Old age is not necessarily the only reason to re-engineer a system. Here are
some typical reasons:

• An organization has purchased a new computer. The new environment does not support the
programming language in which parts of the existing system are written, or the system must
be rewritten to use features provided by the new environment.

•  A  software  system  that  has  been  around  for  many  years  has  become  too  expensive  to  maintain. 
Such  systems  have  generally  evolved  far  past  their  original  functionality  and  the  code  has 
become  difficult  to  understand  or  change. 

• An organization has decided to start using modern software engineering practices, but
unfortunately, most of the organization’s work deals with already existing code that does not have
the  benefit  of  these  new  practices. 

• An organization plans to purchase modern CASE tools that support development and
maintenance. Unfortunately, such tools tend to work only on code initially developed with the
tool. 

• An organization has decided to switch to a modern programming language, and wants to
translate its existing code into the new language. An example is the DoD mandate to use
Ada.

Software  re-engineering  begins  with  an  existing  software  system  and  produces  a  new  modernized 
version of that system. Chikofsky and Cross defined software re-engineering as [Chikofsky90]:

The examination and alteration of a subject system to reconstitute it in a new form
and  the  subsequent  implementation  of  the  new  form. 

The term "re-engineering" implies some type of transformation. A software re-engineering
process should start with an existing system and generate a new high-quality system with valid
documentation. The definition of such a process can begin by comparing the characteristics of
existing systems (input to the process) with target systems (output from the process). In this way,
functional requirements can be identified. In addition, several process goals and constraints are
also  apparent. 

The  software  re-engineering  process  must  provide  the  capability  to  produce  a  new  system  that  is 
written in a different programming language than the existing system. One reason for re-engineering
a  system  can  be  to  switch  to  a  modern  programming  language.  Perhaps  parts  of  the  system  are 
written  in  different  languages  and  the  new  version  is  to  be  written  in  only  one  language. 

The process must provide the capability to re-design the system architecture. An existing system
typically  was  designed  before  modern  design  techniques  came  into  practice.  Modern  techniques 
such  as  information  hiding  and  object-oriented  design  are  intended  to  produce  systems  whose 
architecture  reflects  a  model  of  the  application  domain.  Designs  produced  by  these  techniques  tend 
to  be  easier  to  comprehend,  maintain,  and  change. 

The  process  must  provide  the  capability  to  incorporate  new  requirements  into  the  system. 
This means changes to the functionality of the system. This may refer to the ability to work on
a different machine architecture, interface with new equipment or systems, changes to the user
interface, addition of new capabilities, etc. It can also mean the removal of portions of the system
that support requirements that have been deleted. Or it can mean changes to the system to satisfy
changes to existing requirements.

The  process  must  provide  the  capability  to  generate  structured  source  code  even  when  the 
existing code is unstructured. This refers to the control-flow constructs used in the body of
a  routine. 

The  process  must  provide  the  capability  to  restructure  data.  The  capability  to  restructure 
data  aids  in  improving  the  understandability  and  maintainability  of  the  code.  Data  restructuring 
includes changing data structure definitions, removing duplicate data items, clarifying type
information, organizing data handling procedures, and re-organizing data files (data re-engineering).

The  process  must  provide  the  capability  to  document  the  system  on  both  the  program  and  design 
level.  Program  documentation  includes  information  such  as  where  an  item  is  defined  and  where 
it  is  used,  who-calls,  who-is-called,  item  descriptions,  etc.  Design  documentation  represents  the 
system  at  a  more  abstract  level,  and  shows  the  system  architecture  and  processing  characteristics. 
Design  documentation  should  document  both  high  and  low  level  design. 
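
To make the who-calls and who-is-called information concrete, the following is a minimal sketch of how such a cross-reference could be extracted automatically. It is written in Python and analyzes Python source with the standard `ast` module purely for illustration; the report does not prescribe a tool or a target language. The function name `build_cross_reference` and the sample program are assumptions of this sketch, and only direct calls by simple name inside top-level functions are recorded.

```python
import ast
from collections import defaultdict


def build_cross_reference(source):
    """Return (calls, called_by) maps for the top-level functions in `source`.

    calls[f]     -> names that function f calls        (who-calls)
    called_by[g] -> functions that call the name g     (who-is-called)
    """
    tree = ast.parse(source)
    calls = defaultdict(set)
    called_by = defaultdict(set)
    for node in tree.body:
        if not isinstance(node, ast.FunctionDef):
            continue
        # Walk the body of each function and record calls made by plain name.
        for inner in ast.walk(node):
            if isinstance(inner, ast.Call) and isinstance(inner.func, ast.Name):
                calls[node.name].add(inner.func.id)
                called_by[inner.func.id].add(node.name)
    return dict(calls), dict(called_by)


# Hypothetical program to be documented.
SAMPLE = '''
def read_record(path):
    return open(path).read()

def report(path):
    data = read_record(path)
    print(data)
'''

if __name__ == "__main__":
    calls, called_by = build_cross_reference(SAMPLE)
    for name in sorted(calls):
        print(name, "calls", sorted(calls[name]))
    for name in sorted(called_by):
        print(name, "is called by", sorted(called_by[name]))
```

Method calls such as `.read()` are ignored here because resolving them requires type information; a real re-documentation tool would need a fuller analysis.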

The  process  must  preserve  the  correctness  of  the  existing  system.  Preserving  the  correctness 
means  that  the  functionality  of  the  system  has  been  preserved,  with  the  exceptions  of  changes 
to  the  system  specified  by  requirement  changes.  The  requirement  that  correctness  be  preserved 
requires  that  the  process  either  guarantees  preservation  or  provides  a  method  to  verify  it. 

The  process  must  provide  the  capability  to  replace  existing  program  item  names  with  meaningful 
names.  This  is  the  ability  to  rename  any  variable,  procedure,  function,  macro,  data  type,  etc.  In 
addition  to  replacing  a  name,  the  process  must  aid  in  locating  and  replacing  all  occurrences  of  that 
name.  This  includes  both  source  code  and  documentation. 
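
The locate-and-replace part of this requirement can be sketched as follows. This is a deliberately simple, text-based illustration in Python: the function name `rename_identifier` and the file names are hypothetical, and the approach matches whole words only. It does not understand scope, string literals, or comments, so a production tool would need a parser for the implementation language.

```python
import re


def rename_identifier(texts, old, new):
    """Replace whole-word occurrences of `old` with `new` in every text.

    `texts` maps a file name to its contents (source code or documentation).
    Returns the updated texts and the number of replacements made per file.
    """
    # \b anchors ensure that renaming x1 does not touch x10 or max1.
    pattern = re.compile(rf"\b{re.escape(old)}\b")
    updated = {}
    counts = {}
    for name, text in texts.items():
        updated[name], counts[name] = pattern.subn(new, text)
    return updated, counts


if __name__ == "__main__":
    # Hypothetical source file and design note that share the name x1.
    files = {
        "main.c": "int x1 = 0; x1 += x10;",
        "design.txt": "x1 holds the count.",
    }
    updated, counts = rename_identifier(files, "x1", "record_count")
    for name in files:
        print(name, counts[name], updated[name])
```

Returning the per-file counts gives the engineer a record of where the name occurred, which supports the requirement that all occurrences be located as well as replaced.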

The  process  must  support  the  generation  of  a  system  that  is  organized  and  written  to  support 
understandability.  This  is  a  goal  and  not  a  functional  requirement.  Many  of  the  requirements 
above,  such  as  use  of  modern  design  techniques,  control-flow  structuring,  and  use  of  meaningful 
names  will  help  achieve  the  goal  of  understandability. 

The process must support the generation of source code that conforms to an in-house standard.
Such  a  standard  places  constraints  on  the  presentation  style  of  the  source  code,  naming  conventions, 
restrictions  on  implementation  language  constructs,  etc.  The  process  should  help  verify  that  such 
constraints  are  not  violated. 

Examining  these  requirements  it  becomes  clear  that  the  re-engineering  process  must  be  able 
to  manipulate  a  representation  of  the  system.  Further,  this  representation  must  be  more  abstract 
than  a  source  code  representation.  This  can  be  seen  by  examining  the  requirement  that  the  process 
must  be  able  to  manipulate  the  system  architecture  or  high-level  design.  Manipulating  the  system 
architecture by changing source code is difficult, partly because such manipulations must consider
the  implementation  details  of  the  system.  Modifications  can  be  more  easily  made  to  a  design 
representation  of  the  system.  A  design  representation  for  an  existing  system  can  be  constructed 
using  reverse  engineering  techniques. 

How should software re-engineering be done? The U.S. Federal Conversion Support Center,
Office of Software Development issued a set of guidelines for establishing software improvement
programs (SIPs) ([OSD/FCSC], [Houtz83]). These guidelines establish "what needs to be done,"
modernization of existing software to maximize its value, quality, efficiency, and effectiveness.
Modernization includes those activities necessary to upgrade the software and its engineering techniques
to current or state-of-the-art levels. A SIP preserves the value of past software investments and, at
the same time, increases the reliability, efficiency, portability, and maintainability of the software
to  create  an  improved  foundation  on  which  to  maintain  older  systems. 

The SIP approach is intended to establish a program for improving software within an
organization. A SIP is not a one-shot effort; it establishes an evolutionary approach to software
improvement. The software improvement approach may be used to improve existing software or
build new systems from existing software. When working with existing systems there are several
factors that influence or constrain the work. The characteristics of the existing system must be
considered. There may be other operational systems that are integrated with the existing system. The
testing  integrity  of  the  current  system  may  need  to  be  preserved  while  moving  to  a  new  or  improved 
system.  An  evolutionary  approach  may  be  necessary  because  of  a  need  for  usable  implementations 
at  each  stage  of  the  improvement  process. 

Another approach for improving software is Structured Retrofit, which is based on restructuring
techniques [Lyons81]. In one sense, restructuring is a limited form of re-engineering. Typical
restructuring techniques focus on control-flow restructuring of source code. This is the transformation
of unstructured source code into structured code by removing GOTOs and other unstructured
control-flow constructs.

The  basic  philosophies  behind  both  structured  retrofit  and  re-engineering  are  similar.  The  act 
of  changing  software  tends  to  destroy  or  obscure  its  structure.  This  makes  software  progressively 
more resistant to change. By examining the processes embodied in these and other approaches it
is  possible  to  discover  tasks  and  information  that  software  re-engineering  must  also  incorporate. 


1.3  Guide  to  Report 

The  focus  of  many  research  projects  in  software  re-engineering  is  the  development  of  techniques 
and  tools  that  support  a  programmer.  What  is  missing  is  an  investigation  of  the  deeper  properties 
of  software  re-engineering.  The  development  of  supporting  tools  without  an  understanding  of  the 
real  problems  that  need  to  be  addressed  will  not  lead  to  satisfactory  solutions  that  can  reduce  the 
complexity of a task.

What  is  needed  is  a  deeper  understanding  of  the  characteristics  of  software  re-engineering.  This 
deeper insight can be provided by two lines of investigation. The first is to examine and model the
re-engineering  process.  The  second  is  to  establish  a  formal  basis  for  re-engineering  techniques.  This 
research  focuses  on  the  first  line  of  investigation:  examine  and  model  the  re-engineering  process. 

The first objective of the research reported here is to identify a suitable language for representing
the software re-engineering process model. The second chapter presents the concept of software
process  modeling  and  process  modeling  languages.  The  language  chosen  to  express  the  process 
model  is  Z.  At  the  end  of  chapter  2  a  brief  description  of  Z  is  given.  Appendix  A  also  summarizes 
the  Z  notation  used  in  this  report. 

The second objective of the research is to model the software re-engineering process. Chapter
3 presents the software re-engineering process and its major phases. This is a high level discussion
of the activities involved with conducting a re-engineering project.

Within  the  software  re-engineering  process  the  first  major  phase,  project  analysis  and  planning, 
was examined in detail. Chapter 4 contains a formal specification of the information and
operations used within this phase. The tasks that must be done to successfully plan and prepare a
re-engineering project are also discussed.

Chapter 2


Process  Modeling  and  Languages 


The goal of this research is to develop a model of the software re-engineering process. The model
should contain enough detail to guide humans through the process, providing information about major
and minor tasks, their sequencing, and their steps. The process model developed must accurately
describe the process; otherwise it will be of little value.

An  issue  that  must  be  resolved  is  how  to  represent  the  software  re-engineering  process  model. 
English descriptions are neither precise nor concise. Diagrams omit too much important
information. A notation or language for expressing the model is required. There is a subfield of software
engineering  dedicated  to  researching  the  software  process.  The  specification  and  development  of 
process  modeling  languages  is  an  area  of  research  within  this  field. 


2.1  Software  Process  Research 

To  study  software  processes  it  is  necessary  to  realize  that  they  exist.  Programmers  use  a  process 
to  develop  software.  They  follow  a  process  for  specifying  requirements,  a  process  for  designing 
software,  a  process  for  implementing  it,  for  testing,  for  fixing  errors,  for  porting  software,  etc.  The 
result  of  a  software  process  is  a  software  product.  Organizations  often  have  procedures  or  guidelines 
that are followed when developing or maintaining software. Osterweil said, "...humans improvise
processes on the fly, maintain them mentally, modify them as necessary and guide others in their
effective use..." [Osterweil87]. By studying software processes their existence is recognized. These
processes  must  be  made  visible.  They  must  be  described  formally,  discussed,  reasoned  about,  and 
improved. 

A  software  process  model  is  a  means  of  expressing  a  software  process.  Researchers  who  study 
process  modeling  agree  that  if  you  are  concerned  about  the  quality  of  software  products,  then  you 
need to be concerned with the process by which they are produced [ISPW88]. Software process
models are seen as important vehicles for understanding and reasoning about software activities
[Williams88]. The ability to represent and model a software process is necessary to understand,
evaluate, reason about, and improve the process. Software process models are necessary to help make
software  activities  more  reliable  and  productive. 

Until recently, process models have focused mainly on management aspects [Curtis87]. Boehm
states that software process models are important because they provide guidance on the order of
major phases in a project. He believes that the primary functions of a software process model are
to  determine  the  order  of  phases  involved  in  software  development  and  evolution  and  to  establish 
the  transition  criteria  for  progressing  from  one  phase  to  the  next  [Boehm88].  This  point  of  view  is 
reflected  in  software  life-cycle  models. 
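
The two functions named above, fixing the order of phases and stating the criteria for moving from one phase to the next, can be illustrated with a small Python sketch. The phase names, the state keys such as `design_reviewed`, and the function `current_phase` are hypothetical choices made for this example; they are not taken from [Boehm88].

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

ProjectState = Dict[str, bool]


@dataclass(frozen=True)
class Phase:
    name: str
    # Transition criterion: true once the project may leave this phase.
    may_exit: Callable[[ProjectState], bool]


# Hypothetical phases and criteria, listed in their prescribed order.
PHASES: List[Phase] = [
    Phase("requirements", lambda s: s.get("requirements_approved", False)),
    Phase("design", lambda s: s.get("design_reviewed", False)),
    Phase("implementation", lambda s: s.get("tests_passed", False)),
]


def current_phase(phases: List[Phase], state: ProjectState) -> str:
    """Return the first phase whose transition criterion is not yet met."""
    for phase in phases:
        if not phase.may_exit(state):
            return phase.name
    return "complete"


if __name__ == "__main__":
    print(current_phase(PHASES, {}))
    print(current_phase(PHASES, {"requirements_approved": True}))
```

A model of this kind records only the end points of phases. It says nothing about the activities carried out inside a phase, which is the limitation of life-cycle models discussed in this section.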

The Waterfall model is perhaps the best-known software life-cycle model. The Waterfall
model and its variations describe the software process as a series of phases. Each phase ends with the
production of an artifact, such as a design document or source code. The Spiral model [Boehm88]
replaces the Waterfall approach with a spiral model of development that uses a risk-driven approach
rather than a document-driven or code-driven process. Iivari builds on this to create the Hierarchical
Spiral  Model  that  combines  the  spiral  model  with  the  PIOCO  model  [Iivari87]. 

The  problem  with  these  life-cycle  models  as  software  process  models  is  that  any  model  built  only 
on  the  end  points  of  major  phases  offers  little  insight  into  the  actions  and  events  that  take  place 
to produce the end point products [Curtis87]. These models enable management to set milestones
and  measure  progress,  but  provide  little  information  about  how  software  is  actually  created  or 
modified.  These  life-cycle  models  are  actually  high-level  process  models  that  are  not  elaborated 
to  lower  levels  of  detail.  Process  models  that  describe  lower  levels  of  detail  provide  the  means  to 
discuss  exact  procedures. 

What  is  a  process?  Osterweil  defines  a  process  as  a  systematic  approach  to  the  creation  of  a 
product  or  the  accomplishment  of  some  task  [Osterweil87].  Dowson  defines  a  software  process  as 
the  collection  of  related  activities,  seen  as  a  coherent  process  subject  to  reasoning,  involved  in  the 
production of a software system [Dowson86].

What is a software process model? Dowson defines a software process model as a purely descriptive
representation of the software process [Dowson86]. A software process model must represent
attributes of a range of particular software processes and be specific enough to allow reasoning
about them. Mi gives a different definition. Mi defines a software process model as a prescriptive
representation of software development activities in terms of their order of execution and resource
management [Mi90]. This difference in definition (are process models descriptive or prescriptive?) is
currently being debated, but most researchers seem to agree that software process models should
be prescriptive.

Processes  are  hard  to  comprehend  and  reason  about,  while  process  models,  as  static  objects, 
are  far  easier  to  comprehend.  A  process  is  a  vehicle  for  doing  a  task,  while  a  process  model  is 
a  specification  of  how  the  task  is  to  be  done.  If  a  software  process  model  is  prescriptive,  it  can 
be  written  as  a  process  program  that  guides  a  human  through  the  encoded  process.  This  is  the 
viewpoint held by many and best stated by Osterweil [Osterweil87]:

We  suggest  that  it  is  important  to  create  software  process  descriptions  to  guide  our  key 
software  processes,  that  these  descriptions  be  made  as  rigorous  as  possible  and  that  the 
processes then become guides for effective application of computing power in support
of the execution of processes instantiated from these descriptions.

Lehman argues against this approach [Lehman87]. He claims that the pursuit of prescriptive
models is useless, even meaningless, unless we have comprehensive knowledge about the process.

... process so far.

In  either  case,  it  is  agreed  that  languages  for  representing  software  process  models  are  needed. 
The question is whether we need software process modeling languages or software process
programming languages.

A software process meta-model is a representation formalism that provides the necessary
components to create various types of software process models [Mi90]. Katayama points out that a
key software process issue is the choice of formalisms. A formalism must provide a clear and
understandable description. It must be capable of evolving over a period of time. It must provide
the  necessary  ability  to  describe  the  behavior  of  the  process.  Two  important  behavioral  aspects  of 
software  processes  are  concurrent  processes  and  iteration  [Katayama89].  Formalisms  are  needed  to 
represent  information  about  the  software  product,  operations  on  the  software  product,  sequencing 
of  operations,  information  about  resources  including  project  personnel,  equipment,  and  facilities 
[Sutton89].  Humans  are  essential  components  in  the  software  process  and  their  roles  and  limitations 
must also be subject to definition and design [Gamalel-Din88].

The  software  process  is  shaped  by  many  factors.  The  individual  programmer  is  concerned 
with the process used to accomplish his work. A team of programmers is concerned with how
their work interacts. Management is concerned with the sequence of activities that must be done,
allocating the necessary resources, and tracking the advancement of the work. Even policies of
the organization can affect the software process. Because of these influences a software process
model can be viewed from a variety of perspectives [ISPW88]. These viewpoints include life-cycle,
organizational,  and  software  process  improvement-oriented.  The  value  of  any  view  is  to  provide 
insight  into  the  important  characteristics  of  software  processes  and  the  issues  that  must  be  addressed 
in  modeling  these  processes.  A  software  process  modeling  language  must  be  able  to  support  these 
differing  views. 

The language used to represent a prescriptive software process model should be enactable. The
term "enactable" was adopted by the 4th International Software Process Workshop [ISPW88] to
convey the concept of a "running" process while avoiding the connotations of machine execution that
terms like "executing" or "interpreting" suggest. A software process model written in an enactable
language is called a software process program. The activity of expressing software process models
with the aid of programming techniques is called process programming [Osterweil87]. The result
of running a process program is typically a software product program. The term product program
is used to denote a standard executable program written in a normal programming language such
as C, Fortran, or Lisp. A process program must be written in a process programming language.

How  does  a  process  programming  language  differ  from  a  product  programming  language?  In  a 
standard  programming  language  primitive  expressions  are  formed  from  a  set  of  basic  operators  that 
are  part  of  the  language.  In  a  process  programming  language  the  basic  operators  are  tools  and  these 
tools  are  not  part  of  the  language.  A  tool  can  be  an  automatic  tool  that  is  invoked  under  certain 
conditions  or  an  interactive  tool,  such  as  an  editor,  that  allows  a  human  to  do  some  necessary  task. 
Another difference is the data types supported by the language. Most programming languages
support a fixed set of primitive types such as character, integer, real, arrays, etc. A process
programming language must provide a rich typing system to allow programmers to specify complex
types for objects such as design documents, test plans, source code, cross reference information,
error reports, change requests, etc.
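
The two differences above can be illustrated with a minimal sketch in a conventional language.
The sketch is only an analogy, not a process programming language: Python dataclasses stand in
for the rich object types, and a name-to-function registry stands in for tools that live outside
the language. All names (DesignDocument, QATestPlan, derive_test_plan, TOOLS) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DesignDocument:
    """A structured software object rather than a primitive value."""
    title: str
    modules: List[str] = field(default_factory=list)


@dataclass
class QATestPlan:
    """Another complex object type: a test plan tied to a design."""
    design_title: str
    cases: List[str] = field(default_factory=list)


# Tools are not part of the "language"; they are registered by name and may be
# automatic functions or wrappers around interactive programs such as an editor.
TOOLS: Dict[str, Callable] = {}


def register_tool(name: str, tool: Callable) -> None:
    TOOLS[name] = tool


def derive_test_plan(design: DesignDocument) -> QATestPlan:
    """Hypothetical automatic tool: produce one test case per module."""
    return QATestPlan(design.title, [f"test_{m}" for m in design.modules])


register_tool("derive_test_plan", derive_test_plan)

design = DesignDocument("Payroll", ["parser", "report"])
plan = TOOLS["derive_test_plan"](design)
print(plan.cases)
```

The point of the registry is that the set of basic operators can change without changing the
language in which the process is written.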


A  prescriptive  software  process  model  provides  a  basis  for  structuring  software  environments. 
Most software environments currently supply a toolbox: a collection of tools from which
a user selects and uses what is needed. This is a loosely coupled architecture and provides little
means of ensuring that tools are used correctly or even at all [Ramanathan88]. There is a growing
opinion that an unstructured "bag of tools" does not qualify as a software environment. Dowson
defines a software environment as [Dowson86]:

A coordinated collection of software tools organized to support some approach to
software development or conform to some software process model.

The  collection  of  tools  is  coordinated  by  an  explicit  process  model.  The  model  is  not  built 
into  the  environment  but  is  represented  explicitly  in  a  process  programming  language  and  can  be 
changed  by  the  users.  The  ability  to  change  the  process  model  used  by  an  environment  provides 
much  flexibility.  The  local  organization  or  user  can  tailor  the  process  model  to  conform  to  local 
work  patterns  or  methods. 


2.2  Process  Modeling 

Why  should  we  develop  process  models?  Software  process  models  provide  the  means  to  make 
visible  the  processes  by  which  software  is  developed  and  maintained.  Software  process  models 
enable  increased  understanding  of  a  process  and  help  to  highlight  important  features  [Sutton89]. 
Kellner has identified four objectives for the development of software process models [Kellner89]:

1. Enable effective communications regarding the process being modeled.

2. Facilitate reuse of the process.

3. Support evolution of the process.

4. Facilitate management of the process.

Once a process model is developed, how should it be used? This question invokes the debate
on whether models should be descriptive or prescriptive. Descriptive models are defined to gain a
better understanding of a complex process by exposing some important characteristics, but without
actually providing any details on how the process is carried out. Prescriptive models provide more
or less firm guidelines on how the different actors of the process will work to achieve the goals. A
prescriptive model is, at least implicitly, based on a descriptive one.

A prescriptive software process model can be automatically analyzed and enacted by computers
[Osterweil87]; such a model is called a process program. This suggests that process programs
can serve as a means of specifying how the actions of software tools might be integrated with
human activities to support software processes. Thus a prescriptive software process model can be
written to define possible (allowable) patterns of behavior between humans and tools in a process
[ISPW88]. Enactability means that human beings involved in the software process can receive
computer guidance and help in what is an extremely complex activity.

2.2.1  What are we trying to model?


If  a  prescriptive  process  model  for  software  design  is  developed,  will  it  be  able  to  design  software 
when  enacted?  The  answer  is  no.  Ramanathan  has  pointed  out  that  a  large  portion  of  software 
development  is  creative  and  cannot  be  automated.  However,  interspersed  with  this  creative  activity 
is  much  planning,  control,  status  tracking,  reporting,  and  other  management  activities  for  which 
well-defined standards exist, but that are difficult to enforce manually [Ramanathan88]. A
process program can be used to guide programmers through the process while enforcing management
policies, yet providing humans with the necessary tools to accomplish their work.

This  represents  one  possible  view  of  the  use  of  a  software  process  model:  the  management  view. 
Mi  claims  that  effective  process  models  should  address  organizational  and  technical  dimensions 
including: 

1.  Detailed  descriptions  of  software  processes, 

2.  Their  interactions, 

3.  Management  and  exception  handling  during  the  performance  of  the  software  process, 

4.  Product-specific,  organization-specific,  and  project-specific  processes. 

Software  process  models  will  be  used  by  different  groups  of  people,  each  with  their  own  view  of 
what information is necessary. Three possible views are technical, managerial, and organizational.
Curtis, in his study of software development projects, found the following hierarchy of process factors
[Curtis87]: 

1.  Individual  concerns 

2.  Team  concerns 

3.  Project  concerns 

4.  Company  concerns 

The  individual  programmer  is  concerned  with  how  to  accomplish  his  task.  What  tools  should 
be  used?  What  steps  are  necessary?  How  should  each  step  be  done?  The  team  is  concerned 
with  interactions  among  the  team  members.  Interaction  includes  dividing  tasks,  communication 
between  team  members,  and  possible  resource  conflicts  between  members.  Project  level  concerns 
deal  with  scheduling  project  phases,  meeting  deadlines,  allocating  necessary  personnel  and  resources 
to  accomplish  the  project.  Finally,  the  company  affects  the  process  by  setting  company  goals, 
directions,  and  policies. 

There  seem  to  be  many  possible  foci  for  process  models.  Curtis  believes  that  the  most  valuable 
are  those  that  capture  the  processes  that  control  the  most  variance  in  software  productivity  and 
quality.  If  a  process  model  does  not  represent  the  processes  that  control  the  largest  share  of 
variability within a software process, it cannot be helpful in improving productivity and quality
[Curtis87].

To accommodate this variety of views it has been suggested that a single representation
capturing all the relations among objects and processes be created. Then tools could be used to extract
specific views on request [Taylor88]. However, it may be difficult to provide a single description that
is rich enough to support all the various views [ISPW88]. Despite these complications, there is
general agreement that a software process model must describe the context in which problem-solving
decisions are made.

A  software  process  model  represents  key  relationships  among  a  variety  of  types  of  software 
objects. These relationships are dependent on the "view" being used to examine the process
model.  Software  objects  are  complex.  Examples  of  software  objects  are  requirement  specifications, 
design  documents,  user  manuals,  cross-reference  information,  module  interface  specifications,  test 
plans,  test  results,  change  requests,  etc. 

A  prescriptive  process  model  performs  activities  on  objects.  Activities  are  done  with  tools. 
Tools  can  be  fully  automated  and  invoked  by  the  process  model,  or  interactive  and  used  by  a 
human  to  accomplish  a  task.  A  process  program  shows  how  the  various  software  tools  and  objects 
are  coordinated  to  support  a  process.  Software  tools  are  operators  used  by  a  model.  They  transform 
software objects. Humans are assigned well-defined roles in creating and transforming objects too.
The specification of what they do, when they do it, and how they coordinate with other humans
and with their tools is embodied in a process program.

There  are  several  dangers  in  process  programming.  For  example,  the  process  model  may  not 
map  accurately  to  real  process  behavior.  A  process  program  must  achieve  an  accurate  model  of 
behavior.  Existing  models  of  the  software  development  process  do  not  provide  enough  insight 
into  real  development  processes  to  guide  research  on  the  most  effective  development  technologies 
[Curtis87]. One solution is to maintain a history of process usage to form a trace. Analysis of a
process trace can pinpoint problems with the process and highlight successful processes used by
programmers.
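
As a hedged illustration of trace analysis, the sketch below counts how often work moved backward
relative to an intended ordering of steps. Frequent backward transitions are one possible signal
that the modeled process does not match real behavior. The step names, the intended order, and the
function name are assumptions made for the example and do not come from the cited work.

```python
from collections import Counter
from typing import List

# Assumed intended ordering of process steps (hypothetical).
INTENDED_ORDER = ["design", "code", "test", "release"]


def backward_transitions(trace: List[str]) -> Counter:
    """Count transitions that return to an earlier step of the intended order."""
    rank = {step: i for i, step in enumerate(INTENDED_ORDER)}
    back: Counter = Counter()
    for prev, cur in zip(trace, trace[1:]):
        if rank[cur] < rank[prev]:
            back[(prev, cur)] += 1
    return back


# A recorded history of process usage for one programmer (hypothetical data).
trace = ["design", "code", "test", "code", "test", "code", "test", "release"]
print(backward_transitions(trace))
```

For this trace the only backward transition is from test to code, which occurs twice.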

2.2.2  Developing  Software  Process  Models 

How is a process model created? Taylor reported that in an effort to gain experience with process
programming, process programs were written for various software processes. While the intent was
to gain experience with process programming, it soon became clear that it was necessary to develop
process requirement specifications and designs first [Taylor88]. He also "discovered" that process
designs proved to be more useful in understanding the processes than the process code itself. This
agrees with statements by others that a life-cycle for software process programs (or models) exists.
It has been proposed that methodologies used for system analysis and design should be considered
as suitable approaches to process model development [Kellner89].


2.3  Process  Programming  Languages 

A language for representing process models is necessary. For prescriptive process models the chosen
language must be enactable, i.e., it must have an execution mechanism. Unfortunately, the
specification of a language depends on an accumulation of experience with process modeling or process
programming. What features does such a language need? Rapid progress in process programming
cannot be achieved until a suitable language for process programming exists [Osterweil87].

The  4th  International  Software  Process  Workshop  focused  attention  on  languages  and  notations 
in  which  formal  models  of  software  processes  could  be  represented  and  enacted  [ISPW88].  There 
was  a  general  consensus  that  process  programs  are  likely  to  be  inherently  complex.  The  concern 
for  expressive  range  in  a  process  programming  language  must  be  balanced  against  concern  that 
notational complexity does not add to the already high inherent complexity of process models.

Boehm  applied  process  programming  to  his  Spiral  Model  [Boehm88]  and  reported  the  following 
lessons learned about process programming [Boehm89]:

1.  A  model  for  the  meaning  of  the  process  program  is  essential. 

2.  Data  representation  issues  are  significant. 

3.  Relationships  among  data  objects  are  a  central  issue.  Relationships  must  be  made  explicit. 

4. Data visibility issues are significant.

5.  Change  of  the  process  during  its  elaboration  is  critical  (dynamic  properties). 

Can experience with developing product programming languages be applied to developing
process programming languages? Indeed, can product programming languages be used as process
programming languages? The general consensus is that while much can be taken directly from
product programming languages, these languages are not well suited for process programming.

Osterweil believes that the primary difference between process programs and conventional
product programs is that process programs represent programming in a non-traditional application
domain [Osterweil87]. Curtis believes that a difference between process programs and product
programs is the variability of the underlying process being specified [Curtis87]. In a conventional
programming language the basic operators are well defined. In a process programming language
the basic operators will be tasks that are process specific and carried out by automated tools or
humans. The 4th International Software Process Workshop suggested that the relative differences
between process programs and product software include much less code in process programs, an
increased amount of instantiation, and more concern for failures in instantiation and composition.

2.3.1  Data  Support 

Software  process  programs  specify  how  software  objects  are  to  be  systematically  developed  or 
modified.  Software  objects  include  much  more  than  just  source  code.  Other  possible  objects  are 
requirement  specifications,  designs,  documentation,  user  manuals,  test  cases  and  test  results,  test 
plans,  object  code,  executable  code,  change  requests,  versions  of  objects,  etc.  Software  objects  are 
complex  data  aggregates. 

Software processes manipulate information. This information differs from that needed for
product programs in several ways such as content, structure, and the properties of the supporting
operations. Software information objects are stored in an information base. The list of
information and support provided by the information base includes: objects of varying sizes, varying
degrees of persistence, nested transactions, very long transactions, complex and programmable
relations among objects, triggering mechanisms, automatic inferencing mechanisms, dynamic types
and schemas, multi-user sharing with associated locking mechanisms, versioning, query languages,
and partitioning and view mechanisms.

Normally,  such  information  is  maintained  in  a  database.  The  connection  between  a  process 
program  and  the  database  that  serves  as  a  repository  for  the  objects  and  relationships  included 
in  the  software  process  is  important.  For  a  process  program  this  database  serves  as  part  of  the 
program  store.  Because  of  this  the  connection  between  the  language  and  the  information  base  must 
be  considered.  The  requirements  for  the  information  store  must  be  carefully  considered.  They  are 
richer  than  for  any  other  known  application  area. 

2.3.2  Survey  of  Process  Programming  Languages 

Because  of  the  lack  of  an  accepted  process  programming  language,  research  projects  that  investigate 
software  processes  often  develop  their  own  notation  or  language  to  express  a  process  model  or 
program. These languages vary greatly in level of expression, ranging from high-level specification
style languages to lower-level traditional programming style languages. The languages span a range
of styles such as imperative, object-oriented, and rule-based.

The following subsections summarize several process modeling and process programming
languages that have been reported in the literature. Available information for these languages ranges
from short articles to fully implemented systems. Many of these languages are experimental. The
suitability of each language was considered for use in this research work. The criteria for choosing
a language are given in the next section.


IT  :  The  Meta-Model 

Wileden proposed a high-level modeling representation called IT [Wileden86]. This meta-model
formalism is based on two abstract spaces called the I - information space, and the T - transformation
space. Transformations are functions that take information and produce information. For example,
a simple Waterfall model can be expressed by dividing the information space into a requirements
space, design space, and code space. Transformations can then be defined to take information from
the requirements space and generate information in the design space, and similarly from the design
space to the code space.
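
The Waterfall example can be sketched in a few lines. This is an illustrative reading of the idea,
not Wileden's notation: the information space is modeled as a dictionary of named subspaces, and a
transformation is an ordinary function applied to every item of a source subspace. The helper name
apply_transformation and the sample content are assumptions.

```python
from typing import Callable, Dict, List

InfoSpace = Dict[str, List[str]]


def apply_transformation(space: InfoSpace, source: str, target: str,
                         fn: Callable[[str], str]) -> InfoSpace:
    """Return a new information space whose target subspace is derived from source."""
    new_space = dict(space)
    new_space[target] = [fn(item) for item in space[source]]
    return new_space


# Information space partitioned as in a simple Waterfall model.
space: InfoSpace = {"requirements": ["R1: compute payroll"], "design": [], "code": []}

# Two transformations: requirements -> design, then design -> code.
space = apply_transformation(space, "requirements", "design",
                             lambda r: f"module for <{r}>")
space = apply_transformation(space, "design", "code",
                             lambda d: f"source of {d}")
print(space["code"])
```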


HFSP  :  Hierarchical  and  Functional  Software  Processes 

Katayama introduced a formalism for hierarchical and functional software process descriptions
called HFSP ([Katayama89], [Katayama81]). HFSP defines software processes through hierarchical
activity decomposition. The basic principle of HFSP is to focus on the product-base
characterization of activities and their hierarchical decomposition. HFSP is founded on two fundamental
concepts, (1) design activity and (2) activity decomposition. Its formalism is derived from attribute
grammars. HFSP can be characterized as a specification style language. Process models expressed
in HFSP can be enacted.

In HFSP an activity is the unit of task in a software process. An activity is assumed to be
completely characterized by its input and output attributes. An activity may be simple or a complex
task that must be decomposed into subtasks. If an activity is simple enough to be performed by
invoking tools, its execution is left to the human activity of doing the job by using the tools. If it
is not, the activity must be decomposed into sub-activities. Activity decomposition must specify
how an activity is decomposed into other activities and what relationships hold among attributes
of the activities involved.

Concurrency is essential for describing design processes for large software products, because
software design is usually performed by a group of designers working together. In HFSP,
concurrency is expressed through attribute dependency. Nondeterminacy is useful in describing design
alternatives. HFSP expresses nondeterminacy through decomposition conditions.
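
To make the link between attribute dependency and concurrency concrete, the following sketch
groups sub-activities into stages. It is not HFSP syntax; it only assumes, as described above, that
each activity is characterized by its input and output attributes. Activities placed in the same
stage do not depend on each other's outputs and could therefore proceed concurrently. The activity
and attribute names are hypothetical.

```python
from typing import Dict, List, Set, Tuple

# activity name -> (input attributes, output attributes)
Activity = Tuple[Set[str], Set[str]]


def schedule(activities: Dict[str, Activity], available: Set[str]) -> List[List[str]]:
    """Group activities into stages based only on attribute dependencies."""
    remaining = dict(activities)
    known = set(available)
    stages: List[List[str]] = []
    while remaining:
        # An activity is ready when all of its input attributes are known.
        ready = sorted(n for n, (ins, _) in remaining.items() if ins <= known)
        if not ready:
            raise ValueError("unsatisfiable attribute dependency")
        for name in ready:
            known |= remaining.pop(name)[1]
        stages.append(ready)
    return stages


# Hypothetical decomposition of a design activity into three sub-activities.
design = {
    "design_ui": ({"spec"}, {"ui_design"}),
    "design_db": ({"spec"}, {"db_design"}),
    "integrate": ({"ui_design", "db_design"}, {"system_design"}),
}
stages = schedule(design, {"spec"})
print(stages)
```

Here the two design sub-activities share the first stage, and integration must wait for both.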


SPM  :  Software  Process  Model 

Williams  presented  an  approach  to  software  process  modeling  based  on  behavioral  descriptions  of 
software activities [Williams88]. Behavioral descriptions make it possible to describe a process at
any  desired  level  of  abstraction.  The  use  of  abstraction  assists  in  handling  aspects  of  a  process  that 
are  poorly  understood.  SPM  style  specifications  are  used  for  descriptive  process  modeling. 

The  Behavioral  Model  describes  a  software  process  as  a  collection  of  activities  or  processes  that 
can  take  place  concurrently  and  asynchronously.  The  model  does  not  assume  that  information  is 
"shared"  among  activities  or  the  individuals  performing  them.  A  process  is  described  in  terms  of 
abstractions.  The  approach  is  behavioral  because  descriptions  of  activities  focus  on  the  effects  that 
the  activities  produce  rather  than  the  specific  procedures  used  to  produce  those  effects. 

A  process  model  developed  using  this  approach  is  described  by  a  set  of  activities.  Each  activity 
has  a  set  of  preconditions,  an  action,  a  set  of  postconditions,  and  a  set  of  messages.  Activities  may 
be performed by different individuals and at different locations. Activities may be built up from
simple  activities.  Activities  can  be  ordered  sequentially  or  in  parallel. 

All preconditions must be satisfied before an activity can be started. At least one postcondition
will be true when the activity ends. Preconditions and postconditions are not used to prove
particular properties of the software process. Instead, they represent assertions about the properties
of various activities and the interconnections among those activities.

An  action  is  a  software  task  that  may  be  performed  by  invoking  a  particular  tool  or  sequence  of 
tools  or  may  be  carried  out  by  a  human  without  automated  assistance.  An  action  may  have  several 
preconditions,  all  of  which  must  be  satisfied  before  that  action  can  be  performed.  An  action  may 
have  several  postconditions,  only  one  of  which  will  be  satisfied  when  the  task  is  completed. 
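
The activity structure described above can be sketched as follows. This is a minimal illustration
in Python rather than SPM notation, and it omits messages. The class, the state dictionary, and the
review example are assumptions. The check follows the rule that all preconditions must hold before
the action and at least one postcondition must hold afterward.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

State = Dict[str, bool]
Condition = Callable[[State], bool]


@dataclass
class Activity:
    name: str
    preconditions: List[Condition]
    action: Callable[[State], None]
    postconditions: List[Condition]

    def perform(self, state: State) -> None:
        # Every precondition must be satisfied before the activity starts.
        if not all(p(state) for p in self.preconditions):
            raise RuntimeError(f"{self.name}: precondition not satisfied")
        self.action(state)
        # At least one postcondition must be true when the activity ends.
        if not any(q(state) for q in self.postconditions):
            raise RuntimeError(f"{self.name}: no postcondition holds")


def review_action(state: State) -> None:
    """Hypothetical action: approve the code only if its tests pass."""
    state["approved"] = state.get("tests_pass", False)
    state["rejected"] = not state["approved"]


review = Activity(
    name="review",
    preconditions=[lambda s: s.get("code_written", False)],
    action=review_action,
    postconditions=[lambda s: s.get("approved", False),
                    lambda s: s.get("rejected", False)],
)

state = {"code_written": True, "tests_pass": False}
review.perform(state)
print(state)
```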

Messages  are  used  for  communication  and  synchronization  between  activities.  They  also  provide 
a visibility mechanism for project management in that messages can be associated with project
milestones. 

Specification of the particular activities to be included and the order in which they are to be
performed defines a particular software process. The set of possible activities forms an "alphabet"
for software process description and describes a class of software creation and evolution activities. A
particular process is defined using a language (the process description language) over this alphabet.
A given software project then corresponds to an individual string in this language.

A software process description may be written using the constrained expression formalism. A
constrained expression describing the possible behavior of a software process consists of a process
expression and a set of constraints. The process expression typically describes activities of parts
of a process without regard to their potential interactions with other parts. The constraint set
imposes restrictions on how events from various parts of the process expression may be combined into
possible behaviors. The use of the constrained expression formalism provides several advantages.
First, this notation allows a concise description of a software process, including concurrent
activities. Second, the constrained expression can be analyzed to determine whether a given process
description produces a correct, or acceptable, sequence of activities.

This approach provides the capability to explicitly model human actions as well as tool
invocations. Adding messages makes communication among activities explicit. The formal approach
used in SPM makes it possible to analyze software process descriptions using techniques developed
for event-based descriptions of software systems.


CML  :  Conceptual  Modeling  Language 

The  TRIAD  project  uses  the  CML  language  (Conceptual  Modeling  Language)  to  specify  process 
programs  ([Ramanathan88],  [Ashok89]).  A  project  model  for  a  TRIAD  environment  is  composed 
of  four  related  models:  a  process  model,  data  model,  tool  model,  and  user  model.  CML  has 
features  for  process,  data,  tool,  and  user  modeling.  The  representational  requirements  of  these  four 
models  are  different  from  each  other.  However,  there  is  a  substantial  amount  of  interaction  between 
the  models.  Therefore,  all  the  modeling  primitives  are  unified  under  one  language  formalism.  This 
research  project  is  developing  a  compiler  that  translates  project  specific  descriptions  written  in  CML 
into intermediate representations that can be interpreted by generic interpreters that constitute the
runtime  system  of  an  environment. 

The  process  modeling  formalism  in  CML  is  intended  to  represent  those  well-defined  aspects 
of  the  software  process.  The  process  modeling  formalism  provides  primitives  for  representing  the 
following  features  of  the  software  process: 

1.  The  representation  of  each  project  related  activity  as  a  progression  from  states  to  states. 

2.  The  representation  of  activities  at  various  levels  of  abstraction. 

3. The concept of ownership (responsibility) for an activity.

4.  Communication  between  agents  performing  activities. 

A  project  specific  process  model  contains  definitions  of  activities  and  the  relationships  between 
them.  Each  activity  has  a  precondition,  a  postcondition,  zero  or  more  attributes  (which  include 
references to database objects and tools that are used during that activity), zero or more
subactivities, several local state variables, and an action part containing imperative code. Activities are
organized  into  a  hierarchy  by  subactivity  relationships,  representing  an  elaboration  of  activities  to 
the  desired  level  of  detail. 

An activity may begin if its precondition is true. The action part of an activity can contain
statements to manipulate the database, interact with the user, invoke tools, or manipulate the
process model. Some examples of statements that manipulate the process model are create_task,
delete_task, wait_user, send_user, start_subactivity, and wait_subactivity.
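
The shape of such an activity can be approximated in Python. The sketch is not CML and does not
reproduce its statements; it only mirrors the description given here: a precondition, a
postcondition, an action part, and subactivities arranged in a hierarchy. The class, the enact
method, the dictionary used as a database, and the sample activities are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Database = Dict[str, object]


@dataclass
class Activity:
    name: str
    precondition: Callable[[Database], bool]
    postcondition: Callable[[Database], bool]
    action: Callable[[Database], None]
    subactivities: List["Activity"] = field(default_factory=list)

    def enact(self, db: Database, log: List[str]) -> None:
        # An activity may begin only if its precondition is true.
        if not self.precondition(db):
            log.append(f"{self.name}: waiting")
            return
        self.action(db)
        for sub in self.subactivities:
            sub.enact(db, log)
        status = "done" if self.postcondition(db) else "incomplete"
        log.append(f"{self.name}: {status}")


write = Activity("write_code",
                 precondition=lambda db: "design" in db,
                 postcondition=lambda db: "code" in db,
                 action=lambda db: db.update(code="main.c"))
test = Activity("test_code",
                precondition=lambda db: "code" in db,
                postcondition=lambda db: db.get("tested") is True,
                action=lambda db: db.update(tested=True))
implement = Activity("implement",
                     precondition=lambda db: "design" in db,
                     postcondition=lambda db: db.get("tested") is True,
                     action=lambda db: None,
                     subactivities=[write, test])

db: Database = {"design": "design.doc"}
log: List[str] = []
implement.enact(db, log)
print(log)
```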

A  formalism  for  data  modeling  is  built  into  CML.  This  formalism  is  based  on  an  object-oriented 
semantic data model and has been developed for modeling software objects. Besides providing
primitives for modeling the structure of software objects and interobject relationships, a rule formalism
is  also  provided  to  express  domain-specific  semantics  of  the  objects  and  relationships. 

Appl/A  :  A  Process  Programming  Language 

The  Arcadia  research  project  is  exploring  several  issues  related  to  the  development  of  process- 
oriented  environments  [Taylor88].  One  research  issue  is  the  development  of  a  suitable  language 
for  process  programming.  The  language  emerging  from  this  research  is  Appl/A,  a  superset  of  Ada 
that  enables  the  definition  of  relations  among  software  objects.  Ada  was  chosen  as  a  base  language 
because it seemed to support many of the capabilities envisioned as necessary in process
programming. Ada was enhanced with a relation capability to provide a means to explicitly represent
interconnections among complex objects.

Appl/A  supports  the  definition  of  relations  as  sets  of  arbitrary  tuples  of  software  objects.  It 
enables  users  to  specify  just  how  the  various  components  of  these  tuples  should  be  related  to  each 
other  and  how  the  consistency  of  these  components  can  be  verified  and  maintained. 
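
A relation of this kind can be sketched outside Ada as a set of tuples paired with a consistency
rule. The code below is an illustration in Python, not Appl/A syntax. The Relation class, the
version field, and the rule that object code must match the version of its source are assumptions
made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Set, Tuple


@dataclass(frozen=True)
class SoftwareObject:
    name: str
    version: int


ObjectTuple = Tuple[SoftwareObject, ...]


class Relation:
    """A set of tuples of software objects with a user-supplied consistency rule."""

    def __init__(self, consistent: Callable[[ObjectTuple], bool]) -> None:
        self.tuples: Set[ObjectTuple] = set()
        self.consistent = consistent

    def insert(self, *objects: SoftwareObject) -> None:
        self.tuples.add(objects)

    def violations(self) -> List[ObjectTuple]:
        """Return the tuples that currently break the consistency rule."""
        return [t for t in self.tuples if not self.consistent(t)]


# Hypothetical rule: object code must be built from the current source version.
compiled_from = Relation(lambda t: t[0].version == t[1].version)
compiled_from.insert(SoftwareObject("main.ada", 2), SoftwareObject("main.o", 2))
compiled_from.insert(SoftwareObject("util.ada", 3), SoftwareObject("util.o", 1))

stale = compiled_from.violations()
print(stale)
```

Only the util pair is reported, since its object code lags behind its source.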

To gain experience with Appl/A it has been used to write prototypes of several process programs.
This experience has led to the identification of several other necessary features. One such feature
is the need for a type hierarchy. As an example, the different nodes of a requirement element may
be of different types, but they also share some common features and they can be better modeled as
being subtypes of a common parent type. Experience with Appl/A has led to the impression that
a strictly imperative, algorithmic language is not likely to be suitable as a process programming
language. Some software activities seem best described with a declarative or rule-based paradigm.


2.4  Selecting  a  Language 

The current research into the software re-engineering process requires a suitable notation for
recording and expressing the knowledge learned about software re-engineering. The current research
focuses on two issues. The first issue is to identify the information that is fed into the re-engineering
process  and  the  information  that  is  created  and  used  within  the  process.  The  goal  is  to  identify 
these  information  objects,  determine  their  properties,  and  map  their  interrelationships.  Further, 
the  operations  that  are  performed  on  these  information  objects  are  to  be  identified  and  described. 
The  second  issue  is  to  identify  the  tasks  that  are  done  within  the  re-engineering  process.  Each  task 
is  associated  with  a  collection  of  information.  Tasks  operate  on  information,  and  tasks  interact 
with  other  tasks.  All  of  this  must  be  expressible  within  the  process  modeling  language  chosen. 

To select a language for process modeling, the features of the languages presented in the previous
section were considered. For each language a small process model was developed to obtain a feel
for the expressiveness, clarity of expression, and adequacy of the language. The current research
needs were compared with the capabilities of the various languages.

The  major  criterion  used  to  evaluate  the  languages  was  the  expressive  power  of  the  language. 
A  process  model  is  rich  in  detail  and  a  language  must  provide  suitable  constructs  for  expressing 
process  information.  Two  major  concerns  are  the  ability  to  express  data  information  and  task 
information. 

A  modeling  language  must  be  able  to  express  the  complex  properties  of  the  information  (data) 
that  is  created,  used,  and  manipulated  within  a  process.  A  language  must  express  the  elements  that 
form  an  information  object  and  the  type  of  information  associated  with  each  object.  Properties  of 
these elements and their relationships with one another must be expressible. It was found that the
range  of  expressiveness  of  data  information  varied.  Some  languages  focused  on  representing  data 
information  while  others  focused  more  on  representing  task  information.  A  few  languages,  such  as 
CML,  provided  good  support  for  both. 

A  modeling  language  must  be  able  to  express  the  details  of  each  task  within  a  process  and  the 
relationships and dependencies between tasks. A process consists of many tasks. Further, there
is  often  a  hierarchy  among  tasks,  with  some  tasks  representing  major  process  activities  and  other 
tasks  representing  minor  activities.  Tasks  have  many  details  associated  with  them,  such  as  the 
conditions  under  which  a  task  can  begin,  who  is  responsible  for  the  task,  who  is  assigned  to  the 
task,  the  steps  or  procedures  followed  by  the  task,  etc.  Tasks  may  depend  on  human  effort  and/or 
be  supported  by  tools.  The  modeling  language  must  be  able  to  express  this  information.  All  of  the 
languages provide constructs for expressing task information. Their expressiveness ranged
from high-level abstractions of task sequencing to detailed representations of task processing steps.
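The task details listed above can be pictured as a small data structure. The following Python sketch is illustrative only and is not taken from any of the languages surveyed; the class name, the field names, and the example tasks are all assumptions made for the illustration.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Task:
    """One process task. Every field name here is a hypothetical choice."""
    name: str
    responsible: str                                        # who answers for the task
    assigned: List[str] = field(default_factory=list)       # who performs the task
    depends_on: List[str] = field(default_factory=list)     # tasks that must finish first
    tools: List[str] = field(default_factory=list)          # supporting tools, if any
    steps: List[str] = field(default_factory=list)          # procedure followed by the task
    subtasks: List["Task"] = field(default_factory=list)    # hierarchy: major -> minor

    def can_begin(self, finished: Set[str]) -> bool:
        """Start condition: every task this one depends on has finished."""
        return all(dep in finished for dep in self.depends_on)

    def flatten(self) -> List[str]:
        """Names of this task and of all minor tasks below it, depth first."""
        names = [self.name]
        for sub in self.subtasks:
            names.extend(sub.flatten())
        return names


# Hypothetical example: one major task with two minor tasks.
analyze = Task("Analyze source", responsible="analyst")
reconstruct = Task("Reconstruct design", responsible="designer",
                   depends_on=["Analyze source"])
major = Task("Reverse engineering", responsible="manager",
             subtasks=[analyze, reconstruct])
```

A record of this kind captures the data side of a task (who, with what, after what) but says nothing about how the steps are carried out, which is the level of abstraction the current research is concerned with.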

The modeling language must support different levels of abstraction. Because the current
research is examining the properties of the process as opposed to detailed processing steps, a high
level of abstraction is required. However, the ability to lower the abstraction level in the future
and stay within the same language would be convenient. After all, an important result of
experience with process modeling is that a design of a process can reveal more about the process than a
detailed process program. Most languages provided only one abstraction level. This level ranged
from high-level descriptive modeling to detailed process programming.

Finally, the language chosen must be available for use. Almost all process modeling languages
are  experimental.  While  descriptions  of  these  languages  can  be  found,  implementations  or  running 
systems  are  rare.  Even  complete  language  descriptions  can  be  hard  to  obtain. 

Given these considerations, and after reviewing the example process models created with each
language, the language CML (Conceptual Modeling Language) developed for the TRIAD project
was deemed the most suitable. This language has the expressive power needed and operates within
a process modeling environment. Further, CML, which was originally developed and reported in
several PhD theses, has been converted into a commercial product available from Universal Energy
Systems.

However, while the authors liked the expressiveness of CML the best, it was decided that this
language would not be used. First, the commercial system based on this language was not available
to the authors. If CML were used to record the process model, there would be no tools for processing
the model. Second, since CML is not widely known, its usefulness in conveying the process details
to others would be limited. Third, CML is a process programming language, and as such its level of
abstraction is low, since executable process information must be given. At best the current research
would only result in a skeleton of a CML process model.

Since the current research is focusing on the information needs and major tasks of the
re-engineering process, it was decided to use a formal specification language. A formal specification
language provides the expressiveness to convey what a process does and properties of information
objects and operations on these objects, without forcing a description of how the operations and
tasks are done. A formal language permits information to be expressed concisely and with precision.
Further, reasoning about process properties is facilitated by a formal language. Finally, there are
several well-known formal specification languages, so a wider audience will be able to read and
use the process model specification.

The language chosen to express the process model is the specification language Z. Z is a formal
specification language based on typed set theory [Spivey89]. Z gains its simplicity and expressive
power by using directly the well developed notations of mathematics. Although Z was originally
developed for the formal specification of software, the authors have found its expressive power
suitable for specifying properties of the re-engineering process.


2.5  The  Z  Specification  Language 

Since  Z  is  based  on  both  set  theory  and  predicate  calculus  it  has  the  dual  advantage  of  using  a 
mature  mathematical  notation  and  of  being  widely  accessible  to  those  with  some  mathematical 
background.  Z  uses  mathematical  notation  to  describe  in  a  precise  way  the  properties  which  a 
software  system  must  have,  without  describing  how  these  properties  are  achieved.  A  Z  specification 
is  mathematical:  the  variables  that  appear  in  a  specification  range  over  mathematical  objects.  A 
Z  specification  expresses  a  mathematical  model.  This  characteristic  of  Z  specifications  has  proved 
useful  in  capturing  and  expressing  the  properties  of  the  software  re-engineering  process. 

Every  mathematical  expression  which  appears  in  a  Z  specification  is  given  a  type.  This  type 
determines a set known to contain the value of the expression. Each variable is given a type by its
declaration.  This  use  of  mathematical  data  types  to  model  the  data  in  a  system  makes  it  possible 
to  reason  effectively  about  how  a  specified  system  will  behave.  Z  uses  predicate  logic  to  describe 
abstractly  the  effect  of  operations  on  data. 

Another  main  element  of  a  Z  specification  is  the  use  of  schemas.  A  schema  is  used  to  specify  a 
collection  of  variables,  each  with  a  type,  and  predicates  on  these  variables  that  specify  properties 
of the variables and relate the variables to one another. A typical schema has the form:

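 __ SchemaName ______________________
|  Variable declarations
|____________________________________
|  Predicates
|____________________________________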
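For readers who think more easily in program terms, the two parts of a schema can be loosely mirrored in Python. This is only an analogy and not part of Z: a Z schema is a mathematical description and is not executed, and Z types are sets rather than Python classes. The class `Schema`, the method `satisfied_by`, and the example schema `TaskRecord` are all hypothetical names introduced for this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Schema:
    """Loose stand-in for a Z schema: typed declarations plus predicates."""
    name: str
    declarations: Dict[str, type]               # variable name -> Python type
    predicates: List[Callable[[dict], bool]]    # each one constrains a binding

    def satisfied_by(self, binding: dict) -> bool:
        """True when the binding is well typed and every predicate holds."""
        if set(binding) != set(self.declarations):
            return False
        well_typed = all(isinstance(binding[var], typ)
                         for var, typ in self.declarations.items())
        return well_typed and all(pred(binding) for pred in self.predicates)


# Hypothetical schema: a task has a set of assigned people, and the
# responsible person must be one of them.
task_schema = Schema(
    name="TaskRecord",
    declarations={"assigned": frozenset, "responsible": str},
    predicates=[lambda b: b["responsible"] in b["assigned"]],
)

ok = task_schema.satisfied_by(
    {"assigned": frozenset({"ann", "bob"}), "responsible": "ann"})
```

The declarations play the role of the upper part of the schema box and the predicates the role of the lower part; a binding of values to the declared variables either satisfies the schema or does not.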


This report assumes basic familiarity with Z and set theory. Two good references on Z are
"The Z Notation: A Reference Manual" by Spivey [Spivey89] and "Specification Case Studies" by
Hayes [Hayes87]. A summary of Z notation used in this report is given in Appendix A.


Chapter 3


Software  Re-engineering  Process 


The  realization  that  software  development  projects  follow  a  definable  sequence  of  phases  was  an 
important  discovery  within  software  engineering.  The  resulting  Waterfall  model  was  much  talked 
about  and  many  organizations  adapted  it  for  their  use.  As  time  went  by,  it  was  discovered  that  the 
Waterfall  model  did  not  fully  model  the  interactions  between  software  development  phases.  Indeed, 
it  was  even  realized  that  alternative  models  of  software  development  existed. 

Like the Waterfall model for software development, there is a traditional model for software
re-engineering. The re-engineering model is based on three phases. It is a high-level, abstract
model,  devoid  of  most  information  about  the  process.  Like  the  Waterfall  model  the  traditional 
software  re-engineering  model  focuses  on  the  technical  aspect  of  the  process  and  fails  to  mention 
other  aspects  such  as  management  or  support. 

One contribution of this research is an emphasis on studying the complete process of software
re-engineering. A successful software re-engineering methodology or a useful environment for software
re-engineering must be based on an understanding of the complete process, not just a portion of
it. The emphasis on technical solutions evident in many research papers shows a lack of attention
to the full range of concerns that occur during a software re-engineering project.

In the next section the traditional software re-engineering process model is presented. In section
3.2 a more complete model of the software re-engineering process is presented. Both process models
are  presented  at  a  high  level  of  abstraction.  Chapter  4  gives  a  more  detailed  specification  of  one 
phase  identified  in  the  new  process  model. 


3.1  Traditional  Re-engineering  Model 

The goal of software re-engineering research is to develop the means to take an existing software
system and generate from it a new system that benefits from modern software engineering practices
and is functionally equivalent to the original system. In truth, a new system produced by
re-engineering may not be functionally equivalent to the original system. Changes in the requirements
for a system are often incorporated into the new system, introducing a difference between the
functionality of the old and new systems. A fundamental assumption underlying the concept of
software re-engineering is that:

The re-engineering of a software system produces a new form of the system that is
better, in some way, than the original form.

In general, a software system is re-engineered because one or more of its properties are
considered substandard. There are many properties associated with a software system, such as the
understandability of the source code, ease of maintenance, reliability, cost of maintenance, user
satisfaction, ease of use, quality of documentation, size, etc. Empirical results support the assumption
that re-engineering a system can indeed result in improvements [Sneed90].

Modern software engineering practices are based on the principle of refinement. Software is
produced by a series of refinements where each refinement introduces a new detail into the system
description. The choice of a particular refinement is guided by the development method in use.
First, a specification is created. A design is created through a series of refinements on the
specification. (Note this is not true of older development methodologies such as Yourdon-Constantine.)
An implementation of the system is then created by another series of refinements on the design.
The result is a software system that benefits from modern software development practices. In
software engineering the principle of refinement is fundamental to software development, also called
forward engineering.

A serious problem immediately arises when the application of modern software development
practices to existing systems is considered. The problem is simply that the existing system has
already been developed. A particular development technique expects to start with a specification
or design for the existing system, instead of the source code. Unfortunately, most existing
software systems are either undocumented or documented in a badly outdated form. A second
fundamental assumption underlying software re-engineering is that:

The only valid, accurate, and up-to-date description of the functionality of an existing
software system is the source code for that system. Therefore, any software
re-engineering project must begin with the system source code.

The problem, then, is how modern software development practices can be applied.

The  solution  to  this  problem  is  the  use  of  abstraction.  Abstraction  is  applied  to  a  software 
system  by  ignoring  certain  details  of  the  system.  This  simplifies  a  task  by  ignoring  details  that  are 
irrelevant  to  the  current  problem  or  task.  Abstraction  is  the  reverse  of  refinement.  Where  refinement 
adds  successive  layers  of  details,  abstraction  strips  away  details.  The  principle  of  abstraction  is 
fundamental to reverse engineering. Chikofsky and Cross defined software reverse engineering
as:


The process of analyzing a software system to identify the system's components and
their interrelationships and to create representations of the system in another form or
at a higher level of abstraction [Chikofsky90].
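The opposition between refinement and abstraction can be made concrete with a toy sketch. In the Python fragment below a system description is a dictionary keyed by level; refinement adds the details of one more level and abstraction strips levels away. The function names, the level names, and the example details are assumptions made for this illustration. In a real project the higher-level description has usually been lost and must be inferred from the source code, which is far harder than the dictionary lookup shown here.

```python
# Levels of description, ordered from most abstract to most detailed.
LEVELS = ["specification", "design", "implementation"]


def refine(description: dict, level: str, details: dict) -> dict:
    """Refinement: add the details that belong to one more level."""
    refined = dict(description)
    refined[level] = details
    return refined


def abstract(description: dict, up_to: str) -> dict:
    """Abstraction: keep only the levels at or above `up_to`, dropping the rest."""
    keep = LEVELS[: LEVELS.index(up_to) + 1]
    return {lvl: d for lvl, d in description.items() if lvl in keep}


# Forward engineering: a series of refinements (hypothetical details).
spec = refine({}, "specification", {"function": "sort customer records"})
design = refine(spec, "design", {"modules": ["reader", "sorter"]})
impl = refine(design, "implementation", {"language": "COBOL"})

# Reverse engineering: strip the implementation details away again.
recovered = abstract(impl, "design")
```

Here `recovered` carries the specification and design information but no implementation details, which is the sense in which abstraction is the reverse of refinement.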

Figure 3.1 shows the traditional diagram that contrasts forward and reverse engineering.
Forward engineering begins with requirements analysis and specification, proceeds into design, and
design is followed by implementation. Reverse engineering starts with an implementation and
works backward.

[Diagram labels: Forward Engineering, Reverse Engineering]

Figure 3.1: Forward and Reverse Engineering


The principles of abstraction and refinement, embodied in reverse engineering and forward
engineering respectively, form two pillars of software re-engineering. What is missing is the bridge
between these pillars. This bridge is formed by the principle of alteration. Alteration is a change
in the characteristics of a system. Alteration is done on information at some level of abstraction and
includes the addition, deletion, and modification of information at that abstraction level. Within
software re-engineering the principle of alteration is typically embodied by re-design. Re-design
is the art of altering the design of the existing system to produce a design for the new system.

The traditional software re-engineering model can now be described. Figure 3.2 presents a
diagram of this model. The traditional software re-engineering model begins with the source code
for the existing system. First, the source code is analyzed. Starting with the analysis information,
a design for the existing system is constructed. This is reverse engineering. Implementation level
details of the system are discarded and only information about the structure and functionality of
the system is retained. The reverse engineering phase ends with a reconstructed design for the
existing system.


[Diagram labels: Design, Implementation, Old Implementation, Target Implementation]

Figure 3.2: Traditional Software Re-engineering Model


The second phase, re-design, takes and alters the reconstructed design. Undesirable
characteristics of the design, such as poor structure, are removed. The design may be changed to reflect
an object-oriented approach. New functionality may be woven into the design and existing system
functionality may change. The result of the re-design phase is an altered design that reflects modern
software design principles.

The  third  phase,  re-implementation,  takes  the  altered  design  and  uses  forward  engineering 
techniques  to  create  a  new  implementation  of  the  system.  This  new  system,  referred  to  as  the 
target  system,  reflects  the  benefits  of  using  modern  design  and  coding  practices  to  implement 
software.  The  product  of  the  re-implementation  phase  is  the  desired  new  system. 
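The sequencing of the three phases, and the product handed from one phase to the next, can be sketched as a simple pipeline. The Python fragment below is a toy illustration under stated assumptions: the function names, the dictionary representation of code and design, and the example system are all hypothetical, and each function body stands in for what is in practice a large body of analysis and engineering work.

```python
def reverse_engineer(source_code: dict) -> dict:
    """Phase 1: discard implementation details, keep structure and function."""
    return {"modules": sorted(source_code), "style": "unstructured"}


def redesign(design: dict, changes: dict) -> dict:
    """Phase 2: alter the reconstructed design (add, delete, or modify entries)."""
    altered = dict(design)
    altered.update(changes)
    return altered


def reimplement(design: dict) -> dict:
    """Phase 3: forward-engineer a new implementation from the altered design."""
    return {module: f"new {design['style']} code for {module}"
            for module in design["modules"]}


# Hypothetical existing system: module name -> description of its source code.
old_system = {"billing": "GOTO-heavy code", "reports": "GOTO-heavy code"}

reconstructed = reverse_engineer(old_system)                       # reconstructed design
altered = redesign(reconstructed, {"style": "object-oriented"})    # altered design
target_system = reimplement(altered)                               # target system
```

Each phase has one main input and one main end-product, exactly as in the traditional model. What the sketch leaves out (planning, staffing, configuration control, testing, documentation) is also what the traditional model leaves out.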

The traditional software re-engineering model shows three major phases. The sequencing of
these phases is clearly shown. The main input and end-product of each phase are also given. However,
there are many other inputs into the process, and other outputs as well, that are not mentioned
in the above description. This model is presented at a high level of abstraction. Lowering the
abstraction level and the resultant increase in process detail would provide the answers to many
questions about the software re-engineering process.

The problem with this model is that even if each phase were carefully expanded to the lowest
level  of  process  detail,  there  would  remain  unanswered  questions  about  the  re-engineering  process. 
There  would  still  be  unexplained  tasks  that  are  part  of  any  re-engineering  project.  The  reason  is 
that  there  are  tasks  within  a  re-engineering  project  that  occur  outside  the  domain  or  area  of  concern 
of  the  three  specified  phases.  The  traditional  software  re-engineering  model  focuses  on  the  technical 
aspect  of  re-engineering  and  ignores  the  issues  of  managing  and  supporting  a  re-engineering  project. 

The  key  point  to  emphasize  is  that  the  traditional  model  of  software  re-engineering,  which  is 
shown  in  numerous  articles  on  the  subject,  is  incomplete.  The  traditional  model  ignores  several 
aspects  of  the  software  re-engineering  process.  A  useful  process  model  must  cover  the  full  range 
of  tasks  that  can  occur  during  a  re-engineering  project.  The  failure  of  the  model  to  cover  the  full 
range  of  tasks  limits  its  usefulness  for  presenting,  discussing,  reasoning  about,  and  improving  the 
software  re-engineering  process. 


3.2  New  Re-engineering  Model 

A  new  model  of  the  software  re-engineering  process  is  needed  that  includes  the  full  range  of  software 
re-engineering  tasks.  The  traditional  software  re-engineering  model  focuses  on  the  technical  aspects 
of  re-engineering  only.  However,  it  must  be  realized  that  there  is  more  to  the  re-engineering  process 
than  just  the  technical  work. 

3.2.1 Re-engineering Process Aspects

There  are  three  aspects  or  areas  of  concern  within  the  software  re-engineering  process.  These 
aspects  are: 


Management : A software re-engineering project must be managed just like any other software
project. Management tasks focus on project control. A re-engineering project requires
planning. A project requires monitoring. A project requires direction and motivation to keep it
on track and on schedule.

Technical : The heart of a re-engineering project is the technical work. It is the technical tasks
that reconstruct the system design, change it, and re-implement it, creating the target system.
In addition, the target system must be tested. The documentation for the system must also
be updated or replaced.

Support : The managerial and technical aspects of software re-engineering require support if
they are to proceed smoothly. One useful support responsibility is configuration management,
keeping track of all the information and objects used within a project. Quality assurance is
a support task that monitors the output of tasks and ensures conformance with project
standards. Monitoring of an on-going project and the collection of historical project information
provide information to on-going managerial tasks.

These  three  aspects  of  the  software  re-engineering  process  are  not  phases.  Management  of  a 
project  does  not  stop  and  technical  work  begin  at  some  point.  Within  each  aspect  is  a  sequence 
of tasks that occur throughout a re-engineering project. These three aspects or areas of concern
and  the  tasks  that  embody  these  concerns  blend  together  to  form  a  complete  model  of  the  software 
re-engineering  process.  Figure  3.3  represents  those  three  aspects  of  the  software  re-engineering 
process. 


Now  that  the  aspects  that  must  be  incorporated  into  the  process  model  are  known,  what  does 
the  software  re-engineering  process  model  look  like?  This  question  is  addressed  by  examining  each 
aspect in more detail.


Project Management


A  software  re-engineering  project  requires  careful  management.  One  important  task  of  management 
is  planning.  A  re-engineering  project  can  be  planned  in  a  top-down  manner.  A  general  plan  of  the 
entire  project  is  created  first  and  then  pieces  of  the  plan  progress  to  more  specific  levels  of  detail. 
Feedback  is  important  and  the  on-going  project  must  be  carefully  monitored.  Project  information 
must  be  analyzed  and  changes  to  the  project  plan  made  accordingly.  The  use  of  project  tracking 
information  is  referred  to  as  bottom-up  input  to  the  planning  process. 

Project  management  must  deal  with  several  sources  of  difficulties.  These  difficulties  include 
people,  technology,  physical  constraints,  costs,  organizational  objectives,  and  external  conflicts. 
Management  must  coordinate  the  many  different  tasks  that  occur  during  a  project.  The  interfaces, 
dependencies  and  concurrences  between  the  tasks  and  between  the  software  being  re-engineered 
must  be  taken  into  account.  Finally,  management  must  provide  supervision  over  the  many  and 
varied  types  of  individuals  involved  with  a  re-engineering  effort. 

Re-engineering  tasks  that  represent  management  duties  and  concerns  include: 

Define  Approach  The  project  approach  forms  the  project  plan.  The  approach  states  how  the 
existing  software  system  will  be  re-engineered.  The  approach  defines  the  methodology  used 
in  the  project,  the  project  phases,  and  the  tasks  within  each  phase.  The  approach  determines 
if  the  system  will  be  re-engineered  in  one  lump  sum,  in  a  sequence  of  pieces,  or  only  partially. 

Estimation  Project management must be able to estimate the effort, cost, duration, and staffing
requirements for a re-engineering project. Estimates of cost are required if sufficient funds
are to be budgeted. Estimates of project duration are important for coordinating with the
system  users,  and  with  other  organizational  projects.  Estimates  of  staffing  requirements  are 
important  for  scheduling  personnel  that  will  be  assigned  to  the  project. 

Define  Organizational  Structure  Work  within  a  re-engineering  project  is  performed  by  teams. 
The organization of project teams can be guided by the project approach and the organization's
(i.e., company's) philosophy. There can be a team for each project phase, or several
teams  can  work  in  parallel  doing  similar  tasks  on  different  portions  of  the  system. 

Define Project Procedures and Standards  Project management must determine the
procedures that will be followed during the project. Procedures can specify how to conduct reviews,
how to report errors, how to check items in and out of the configuration management
system, etc. Project standards specify the acceptable format of products produced during the
project.

Identify Resources  Project management must determine the resources needed by the re-engineering
project.  Once  the  necessary  resources  are  identified,  they  must  be  located,  and  arrangements 
made  to  have  these  resources  available  to  the  project.  The  project  schedule  can  be  influenced 
by  the  availability  of  necessary  resources. 

Plan  System  Transition  Project  management  must  plan  how  the  target  system  will  be  phased 
into  usage  and  the  existing  system  phased  out.  Bringing  the  target  system  on-line  signals  the 
end of the re-engineering project, but it must be planned and coordinated in advance.

Scheduling  A re-engineering project consists of many tasks. Each task must be scheduled. Tasks
are given an expected starting date and stopping date. The project schedule represents a
relationship between tasks. Some tasks cannot begin until others are finished. Some tasks
can be done concurrently. Some tasks will be repeated with different inputs.

Identify  Tools  There  are  many  tools  that  can  be  used  during  a  re-engineering  project.  Useful 
tools must be identified and their role within a project evaluated. Some tools may need to be
purchased; others may already exist in-house. Tools can reduce the project duration and
improve  system  reliability,  promote  consistency,  and  free  personnel  from  clerical  work.  Tools 
can  be  used  to  support  all  aspects  of  software  re-engineering. 

Define  Acceptance  Criteria  Project  management  must  negotiate  with  the  client  or  system  users 
and  reach  an  agreement  that  states  the  criteria  that  must  be  satisfied  for  the  target  system 
to  be  accepted. 

Conflict  Resolution  The  on-going  re-engineering  project  must  be  managed  and  any  problems  or 
conflicts that arise must be resolved by project management. Conflicts can be of a technical or
non-technical  nature.  Technical  conflicts  are  best  resolved  by  technical  staff,  but  occasionally 
management  must  step  in  and  make  a  choice  that  solves  a  technical  problem. 

Project  Authorization  As  the  project  moves  from  phase  to  phase  and  from  major  task  to  major 
task,  project  management  must  often  give  go-ahead  permission  for  a  task  or  phase  to  begin. 
This  provides  management  with  an  opportunity  to  verify  that  a  milestone  has  been  reached 
and  that  the  work  done  to  date  is  satisfactory. 

Personnel  Management  Project  management  is  responsible  for  managing  project  personnel  and 
maintaining  project  morale.  People  assigned  to  a  project  may  quit  the  organization  or  ask  to 
leave  a  project.  A  person  may  not  do  his/her  job  well,  or  may  be  disruptive  to  their  team. 
Management  must  oversee  the  performance  of  project  personnel  and  ensure  that  each  person 
is  working  smoothly  and  satisfactorily  towards  the  project  completion. 
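The Scheduling entry above describes the schedule as a relationship between tasks: some tasks must wait for others and some can proceed concurrently. One way to picture this is to group tasks into successive "waves" by a topological ordering of their prerequisites. The Python sketch below uses the standard library `graphlib` module (Python 3.9 or later). The function name `schedule_waves`, the task names, and the particular dependencies are assumptions made for the illustration, not a schedule prescribed by this report.

```python
from graphlib import TopologicalSorter


def schedule_waves(prerequisites: dict) -> list:
    """Group tasks into waves; tasks within one wave may run concurrently."""
    sorter = TopologicalSorter(prerequisites)
    sorter.prepare()                 # raises CycleError on circular dependencies
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())   # tasks whose prerequisites are all done
        waves.append(ready)
        sorter.done(*ready)
    return waves


# Hypothetical dependencies: task -> set of tasks that must finish first.
prerequisites = {
    "Reconstruct design": {"Analyze source code"},
    "Plan tests": {"Analyze source code"},
    "Re-design": {"Reconstruct design"},
    "Re-implement": {"Re-design"},
}

waves = schedule_waves(prerequisites)
```

With these assumed dependencies, test planning and design reconstruction fall in the same wave, since both wait only on the source code analysis. Expected starting and stopping dates would then be attached to each wave from the duration estimates.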

The management aspect of the re-engineering process seeks to coordinate and control the many
details of a re-engineering project. Project management does not concern itself with the details of
the technical work, but must be familiar with the technical steps and what is involved.


Project  Technical  Work 


It  is  the  technical  aspect  of  the  process  that  does  the  re-engineering  work.  It  is  the  technical  tasks 
that  manipulate  the  system  source  code,  design,  tests,  and  documentation.  The  traditional  focus 
on the technical aspect of re-engineering is understandable. It is here that most of the project work is
done.  Certainly,  the  most  visible  products  of  a  re-engineering  project  are  created  by  the  technical 
tasks.  The  list  of  technical  tasks  includes  the  following: 

Determine Motivations and Objectives  This task focuses on two questions: Why is the system
to be re-engineered? What does the project hope to achieve? The answer to the first
question is a list of project motivations. Is the system difficult to maintain? What is hap-

Analyze Environments  If the system is to be ported to a new environment or if the existing
system environment is to be upgraded or changed, the differences between the existing
environment and the target environment must be stated. This analysis affects both the technical
and managerial aspects of the project. A difference between environments can affect the
changes made to the system. A new target environment may require staff training.

Collect Inventory  The inventory task identifies and locates all source code files, system
documents, documentation files, test data, test results, maintenance history records, etc. This
information  is  turned  over  to  the  project  librarian  and  becomes  a  support  task  problem, 
but  the  identification  of  all  these  objects  requires  technical  knowledge  of  the  system.  The 
collection  of  all  system  pieces  forms  a  baseline  on  which  the  reverse  engineering  task  can 
begin. 

Test  Planning  The  target  system  must  be  thoroughly  tested  and  this  requires  the  development 
of  a  set  of  test  inputs  and  outputs,  known  as  a  test  plan.  The  existing  system  test  plan  can 
be  evaluated  and  sections  of  it  reused  depending  on  the  changes  that  will  produce  the  target 
system.  New  tests  for  the  target  system  must  also  be  created. 

Target  System  Testing  This  is  the  application  of  the  test  plan  to  the  target  system  to  verify 
that  no  errors  can  be  detected.  This  is  the  same  as  testing  done  during  software  development. 

Documentation  Planning  The  existing  system  documentation  must  be  collected  and  evaluated. 
Non-existent  documents  must  be  written.  Badly  outdated  documents  must  either  be  replaced 
or  rewritten.  Other  documents  will  also  need  to  be  evaluated  and  either  replaced  or  rewritten 
to  reflect  the  characteristics  of  the  target  system. 

Create  Documentation  This  is  the  work  of  re-documenting  the  system.  The  documentation  plan 
guides  the  changes  made  to  existing  documents  and  directs  the  writing  of  new  documents. 
The  target  system  must  have  a  complete,  accurate,  and  up-to-date  set  of  documentation. 

Analyze Implementation  To prepare for the design reconstruction work, the existing system
source  code  must  be  completely  analyzed  and  its  properties  identified.  The  analysis  records 
the  implementation  details  of  the  existing  system.  A  data  dictionary  of  source  code  items  is 
created. 

Reconstruct  Design  This  task  uses  the  source  code  analysis  information  and  reconstructs  a 
design  for  the  existing  system.  Implementation  details  are  removed  from  the  information  and  a 
design  representation  is  created.  The  design  records  the  system  architecture  and  functionality. 
The  reconstructed  design  may  be  expressed  at  several  levels  of  abstraction. 

Plan  Design  Changes  The  changes  to  be  made  to  the  reconstructed  design  are  identified  and 
formed into a plan. Information about the existing design and the envisioned target system
must  be  studied.  The  resulting  plan  describes  the  changes  to  be  made  to  the  design. 

Update  Design  The  current  system  design  is  altered  according  to  the  re-design  plan.  Each  change 
made  to  the  design  is  the  result  of  a  design  decision.  A  record  of  all  changes  must  be  kept. 

Implement Design This task uses forward engineering techniques, takes the altered system design,
and creates a new implementation of the system. Standard forward engineering techniques
may need to be adapted to difficulties posed by re-engineering. For example, it is
possible that only a portion of a system is re-engineered. Here, it will be necessary to merge
the new target portion of the system with the retained and unmodified portions of the system.

Analyze  New  Source  Code  To  document  the  target  system  source  code,  the  new  code  must  be 
analyzed.  This  is  similar  to  the  analysis  done  on  the  existing  system  source  code. 

Acceptance  Testing  This  task  verifies  that  the  target  system  satisfies  the  acceptance  criteria. 
Also  the  updated  documentation  must  be  verified,  as  well  as  converted  file  and  database 
formats.  Typically,  the  target  system  and  existing  system  are  executed  in  parallel,  and  a 
comparison  of  results  is  done. 

System  Transition  This  is  the  last  task  in  the  re-engineering  process.  This  task  controls  how 
the  target  software  is  placed  into  production  and  how  the  existing  system  is  phased  out. 
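
The parallel execution described under Acceptance Testing can be sketched as follows. This is a minimal illustration, not part of the process model: it assumes each system can be driven as a function from an input string to an output string, and the names `parallel_run`, `existing_system`, and `target_system` are hypothetical stand-ins.

```python
from typing import Callable, Iterable, List, Tuple


def parallel_run(
    existing: Callable[[str], str],
    target: Callable[[str], str],
    test_inputs: Iterable[str],
) -> List[Tuple[str, str, str]]:
    """Run both systems on every input and collect the inputs whose results differ."""
    mismatches = []
    for item in test_inputs:
        old_result = existing(item)
        new_result = target(item)
        if old_result != new_result:
            mismatches.append((item, old_result, new_result))
    return mismatches


# Hypothetical stand-ins for the existing and target systems.
def existing_system(text: str) -> str:
    return text.upper()


def target_system(text: str) -> str:
    # Deliberately differs on empty input so that the comparison reports a mismatch.
    return text.upper() if text else "<empty>"


differences = parallel_run(existing_system, target_system, ["abc", "", "x1"])
```

Each reported mismatch would then be checked against the acceptance criteria to decide whether it is an intended change or a defect.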


Project  Support 

Project support tasks are miscellaneous tasks that cannot properly be considered as technical or
managerial.  These  are  tasks  that  handle  details  and  duties  that  would  interfere  with  the  duties  of 
other  groups.  Several  supporting  tasks  are: 

Configuration Management There are many physical entities to be identified, handled, monitored,
and controlled. Each project has its own associated source code, listings, test data,
test cases, test scenarios, test results, and documentation. Typically, a separate team is
formed just to handle this task. At the head of this task is the project librarian, a person
charged with tracking all the project items and their versions. Configuration management
for software re-engineering is similar to that for software development. However, a
re-engineering project immediately starts with a large volume of information about the existing
system, defined by the baseline inventory.

Quality  Assurance  The  quality  assurance  task  runs  throughout  the  project.  Typically,  every 
product  produced  by  a  task  is  subject  to  a  QA  review.  These  reviews  ensure  that  a  product 
conforms  to  the  project  standards  and  is  of  satisfactory  quality.  The  QA  task  also  verifies 
that other project standards are adhered to.

Project Tracking The ongoing project must be tracked and its progress monitored. Information
about the project includes actual start and stop dates for tasks, the number of people
assigned to a task, and the hours worked by project personnel. Tracking information is analyzed
by management to verify that the project is on schedule and to detect any possible problems.
Tracking information is also recorded in a historical project database that can be
used to review project estimates against actual values. This comparison can help fine-tune
the estimation models used within an organization.
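
The comparison of estimates against actual values mentioned under Project Tracking can be sketched as below. The record layout, the `estimation_error` helper, and the sample figures are all assumptions made for illustration; the report does not prescribe a format for the historical project database.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class TaskRecord:
    """One entry of a (hypothetical) historical project database."""
    name: str
    estimated_hours: float
    actual_hours: float
    actual_start: date
    actual_stop: date


def estimation_error(record: TaskRecord) -> float:
    """Relative error; a positive value means the task took longer than estimated."""
    return (record.actual_hours - record.estimated_hours) / record.estimated_hours


# Made-up sample data, used only to exercise the calculation.
history = [
    TaskRecord("Collect Inventory", 40.0, 50.0, date(1991, 1, 7), date(1991, 1, 18)),
    TaskRecord("Test Planning", 80.0, 72.0, date(1991, 1, 21), date(1991, 2, 8)),
]

errors = {record.name: estimation_error(record) for record in history}
```

A consistently positive error across many tasks would suggest that the estimation model underestimates effort and needs recalibration.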

3.2.2  Re-engineering  Project  Phases

The  new  software  re-engineering  process  model  proposed  in  this  report  consists  of  seven  phases. 
These  phases  cover  not  only  the  central  focus  of  re-engineering,  i.e.  the  production  of  the  target 
system,  but  also  project  preparation,  and  the  testing,  documentation,  and  installation  of  the  target 
system.  Figure  3.4  presents  a  diagram  of  the  software  re-engineering  process  model.  The  software 
re-engineering  process  model  contains  these  phases: 

1.  Analysis  and  Planning 

2.  Design  Reconstruction 

3.  Re-design 

4.  Re-implementation

5.  Target  System  Testing

6.  Re-documentation

7.  Acceptance  and  System  Transition


[Figure: process-model diagram. Phases shown: Analysis and Planning, Design Reconstruction, Re-design, Re-implementation, Testing, Documentation, Transition. Products shown: Old Implementation, Design, New Implementation, Tested Implementation, Installed Implementation, Documentation.]

Figure  3.1:  Software  Re-engineering  Process  Model


Each phase of the process consists of a collection of tasks. Tasks may be performed sequentially
or in parallel depending on the nature of the tasks and the organization of the project. Several
of  the  tasks  run  throughout  much  of  the  process  such  as  configuration  management  and  project 
tracking.  The  tasks  assigned  to  each  phase  are  listed  below. 

Analysis  and  Planning

•  Determine  Motivations  and  Objectives 

•  Analyze  Environments 

•  Collect  Inventory 

•  Define  Approach 

•  Plan  System  Replacement 

•  Define  Acceptance  Criteria 

•  Define  Project  Procedures  and  Standards 

•  Identify  Resources 

•  Identify  Tools 

•  Test  Planning 

•  Documentation  Planning 

•  Estimation 

•  Define  Organizational  Structure 

•  Scheduling 

Design  Reconstruction 

•  Analyze  Implementation 

•  Reconstruct  Design 

Re-design 

•  Plan  Design  Changes 

•  Update  Design 

Re-implementation 

•  Implement  Design 

Target  System  Testing 

•  Target  System  Testing 

Re-documentation 

•  Analyze  New  Source  Code 

•  Create  Documentation 

Acceptance  and  System  Transition


•  Acceptance  Testing 

•  System  Transition 

Not  included  in  these  lists  are  the  miscellaneous  tasks  that  either  stand  by  themselves  or  do 
not  fit  into  the  area  of  concern  embodied  by  a  particular  phase.  These  miscellaneous  tasks  are: 

Miscellaneous  Tasks 

•  Configuration  Management 

•  Project  Tracking

•  Conflict  Resolution 

•  Project  Authorization 

•  Personnel  Management 

•  Quality  Assurance 
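
The phase and task lists above map naturally onto a small lookup table. The sketch below simply transcribes the lists into a Python dictionary; the names `PHASES`, `MISCELLANEOUS_TASKS`, and `phase_of` are hypothetical and are not part of the model itself.

```python
from typing import Dict, List, Optional

# Phase name -> tasks assigned to that phase, in the order listed in the text.
PHASES: Dict[str, List[str]] = {
    "Analysis and Planning": [
        "Determine Motivations and Objectives",
        "Analyze Environments",
        "Collect Inventory",
        "Define Approach",
        "Plan System Replacement",
        "Define Acceptance Criteria",
        "Define Project Procedures and Standards",
        "Identify Resources",
        "Identify Tools",
        "Test Planning",
        "Documentation Planning",
        "Estimation",
        "Define Organizational Structure",
        "Scheduling",
    ],
    "Design Reconstruction": ["Analyze Implementation", "Reconstruct Design"],
    "Re-design": ["Plan Design Changes", "Update Design"],
    "Re-implementation": ["Implement Design"],
    "Target System Testing": ["Target System Testing"],
    "Re-documentation": ["Analyze New Source Code", "Create Documentation"],
    "Acceptance and System Transition": ["Acceptance Testing", "System Transition"],
}

# Tasks that are not tied to a single phase (called Project Tracking in the task descriptions).
MISCELLANEOUS_TASKS: List[str] = [
    "Configuration Management",
    "Project Tracking",
    "Conflict Resolution",
    "Project Authorization",
    "Personnel Management",
    "Quality Assurance",
]


def phase_of(task: str) -> Optional[str]:
    """Return the phase a task belongs to, or None for miscellaneous or unknown tasks."""
    for phase, tasks in PHASES.items():
        if task in tasks:
            return phase
    return None
```

A table like this could serve as the starting point for a scheduling or tracking tool, since the dictionary preserves the order of the phases.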

The  new  software  re-engineering  process  model  presented  in  this  chapter  is  a  more  complete 
model.  The  identification  of  several  additional  phases  and  the  inclusion  of  many  more  tasks  suggests 
the  complexity  of  a  re-engineering  project.  However,  the  model  presented  in  this  chapter  is  still  a 
high-level abstraction of the re-engineering process. Many details of the process have not been
shown. In the next chapter the Analysis and Planning phase is covered. This coverage is at a lower
level of abstraction and focuses on the information objects used and produced by the phase tasks.
The  tasks  that  occur  during  this  phase  are  also  discussed  in  more  detail. 

Chapter  4


Analysis  and  Planning  Phase 


The  analysis  and  planning  phase  is  the  first  phase  of  the  software  re-engineering  process.  If  planning 
and  organization  are  the  keys  to  a  successful  project  then  this  is  the  key  phase.  There  may  be  an 
urge to hurry up and just produce a cursory project plan or to skip this phase altogether so that
the "real" project work can begin. That is a mistake: only by careful planning can a re-engineering
project be conducted smoothly and with a minimum of unexpected problems.

The  analysis  portion  of  this  phase  focuses  on  technical  aspects  of  a  project.  Analysis  addresses 
three  issues.  The  first  issue  is  to  define  the  current  state  of  the  existing  system.  What  are  its 
shortcomings?  How  serious  are  these  shortcomings?  Because  the  re-engineering  work  will  begin 
with  the  existing  system  source  code,  the  properties  of  the  existing  system  must  be  understood. 

The  second  issue  is  to  define  why  the  existing  system  is  to  be  re-engineered.  Is  the  project 
prompted  by  dissatisfaction  with  one  or  more  aspects  of  the  system?  Are  there  external  influences 
involved?  Is  the  system  to  be  ported  to  a  new  computer?  Does  the  implementation  language  need 
to be changed due to some organizational directive? Has the existing environment been upgraded,
so that the system is to be changed to take advantage of the new environment? Is the motivation a
mix of dissatisfaction and external influences? The project motivations must be understood since
they  directly  influence  the  project  objectives. 

The  third  issue  is  to  specify  the  characteristics  of  the  desired  target  system.  This  involves  two 
aspects  of  the  system.  First,  are  there  changes  to  the  system  requirements?  Will  the  functionality 
of the target system differ from that offered by the existing system? If yes, then the new requirements
must be specified, just as in a software development project. Are there changes to existing
requirements? Are any existing requirements to be deleted? If the answer is yes to either question,
the affected requirements must be noted. It may be difficult to state exactly which existing requirements
are to be changed or deleted, since the system may have no valid requirement specification.
The  second  aspect  involves  changes  to  the  characteristics  of  the  system.  Is  the  system  design  to 
be  changed?  Is  the  system  implementation  to  be  changed?  Changes  of  this  nature  can  include 
translation  to  another  language,  cleaning  up  of  program  logic,  the  introduction  of  modularized  or 
object-oriented  design  principles,  etc.  The  desired  characteristics  of  the  target  system  must  be 
understood  to  plan  the  work  necessary  to  create  the  target  system. 

The  planning  portion  of  this  phase  focuses  on  management  aspects  of  a  project.  To  conduct 
a successful project it is necessary to understand the scope of the work, the resources required,
the tasks to be done, the milestones to be tracked, the effort to be expended, and the schedule to be
followed.  Project  planning  focuses  on  research  and  estimation.  Research  is  done  to  define  the  scope 
of  the  work  to  be  achieved.  Estimation  is  done  to  predict  the  project  effort,  duration,  and  cost. 
Estimates  are  used  to  guide  project  planning. 

The  remainder  of  this  chapter  is  devoted  to  a  formal  specification  of  the  analysis  and  planning 
phase. The specification is not complete, in that not all aspects of the phase are specified. The
specification  focuses  on  two  aspects  of  the  phase:  information  and  operations  on  information. 

What  information  is  used  and  created  during  this  phase?  The  re-engineering  process  centers 
on information. Information flows into the process, is manipulated, created, and produced as a
product of the process. Each item of information must be identified, each property specified, and each
relationship with other information elements clarified. The role of each element of information
within  the  process  must  be  stated.  To  understand  the  process  it  is  necessary  to  understand  the 
information  on  which  the  process  operates. 

What  operations  are  performed  on  the  information  used  in  this  phase?  Information  is  data. 
Elements  of  information  are  grouped  together  and  treated  as  objects.  Information  is  read,  created, 
changed,  and  deleted.  It  is  combined  to  form  new  information.  Operations  are  done  on  information 
to  achieve  task  goals,  phase  goals,  and  project  goals.  The  operations  that  can  be  done  on  each 
element of information must be specified. Understanding these operations clarifies what can and
cannot be done within the process.

In  addition  to  these  two  issues,  the  specification  focuses  on  tasks  that  occur  within  the  phase. 
Descriptions  of  tasks  are  given,  but  formal  properties  and  interrelationships  between  tasks  are  not 
yet  provided.  One  last  issue  that  is  also  addressed  is  the  role  of  knowledge  within  the  process. 
Humans apply knowledge to their work. Often the role of knowledge is ignored, but to completely
specify  the  re-engineering  process,  the  role  of  knowledge  within  the  process  must  be  considered. 


4.1  Types  of  Knowledge 

The  use  of  knowledge  permeates  the  entire  software  re-engineering  process.  However,  the  role 
of knowledge within the process is often ignored or overlooked. This becomes apparent when
process  tasks  are  considered.  For  example,  one  task  within  the  analysis  and  planning  phase  is  to 
produce  a  list  of  project  objectives.  Where  do  these  objectives  come  from?  What  input  or  source  of 
information  can  be  given  that  accounts  for  this  output  (the  list  of  objectives)  from  the  task?  These 
objectives  derive  from  a  human’s  understanding  of  the  existing  system's  status,  its  environment, 
and  the  needs  of  the  system's  users.  The  list  of  objectives  is  not  created  by  a  computerized  tool  that 
analyzes  information  stored  in  a  computer.  The  list  is  created  by  a  human  using  several  sources 
and areas of knowledge and then copied using a computerized tool onto a computer file.

This  example  emphasizes  an  important  point  in  process  modeling.  Many  useful  tools  serve 
clerical  roles.  Humans  do  the  work  and  tools  are  used  to  record  the  result  of  human  labor.  As  our 
knowledge and understanding of parts of the process deepen, better tools that take on more of the
burden  of  the  work  will  be  created.  Knowledge  can  be  embedded  into  tools  that  can  then  do  work 
that  currently  is  a  human  responsibility. 

The issue of knowledge and its role in the process is important. There are many different
knowledge  domains  that  are  used  in  the  software  re-engineering  process.  Currently,  it  is  not  possible 
to  fully  define  each  knowledge  domain.  Rather  than  enter  into  a  detailed  specification  of  knowledge, 
its  structure,  and  the  relationships  between  different  groups  of  knowledge,  we  will  specify  knowledge 
at  a  very  abstract  level. 

The set of all knowledge is denoted by KNOWLEDGE. The structure of this set will be left
unspecified. This may be unsatisfactory, but it avoids a can of worms that would sidetrack this
discussion.  Several  subsets  of  KNOWLEDGE  can  be  specified.  These  subsets  serve  to  group 
together  knowledge  related  to  a  particular  domain.  Again,  intuitive  definitions  of  these  sets  will  be 
given.  Each  knowledge  domain  has  the  specification: 

$domain \subseteq KNOWLEDGE$

The knowledge domains of interest are explained below. There are possibly other knowledge domains
used in software re-engineering that are not listed. As our understanding of the software
re-engineering  process  grows  it  is  likely  that  other  necessary  domains  will  be  identified. 

Application  Domain  Knowledge  This  is  knowledge  about  the  characteristics  and  properties  of 
the  problem  area  that  the  system  addresses.  A  software  system  encodes  a  process  for  solving 
a  problem  or  performing  some  task.  To  understand  why  a  system  does  what  it  does,  it  is 
necessary  to  understand  the  problem  area  in  which  the  system  operates. 

Environment Knowledge This is knowledge about the computer environment in which the system
executes. It is knowledge about the computer hardware, the operating system, and other
systems and devices with which the existing system interacts. Environment knowledge includes
knowledge about any changes or planned changes to the existing system environment
or the target environment that the target system will operate in. This knowledge includes an
understanding of the computing environment at the organization or the client site.

Existing  System  Knowledge  This  is  knowledge  about  the  characteristics  and  properties  of  the 
existing  system.  This  is  an  understanding  of  how  to  use  the  system,  how  it  works,  the  source 
code,  the  system  history,  where  the  source  files  are  located,  the  documentation,  and  the 
shortcomings  and  strengths  of  the  system. 

Management  Knowledge  This  is  knowledge  about  how  to  manage  a  project.  This  is  knowledge 
about scheduling, estimation methods, dealing with personnel, understanding potential management
pitfalls and how to avoid them, etc. This is not specific knowledge about managing
re-engineering  projects,  though  that  is  included.  It  is  a  broad  understanding  about  how  to 
manage  software  projects. 

Organization Knowledge This is knowledge about the organization that will conduct the software
re-engineering project. Typically, only a subset of the organizational resources (i.e., people
and equipment) is dedicated to a project, but an understanding of the entire organization,
its structure, its policies and procedures is required.

Programming Language Knowledge This is knowledge about the languages used to implement
the existing and target systems. It is an understanding of the language principles, the syntax, [...]




Software  Engineering  Knowledge  This  is  knowledge  about  software  engineering  principles  and 
practices. Forward engineering is a component of the re-engineering process so general software
engineering techniques must be known.

Software Re-engineering Knowledge This is knowledge about how to re-engineer software: the
techniques, methods, and methodologies involved. It is the understanding about how re-engineering
work  is  accomplished.  This  includes  knowledge  about  necessary  tasks,  techniques,  methods, 
pitfalls  and  how  to  avoid  them  or  get  out  of  them,  etc. 

Tool Knowledge This is an understanding of tools that are available either in-house or commercially.
It is knowledge about how tools can be used, the problems they address, and their
suitability for any given task. Tools can play an important role within a project and that
role  must  be  understood  if  the  tools  are  to  be  used  effectively. 

4.2  Information  and  Operations 

This  section  presents  a  formal  specification  of  the  information  used  within  the  Analysis  and  Planning 
phase of the software re-engineering process. Operations on information items are also specified.
For each information object, a description of the information and its role in the process is given,
followed  by  a  specification  of  the  information.  Finally,  operations  on  the  information  are  described 
and  specified. 

Error  conditions  and  error  handling  are  not  currently  covered  in  the  specification.  While  it  is 
possible to specify errors, it was felt that the extra detail would detract from the essential points.

4.2.1  Basic  Sets 

There are several sets that are used throughout the specification. Those sets are defined here. The
index  provided  with  this  report  shows  where  each  defined  item  of  the  specification  is  declared  and 
used.  The  reader  may  wish  to  refer  to  the  index  while  reading  the  specification. 

Dates  are  used  throughout  the  re-engineering  process.  A  project  has  a  start  date  and  stop 
date.  Tasks  and  phases  have  estimated  start  and  stop  dates,  as  well  as  actual  start  and  stop  dates. 
Employees  record  hours  worked  on  particular  dates.  The  set  of  all  valid  dates  is  denoted  by  the  set 
DATE. 

DATE = The set of all valid dates
We let a distinguished element of DATE denote an unassigned date.

Names are used throughout the re-engineering process. Programs have names, files have names,
personnel have names, tasks have names, etc. The set of all possible names is denoted by the set
NAME.

NAME = The set of all names
We let a distinguished element of NAME denote an unassigned name.

Many items of information are labeled: individual elements of a requirement specification have
labels, steps in a procedure are labeled, and elements of any list are typically labeled. The concept of
a label differs from the concept of a name. An item that is labeled typically is part of a sequence
of items or a collection of items that are somehow ordered, whereas a name denotes individuality
within a set of related items. The set of all possible labels is denoted by the set LABEL.

LABEL  =  The  set  of  all  labels 

Many times throughout a process it is necessary to specify that some action cannot happen
until a set of conditions has been satisfied, or that some task will satisfy a set of conditions.
A condition may specify that a certain data object must exist, that another process step has
been completed, or that a human has permission to begin the action associated with a step. We
let CONDITION denote the set of all possible conditions. (For now, we ignore the fact that a
condition will most likely be a collection of conditions connected by logical connectives.)

CONDITION  =  The  set  of  all  possible  conditions 
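
One way to picture conditions joined by logical connectives is as predicates over the current project state. The sketch below is an assumption-laden illustration, not part of the specification: the state is modeled as a set of names of existing objects and completed steps, and `exists`, `all_of`, and `any_of` are hypothetical helpers.

```python
from typing import Callable, FrozenSet

# Assumed model: the project state is the set of names of objects that exist
# and steps that have been completed.
State = FrozenSet[str]
Condition = Callable[[State], bool]


def exists(item: str) -> Condition:
    """Condition that holds when the named object or completed step is in the state."""
    return lambda state: item in state


def all_of(*conditions: Condition) -> Condition:
    """Logical AND of several conditions."""
    return lambda state: all(condition(state) for condition in conditions)


def any_of(*conditions: Condition) -> Condition:
    """Logical OR of several conditions."""
    return lambda state: any(condition(state) for condition in conditions)


# Example: a task that may start only once both prerequisites are present.
ready_to_test = all_of(exists("baseline inventory"), exists("test plan"))
```

Under this model an action is enabled when its condition, evaluated against the current state, returns true.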


4.2.2  Property  Lists 

Many  of  the  information  objects  that  are  manipulated  during  a  re-engineering  project  have  an 
associated  collection  of  properties.  These  properties  differ  from  object  to  object.  For  example, 
a  source  code  file  has  a  name,  a  size,  a  list  of  functions  defined  in  it,  a  programming  language 
associated with it, etc. These are properties of source code files.

A  property  typically  has  a  name  and  a  value.  The  set  of  all  possible  properties  is  denoted  by 
the set PROPERTY. The set of all possible property names is denoted by the set PROP_NAME.
The  set  of  all  possible  property  values  is  denoted  by  the  set  VALUE. 

PROPERTY = The set of all properties
VALUE = The set of all property values
PROP_NAME = The set of all property names

$PROP\_NAME \subseteq NAME$

A list of properties is modeled as a partial function from property names to values:
$PROP\_LIST : PROP\_NAME \pfun VALUE$

An initial property list is empty. This is denoted by the following constant:
$INITIAL\_PROP\_LIST = \emptyset$


Property  List  Operations 

•  Add-property

•  Update-property

•  Delete-property

•  Get-property 

•  List-properties 

To denote that a property list is changed by an operation the following notation is used:
$\Delta PROP\_LIST = [\, pl, pl' : PROP\_LIST \,]$

To specify that a property list is not changed by an operation the following notation is used:
$\Xi PROP\_LIST = [\, pl, pl' : PROP\_LIST \mid pl' = pl \,]$

The  operation  Add-property  is  used  to  add  a  new  property  to  an  existing  property  list. 


The operation Update-property changes the value associated with a property.

\begin{schema}{Update-property}
  \Delta PROP\_LIST \\
  p? : PROP\_NAME \\
  v? : VALUE
\where
  p? \in \dom pl \\
  pl' = pl \oplus \{ p? \mapsto v? \}
\end{schema}


The operation Delete-property removes a property from a property list.

\begin{schema}{Delete-property}
  \Delta PROP\_LIST \\
  p? : PROP\_NAME
\where
  p? \in \dom pl \\
  pl' = \{ p? \} \ndres pl
\end{schema}

The operation Get-property retrieves the value associated with a property.

\begin{schema}{Get-property}
  \Xi PROP\_LIST \\
  p? : PROP\_NAME \\
  v! : VALUE
\where
  p? \in \dom pl \\
  v! = pl(p?)
\end{schema}


The  operation  List-properties  retrieves  a  list  of  all  the  properties  and  their  values. 

\begin{schema}{List-properties}
  \Xi PROP\_LIST \\
  list! : \power (PROP\_NAME \cross VALUE)
\where
  list! = \{ p : PROP\_NAME; v : VALUE | pl(p) = v \}
\end{schema}
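
The property list operations can be mirrored in a few lines of Python, using a dictionary as the partial function from property names to values. This is a sketch under stated assumptions rather than the report's implementation: the class and method names are hypothetical, the precondition of `add_property` (the name must not already be present) is assumed by analogy with Add-reason, and raising `KeyError` on a violated precondition is a choice made here, since the specification leaves error handling open.

```python
from typing import Dict, Hashable, Set, Tuple


class PropertyList:
    """A partial function from property names to values, held in a dict."""

    def __init__(self) -> None:
        # Corresponds to the empty initial property list.
        self._pl: Dict[str, Hashable] = {}

    def add_property(self, name: str, value: Hashable) -> None:
        # Assumed precondition: name is not yet in dom pl.
        if name in self._pl:
            raise KeyError(f"property already present: {name}")
        self._pl[name] = value

    def update_property(self, name: str, value: Hashable) -> None:
        # Precondition: name in dom pl; the old value is overridden.
        if name not in self._pl:
            raise KeyError(f"no such property: {name}")
        self._pl[name] = value

    def delete_property(self, name: str) -> None:
        # Precondition: name in dom pl; the name is removed from the domain.
        if name not in self._pl:
            raise KeyError(f"no such property: {name}")
        del self._pl[name]

    def get_property(self, name: str) -> Hashable:
        # Precondition: name in dom pl; the list itself is unchanged.
        if name not in self._pl:
            raise KeyError(f"no such property: {name}")
        return self._pl[name]

    def list_properties(self) -> Set[Tuple[str, Hashable]]:
        # The set of (name, value) pairs; values must be hashable for this to work.
        return set(self._pl.items())


# Usage with made-up properties of a source code file.
props = PropertyList()
props.add_property("language", "FORTRAN")
props.add_property("size", 1200)
props.update_property("size", 1350)
props.delete_property("language")
```

Keeping each precondition as an explicit check makes the correspondence with the schemas easy to audit, at the cost of a few repeated lines.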


4.2.3  Motivations  and  Objectives 

The  motivations  for  re-engineering  a  software  system  often  reflect  problems  with  the  existing  system. 
There  are  a  variety  of  reasons  to  re-engineer  a  system.  The  primary  reason  is  often  to  improve 
the system maintainability in an effort to reduce maintenance costs or shorten the time required
to change the system. The motivations for a project cause the project to happen. Motivations are
typically problems or events that must be addressed. Possible re-engineering project motivations
include: 


•  The  existing  system  is  too  costly  to  maintain. 

•  It  takes  too  long  to  make  changes  to  the  existing  system. 

•  The  existing  system  (or  part  of  it)  must  be  rewritten  in  another  programming  language. 

•  The  existing  system  is  unreliable. 

•  The  system  must  be  ported  to  a  new  operating  environment.  This  refers  to  adapting  the 
system  to  new  hardware  or  operating  system. 

•  The  system  must  adapt  to  changes  in  its  operating  environment.  This  may  be  caused  by  a 
new  release  of  an  operating  system,  compiler,  or  hardware. 

•  The documentation for the system is not complete, non-existent, outdated, or badly inaccurate.

•  The  system  uses  old  home-grown  utilities. 

•  The system does not make use of newer programming language features.

The concept of re-engineering project motivations is formalized by defining the set of all possible
motivations  to  re-engineer  a  software  system.  This  is  denoted  by  the  set: 

MOTIVATION  =  The  set  of  all  possible  re-engineering  motivations 

Project objectives are typically the response to project motivations; however, that is not always
true. Once the decision is made to re-engineer a system other objectives are often added. Other
objectives, such as a desire to make changes to the system requirements, are often combined into a
re-engineering project. (The old "while we're at it, why don't we also do this..." cliche.) Objectives
are  project  goals.  Objectives  specify  what  the  project  is  to  accomplish.  Possible  re-engineering 
project  objectives  include: 

•  Improve  the  maintainability  of  the  software  system. 

•  Translate  the  system  (or  part  of  it)  into  another  programming  language. 

•  Improve  the  reliability  of  the  system. 

•  Port  the  system.  (System  conversion.) 

•  Adjust  the  system  to  operate  within  the  new  environment. 

•  Change  the  functionality  provided  by  the  system. 

•  Re-document the system. Create new documents when a required document is non-existent.
   Update or replace existing documents.

•  Replace home-grown utilities with environment supplied utilities. For example, replace locally
   written file manipulating routines with standard operating system calls.

•  Make  better  use  of  programming  language  capabilities  and  features. 

The concept of re-engineering project objectives is formalized by defining the set of all possible
objectives for a re-engineering project. This is denoted by the set:

OBJECTIVE = The set of all possible re-engineering objectives

There  are  typically  several  project  motivations  and  objectives,  so  it  is  helpful  if  each  is  labeled. 
This  simplifies  the  problem  of  referring  to  a  particular  project  motivation  or  objective. 

The lists of project motivations and objectives provide a high-level description of what a
re-engineering project is to accomplish (goals) and the reasons for the project. These two lists are
grouped  together  to  form  the  project  definition.  The  project  definition  may  also  be  called  the 
project  overview.  The  project  definition  is  formalized  as: 

\begin{schema}{DEFINITION}
  reasons : LABEL \pinj MOTIVATION \\
  goals : LABEL \pinj OBJECTIVE
\end{schema}




Notice  that  each  labeled  list  is  formalized  as  a  one-to-one  partial  function.  This  captures  the  idea 
that a particular motivation or objective only occurs once, and has a unique label.

The initial value for the project definition is given by the constant INIT_DEFINITION.

\begin{schema}{INIT\_DEFINITION}
  DEFINITION
\where
  reasons = \emptyset \\
  goals = \emptyset
\end{schema}


Motivations  and  Objectives  Operations 

The project definition is manipulated by several operations. These operations are:

•  Add-reason

•  Add-goal

•  Delete-reason

•  Delete-goal

•  Get-reason

•  Get-goal

•  List-reasons

•  List-goals

The need to add new reasons and goals is clear. The need to delete a reason or goal arises from
changing views of the project. The initial list of reasons and goals is likely to change as the
project  scope  is  explored  in  more  depth. 

To specify that the project definition is changed by an operation the following notation is used:

$\Delta DEFINITION = [\, DEFINITION; DEFINITION' \,]$

To denote that the project definition is not changed by an operation the following notation is
used:

$\Xi DEFINITION = [\, DEFINITION; DEFINITION' \mid DEFINITION' = DEFINITION \,]$


The operation Add-reason is used to add a new reason to the list of reasons for re-engineering
a system.

\begin{schema}{Add-reason}
  \Delta DEFINITION \\
  m? : MOTIVATION \\
  l? : LABEL
\where
  l? \notin \dom reasons \\
  reasons' = reasons \cup \{ l? \mapsto m? \} \\
  goals' = goals
\end{schema}


The operation Delete-reason is used to remove the specified reason from the list of reasons.

Delete-reason ____________________
ΔDEFINITION
l? : LABEL
--------------------
l? ∈ dom reasons
reasons' = { l? } ⩤ reasons
goals' = goals


The operation Get-reason is used to retrieve a reason for re-engineering a system from the
project definition, given the label for the reason.

Get-reason ____________________
ΞDEFINITION
l? : LABEL
m! : MOTIVATION
--------------------
m! = reasons(l?)


The operation List-reasons is used to retrieve a list of all the reasons for re-engineering a system.

List-reasons ____________________
ΞDEFINITION
l? : LABEL
m! : ℙ(LABEL × MOTIVATION)
--------------------
m! = { l : LABEL; m : MOTIVATION | reasons(l) = m }


The operations Add-goal, Delete-goal, Get-goal, and List-goals are similarly defined.
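
The reason operations above can be mirrored in executable form. The following Python sketch is an illustration only and is not part of the specification: the class name ProjectDefinition and its method names are assumptions, labels and motivations are modeled as plain strings, and preconditions are reported as exceptions (the specification does not say how a failed precondition is handled). The check on duplicate motivations reflects the one-to-one requirement stated for DEFINITION.

```python
class ProjectDefinition:
    """Illustrative model of the DEFINITION schema (all names are hypothetical)."""

    def __init__(self):
        # INIT-DEFINITION: both labeled lists start out empty.
        self.reasons = {}
        self.goals = {}

    def add_reason(self, label, motivation):
        # Precondition of Add-reason: the label is not yet in use.
        if label in self.reasons:
            raise KeyError(f"label already in use: {label}")
        # One-to-one requirement of DEFINITION: a motivation occurs only once.
        if motivation in self.reasons.values():
            raise ValueError("motivation already recorded under another label")
        self.reasons[label] = motivation

    def delete_reason(self, label):
        # Precondition of Delete-reason: the label must be present.
        if label not in self.reasons:
            raise KeyError(f"unknown label: {label}")
        del self.reasons[label]

    def get_reason(self, label):
        # Get-reason leaves the definition unchanged.
        return self.reasons[label]

    def list_reasons(self):
        # List-reasons returns the set of (label, motivation) pairs.
        return set(self.reasons.items())
```

The goal operations would follow the same pattern with the goals dictionary.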

4.2.4  Operating  Environments 

A software system is assembled and operated within an environment. This environment is defined
by the operating system, the computer architecture, the compiler, other systems that interact with
the existing system, devices used by the system, vendor supplied libraries of routines, etc. If any
of these elements is changed, it may be necessary or desirable to adapt the existing system to
conform with the new environment.

A  change  in  the  environment  can  be  caused  by  an  upgrade  in  the  operating  system  or  the 
compiler.  It  may  be  a  change  in  the  interface  to  another  system.  The  existing  system  may  be 
ported  to  a  new  computer  with  a  similar  or  dissimilar  operating  system  and  different  hardware 
architecture.  If  the  target  environment  is  not  the  same  as  the  existing  environment,  it  is  necessary 
to  analyze  the  two  environments,  detect  differences  between  them,  and  determine  the  possible 
effects  of  these  differences  on  the  existing  system.  Environmental  differences  that  are  important  to 
a  re-engineering  project  are  those  differences  that  can  affect  the  way  the  system  operates. 

Information  about  both  the  existing  environment  and  the  target  environment  must  be  collected. 
Environment  information  includes  the  type  of  computer,  operating  system,  and  its  release  level, 
vendor  supplied  utilities,  utility  interfaces,  word  size,  byte  ordering,  character  representation,  etc. 

Information about a particular environment is modeled by a property list. There are two
environments that must be modeled: the existing and target environments. Information about
these environments is formalized by existing-env and target-env, where:

existing-env : PROP-LIST
target-env : PROP-LIST

Once each environment has been analyzed, the differences between the two environments must
be noted. The differences are modeled by diffs, where diffs lists the properties whose values
differ between the two environments. For example, both environments may provide a system call
to open a file, but the parameters used by the call differ between the two environments.

diffs : PROP-NAME ⇸ VALUE × VALUE

The existing environment features that are missing in the target environment must be noted.
Features of the target environment that are not present in the existing environment must also be
noted. These two sets of information are modeled by lacking and new-features.

lacking : ℙ PROP-NAME
new-features : ℙ PROP-NAME

The set lacking lists properties and features of the existing environment that are not found in the
target environment. The set new-features lists properties and features of the target environment
that are not found in the existing environment.

Finally, we want to know about the differences that are important. This information is
modeled by:

concerns  :  PROP-LIST 

The partial function concerns does not list properties of the environments, but concerns (properties)
and statements about those concerns (values). The concerns recorded refer to differences between
the two environments that may influence the system.

All the above information is collected together into a schema that represents the environment
analysis. This schema also states the relationships between these information items.

ENVIRONMENTS ____________________
existing-env : PROP-LIST
target-env : PROP-LIST
diffs : PROP-NAME ⇸ VALUE × VALUE
lacking : ℙ PROP-NAME
new-features : ℙ PROP-NAME
concerns : PROP-LIST
--------------------
diffs = { p : PROP-NAME; v1, v2 : VALUE |
          p ∈ dom existing-env ∧ p ∈ dom target-env ∧
          v1 = existing-env(p) ∧ v2 = target-env(p) ∧ v1 ≠ v2 }
lacking ⊆ dom existing-env
lacking ∩ dom target-env = ∅
new-features ⊆ dom target-env
new-features ∩ dom existing-env = ∅


Initially, when the Analysis and Planning phase begins, no environment information is recorded.
The initial environment information is denoted by the constant INIT-ENVIRONMENTS.

INIT-ENVIRONMENTS ____________________
ENVIRONMENTS
--------------------
existing-env = ∅
target-env = ∅
diffs = ∅
lacking = ∅
new-features = ∅
concerns = ∅


Operating  Environment  Operations 

Several  operations  on  environment  information  can  be  defined.  Because  most  of  the  information  is 
stored  as  property  lists  the  property  list  operations  can  be  used.  The  operations  on  environment 
information  are: 

•  Add-existing-env-prop

•  Delete-existing-env-prop

•  Update-existing-env-prop

•  Get-existing-env-prop

•  List-existing-env-props

•  Add-target-env-prop 

•  Delete-target-env-prop 

•  Update-target-env-prop 

•  Get-target-env-prop 

•  List-target-env-props 

•  List-differences 

•  List-lacking 

•  List-new-features 

•  Note-concern 

•  Delete-concern 

•  Update-concern 

•  Get-concern 

•  List-concerns 

Notice that there are no operations that set the values of diffs, lacking, or new-features. This
information can be determined automatically, as the specification shows.
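
As a rough illustration of how these derived items could be computed, the Python sketch below models each property list as a dictionary. The function name analyze_environments and the sample property names are hypothetical. The schema only requires lacking and new-features to be subsets of the respective domains; the sketch assumes the largest such sets are wanted.

```python
def analyze_environments(existing_env, target_env):
    """Derive (diffs, lacking, new_features) from two property lists (dicts)."""
    shared = existing_env.keys() & target_env.keys()
    # diffs: properties present in both environments whose values differ.
    diffs = {
        p: (existing_env[p], target_env[p])
        for p in shared
        if existing_env[p] != target_env[p]
    }
    # lacking: properties of the existing environment missing from the target.
    lacking = set(existing_env.keys() - target_env.keys())
    # new_features: properties of the target missing from the existing environment.
    new_features = set(target_env.keys() - existing_env.keys())
    return diffs, lacking, new_features


# Hypothetical sample data, for illustration only.
existing = {"os": "A", "word_size": 32, "tape_io": "yes"}
target = {"os": "B", "word_size": 32, "sockets": "yes"}
diffs, lacking, new_features = analyze_environments(existing, target)
```

With this sample data, diffs holds the single entry for "os", lacking holds "tape_io", and new_features holds "sockets".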

To denote that the ENVIRONMENTS information can be changed by an operation, the following
notation is used.

ΔENVIRONMENTS = [ ENVIRONMENTS, ENVIRONMENTS' ]

To denote that the ENVIRONMENTS information is not changed by an operation, the following
notation is used.

ΞENVIRONMENTS = [ ENVIRONMENTS, ENVIRONMENTS' | ENVIRONMENTS' = ENVIRONMENTS ]

The operation Add-existing-env-prop is used to add a new property to the list of information
about the existing environment.

Add-existing-env-prop ____________________
ΔENVIRONMENTS
p? : PROP-NAME
v? : VALUE
--------------------
Add-property[ existing-env / pl, existing-env' / pl' ]
target-env' = target-env
diffs' = diffs
lacking' = lacking
new-features' = new-features
concerns' = concerns


The operation Delete-existing-env-prop is used to remove a property from the list of information
about the existing environment.

Delete-existing-env-prop ____________________
ΔENVIRONMENTS
p? : PROP-NAME
--------------------
Delete-property[ existing-env / pl, existing-env' / pl' ]
target-env' = target-env
diffs' = diffs
lacking' = lacking
new-features' = new-features
concerns' = concerns


The operation Update-existing-env-prop is used to change the information about a property of
the existing environment.

Update-existing-env-prop ____________________
ΔENVIRONMENTS
p? : PROP-NAME
v? : VALUE
--------------------
Update-property[ existing-env / pl, existing-env' / pl' ]
target-env' = target-env
diffs' = diffs
lacking' = lacking
new-features' = new-features
concerns' = concerns


The operation Get-existing-env-prop is used to retrieve an item of information about the existing
environment.

Get-existing-env-prop ____________________
ΞENVIRONMENTS
p? : PROP-NAME
v! : VALUE
--------------------
Get-property[ existing-env / pl, existing-env' / pl' ]


The operation List-existing-env-props is used to report all the information collected about the
existing environment.

List-existing-env-props ____________________
ΞENVIRONMENTS
list! : ℙ(PROP-NAME × VALUE)
--------------------
List-properties[ existing-env / pl, existing-env' / pl' ]


The operations on target environment properties, Add-target-env-prop, Delete-target-env-prop,
Update-target-env-prop, Get-target-env-prop, and List-target-env-props, are defined similarly to the
above operations.

The  operation  List-differences  is  used  to  obtain  a  list  of  all  the  environment  differences. 

List-differences ____________________
ΞENVIRONMENTS
list! : ℙ(PROP-NAME × VALUE × VALUE)
--------------------
list! = { p : PROP-NAME; v1, v2 : VALUE | diffs(p) = (v1, v2) }


The  operation  List-lacking  is  used  to  obtain  a  list  of  all  the  properties  of  the  existing  environment 
that  are  lacking  in  the  target  environment.  The  absence  of  a  feature  used  by  the  existing  system 
can  increase  the  complexity  of  a  project,  since  it  usually  requires  that  a  similar  feature  be  developed 
in  the  target  environment. 

List-lacking ____________________
ΞENVIRONMENTS
list! : ℙ PROP-NAME
--------------------
list! = lacking


The  operation  List-new-features  is  used  to  obtain  a  list  of  all  the  properties  of  the  target 
environment  that  are  lacking  in  the  existing  environment.  There  may  be  features  in  the  target 
environment  that  the  system  should  be  adapted  to  use.  This  operation  is  specified  similarly  to  the 
operation  List-lacking. 

The operation Note-concern is used to note a property of the two environments that may be a
concern to the project. The information recorded is not the property in question, but a concern
about the property difference and information about that concern.

Note-concern ____________________
ΔENVIRONMENTS
c? : PROP-NAME
e? : VALUE
--------------------
Update-property[ concerns / pl, concerns' / pl', c? / p?, e? / v? ]
existing-env' = existing-env
target-env' = target-env
diffs' = diffs
lacking' = lacking
new-features' = new-features

The  operation  Delete-concern  is  used  to  remove  a  concern  that  has  previously  been  recorded. 
This  operation  may  be  necessary  as  the  understanding  of  the  project,  system,  and  environments 
develops. 

Delete-concern ____________________
ΔENVIRONMENTS
c? : PROP-NAME
--------------------
Delete-property[ concerns / pl, concerns' / pl', c? / p? ]
existing-env' = existing-env
target-env' = target-env
diffs' = diffs
lacking' = lacking
new-features' = new-features


The  operation  List-concerns  is  used  to  obtain  a  list  of  all  the  concerns  about  environment 
differences  that  have  been  recorded. 


List-concerns ____________________
ΞENVIRONMENTS
list! : ℙ(PROP-NAME × VALUE)
--------------------
List-properties[ concerns / pl, concerns' / pl' ]


4.2.5  Inventory 

The inventory lists the objects that are to be re-engineered or used during a re-engineering project.
These objects must be identified and properties of these objects must be collected. The objects to
be collected vary based on how the system is viewed. There are at least two viewpoints: the logical
view and the physical view. Both logical and physical information must be recorded, as they will
be used throughout the project.

The logical viewpoint sees the system as composed of objects such as programs, JCL commands,
data stores, databases, and routines taken from libraries. There are also supporting objects such
as documents. Documents can be user manuals, reference guides, system requirement documents,
design documents, maintenance history records, etc. Other supporting objects include test plans,
which consist of test cases, test input data, and logged test output data.

The  physical  viewpoint  sees  that  system  objects  are  stored  in  files.  Files  have  properties  such 
as  name,  and  size,  as  well  as  record  layouts  for  datafiles,  programming  language  type  for  source 
code  files,  etc.  Documents  may  be  stored  in  files  or  perhaps  only  a  hardcopy  exists.  Test  plans  are 
typically  documents,  but  test  cases  are  typically  input  data  files  and  logged  output  data  files. 

To  formally  represent  the  inventory,  we  begin  by  naming  the  set  of  all  possible  system  objects. 
This  set  is  denoted  by  OBJECT. 

OBJECT  =  The  set  of  all  possible  system  objects 

Every object has a name or an id. To simplify things at this point we assume that all object names
are unique. The set of all possible object names is denoted by OBJ-NAME.

OBJ-NAME = The set of all possible object names

OBJ-NAME ⊆ NAME

Each object has a set of properties associated with it. One property that every object has is a
type. The type reflects the viewpoint on the object. Possible types include program, JCL script,
database, test plan, source code file, data file, document file, test data file, library
file, design document, etc. Furthermore, each object of the same type must have the same
properties, i.e. the same information should be collected for similar objects. This is specified by the
set OBJECT-PROPERTIES, which denotes the set of predefined property lists for different types
of objects.

OBJECT-PROPERTIES = The set of all predefined object property lists

OBJECT-PROPERTIES ⊆ PROP-LIST
∄ ol : OBJECT-PROPERTIES • ol = ∅
∀ ol : OBJECT-PROPERTIES • type ∈ dom ol

The predicates state that the property list for an object cannot be empty and must have a property
called type.

We define a schema INVENTORY which characterizes the system inventory information.

INVENTORY ____________________
inv-list : 𝔽 OBJ-NAME
inv-info : OBJ-NAME ⇸ PROP-LIST
--------------------
inv-list = dom inv-info
∀ obj1 : OBJ-NAME; obj2 : OBJ-NAME •
    inv-info(obj1)(type) = inv-info(obj2)(type) ⇒
        dom inv-info(obj1) = dom inv-info(obj2)


Notice that inv-list is defined as a finite set. This states that the number of objects associated with
a system is finite.

At the start of a project, no inventory information exists. The initial inventory information is
denoted by INIT-INVENTORY.

INIT-INVENTORY ____________________
INVENTORY
--------------------
inv-list = ∅
inv-info = ∅


Inventory  Operations 

The  system  inventory  is  manipulated  by  several  operations.  These  operations  are: 

•  Add-Inv-Object 

•  Del-Inv-Object 

•  Update-Inv-Obj-Prop 

•  Look-Inv-Obj-Prop

•  List-Inv-Obj-Props 

•  List-Inv-Objects 

Notice  there  are  no  operations  to  add  or  delete  properties  for  an  object.  The  type  of  an  object 
determines  the  properties  that  object  possesses. 

To denote that the inventory is changed by an operation, the following notation is used:

ΔINVENTORY = [ INVENTORY, INVENTORY' ]

To denote that the inventory is not changed by an operation, the following notation is used:

ΞINVENTORY = [ INVENTORY, INVENTORY' | INVENTORY' = INVENTORY ]

To add a new object to the inventory, the Add-Inv-Object operation is used. By using this
operation, a new object is added and its property list is initialized.

Add-Inv-Object ____________________
ΔINVENTORY
object? : OBJ-NAME
type? : VALUE
--------------------
object? ∉ inv-list
∃ empty-type-list : OBJECT-PROPERTIES • empty-type-list(type) = type?
    inv-list' = inv-list ∪ { object? }
    inv-info' = inv-info ∪ { object? ↦ empty-type-list }

Here empty-type-list is a predefined initial property list for the type of object added to the
inventory. This helps assure that all property lists for objects of the same type hold the same
properties.
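
The role of the predefined property lists can be sketched in Python as follows. This is an illustration under stated assumptions, not the report's implementation: the template table OBJECT_PROPERTIES, its two example types, and their property names are all hypothetical, and a failed precondition is reported as an exception.

```python
# Hypothetical predefined property lists, one per object type.
OBJECT_PROPERTIES = {
    "program": {"type": "program", "language": None, "size": None},
    "document": {"type": "document", "title": None},
}


class Inventory:
    """Illustrative model of the INVENTORY schema."""

    def __init__(self):
        # INIT-INVENTORY: no objects are recorded.
        self.inv_info = {}

    @property
    def inv_list(self):
        # Invariant: inv-list is exactly the domain of inv-info.
        return set(self.inv_info)

    def add_object(self, name, obj_type):
        # Precondition of Add-Inv-Object: the name is not yet in the inventory.
        if name in self.inv_info:
            raise KeyError(f"object already in inventory: {name}")
        # Copy the template so objects of one type share property names, not values.
        self.inv_info[name] = dict(OBJECT_PROPERTIES[obj_type])

    def update_prop(self, name, prop, value):
        # Only properties fixed by the object's type may be updated.
        props = self.inv_info[name]
        if prop not in props:
            raise KeyError(f"property not defined for this type: {prop}")
        props[prop] = value
```

Because every object is created from the template for its type, two objects of the same type always carry the same set of property names, which is the condition stated in the INVENTORY predicate.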

The operation Del-Inv-Object is used to remove an object from the inventory.

Del-Inv-Object ____________________
ΔINVENTORY
object? : OBJ-NAME
--------------------
object? ∈ inv-list
inv-list' = inv-list ∖ { object? }
inv-info' = { object? } ⩤ inv-info


Since the list of properties is fixed for each type of object, the ability to add or delete a property
is not needed. However, it will be necessary to update the value of a property. This can be done
with the Update-Inv-Obj-Prop operation.

Update-Inv-Obj-Prop ____________________
ΔINVENTORY
object? : OBJ-NAME
p? : PROP-NAME
v? : VALUE
--------------------
object? ∈ inv-list
inv-list' = inv-list
Update-property[ inv-info(object?) / pl, inv-info'(object?) / pl' ]


The operation Look-Inv-Obj-Prop provides the ability to examine the value of an object property.

Look-Inv-Obj-Prop ____________________
ΞINVENTORY
object? : OBJ-NAME
p? : PROP-NAME
v! : VALUE
--------------------
object? ∈ inv-list
Get-property[ inv-info(object?) / pl, inv-info(object?) / pl' ]


The operation List-Inv-Obj-Props returns a list of the properties and property values associated
with the specified object.

List-Inv-Obj-Props ____________________
ΞINVENTORY
object? : OBJ-NAME
list! : ℙ(PROP-NAME × VALUE)
--------------------
object? ∈ inv-list
List-properties[ inv-info(object?) / pl, inv-info(object?) / pl' ]


The operation List-Inv-Objects returns a list of all the objects in the inventory.

List-Inv-Objects ____________________
ΞINVENTORY
list! : 𝔽 OBJ-NAME
--------------------
list! = inv-list


4.2.6  Tasks 

A  project  consists  of  a  collection  of  tasks  that  must  be  performed.  Identifying  the  necessary 
tasks,  staffing,  supporting,  and  scheduling  these  tasks  is  the  responsibility  of  project  management. 
Projects  are  often  organized  around  project  plans  and  project  phases.  Here,  these  concepts  are 
formalized  as  tasks. 

Project tasks form a hierarchy. There are major tasks such as reverse engineer the source code,
document the target system, and plan the project. Each major task organizes a collection of
sub-tasks. For example, to reverse engineer the source code, sub-tasks for collecting the source code
files, static analysis of source code, and recovering a design are required.

Each  task  has  a  goal.  Each  task  specifies  a  procedure  to  be  used  to  achieve  the  goal  of  that 
task.  A  task  procedure  may  be  a  loose  set  of  guidelines  to  direct  a  human  or  group  of  humans 
through  the  task  or  it  may  be  a  specific  set  of  directions.  A  task  procedure  may  specify  sub-tasks 
that  must  be  done  to  fulfill  the  purpose  of  the  current  task. 

A  task  serves  several  purposes  within  a  project.  The  project  manager  creates  and  defines  project 
tasks.  The  project  manager  uses  tasks  to  represent  the  project  plan.  Tasks  are  used  in  project 
tracking.  To  the  project  manager  tasks  are  objects  that  can  be  manipulated.  Thus  tasks  are  viewed 
as  static  objects.  Tasks  are  also  dynamic  objects.  Project  tracking  requires  that  the  real  start  and 
stop  dates  for  a  task  be  recorded.  The  time  personnel  spend  on  a  task  must  be  recorded.  Project 
personnel  use  tasks  as  directives.  A  task  specifies  to  personnel  what  work  is  to  be  done  and  how 
it  can  be  achieved.  This  is  an  interesting  notion.  A  task  provides  a  control  structure  for  a  portion 
of  the  work  done  during  a  project.  People  "execute"  a  task  or  sometimes  a  task  can  be  automated 
and  executed  as  a  program.  Sometimes,  portions  of  a  task  can  be  automated,  creating  tools  used 
by  humans  to  achieve  the  goal  of  a  task. 

There is a collection of information associated with any task. This information divides into
several groups, each of which can be formalized. The three groups explained below are: planning,
tracking, and personnel information.

Each task fits into the project plan and is associated with planning information. Each task has
a scheduled start and stop date. Each task has an estimate of the total number of hours that will
be worked by personnel on the task. Each task has a pre-condition. The pre-condition states the
conditions that must be satisfied before the task can begin. This may specify that approval from
the project manager must be given, or that certain other tasks must first be completed. Each task
has a post-condition that states the goal of the task. The post-condition must be satisfied for the
task to be considered complete.

To  formalize  this  planning  information  several  sets  must  be  defined.  The  relation  AFTER 
specifies  an  ordering  among  dates,  namely  which  dates  come  later  than  others.  Project  time,  as  in 
the  number  of  hours  worked  or  the  estimated  number  of  hours  for  a  task,  can  be  represented  by  a 
number,  but  for  clarity  we  use  the  set  TIME. 

AFTER : DATE ↔ DATE
TIME == ℕ

Planning  information  for  a  task  includes  a  description  about  how  the  task  work  will  be  done.  This 
includes  scheduling  and  planning  for  sub-tasks  that  must  be  done  for  a  task  to  fulfill  its  goal.  The 
information  about  how  a  task  can  be  performed  is  stored  as  a  procedure.  The  task  procedure  (as 
formalized  by  PROCEDURE)  specifies  how  the  task  is  to  be  achieved.  The  degree  of  guidance 
given  by  the  procedure  will  vary.  Some  tasks  can  be  automated  by  tools.  Other  tasks  have  steps 
that  require  human  judgement  or  human  action.  Reviews  are  a  good  example.  The  steps  of  a 
review  can  be  specified,  but  the  work  must  be  done  by  humans.  However,  the  sub-task  of  notifying 
review  participants  can  be  automated  by  using  e-mail. 

Identifying the tools required by a task is part of project planning. For many tasks there is a
collection of tools that can be used to help achieve the task goal. For example, a documentation
task will need tools for editing, browsing, and text formatting. By specifying the tools needed by
a task, permission to use these tools can be granted to the people assigned to the task. The set of
all possible tools is denoted by TOOL.

TOOL = The set of all possible tools

Each tool has a name that uniquely identifies it. The set of all possible tool names is denoted by
TOOL-NAME.

TOOL-NAME = The set of all possible tool names
TOOL-NAME ⊆ NAME


Task planning information is formalized by the schema TASK-PLAN.

TASK-PLAN ____________________
planned-start : DATE
planned-stop : DATE
planned-time : TIME
start-condition : CONDITION
goal : CONDITION
task-proc-name : PROC-NAME
tools : 𝔽 TOOL-NAME
--------------------
planned-stop ≠ ε ⇒ planned-stop AFTER planned-start


As a task is performed, project tracking will collect information about the task. The actual
start and stop dates must be recorded. These dates may differ from the planned start and stop
dates. The status of the task must be maintained. A task may be inactive, i.e. waiting for work
to begin; in-progress (active), meaning that work is being done; or completed, meaning that the
task goal has been satisfied and no work on the task continues. In rare cases a task can be
suspended due to a delay in a project. Task status information is denoted by the set TSTATUS.

TSTATUS  =  {  inactive,  in-progress,  completed,  suspended  } 

Task tracking information is formalized by the schema TASK-TRACK.

TASK-TRACK ____________________
actual-start : DATE
actual-stop : DATE
status : TSTATUS
--------------------
actual-stop ≠ ε ⇒ actual-stop AFTER actual-start
actual-stop ≠ ε ⇔ status = completed
status ≠ inactive ⇒ actual-start ≠ ε


During a project, personnel will be assigned to tasks. Assigning people to tasks occurs during
project planning and throughout a project. Each person is identified by a name. The set of all
people names is denoted by PERSON-NAME.

PERSON-NAME = The set of all possible people names

PERSON-NAME ⊆ NAME

Each task has a person who is assigned as the task leader. Additional people may also be
assigned to a task. Task planning includes an estimate of the amount of time each person will work
on a task. Tracking information is also associated with personnel information. People spend time
working on a task and the hours worked must be recorded. Personnel assigned to a task have a
status. At first, personnel may be tentatively assigned; later they may become active on a
task; and later still they may finish the task or be re-assigned to another project. The set of
possible personnel statuses is denoted by PSTATUS.

PSTATUS = { assigned, active, inactive, ... }

For each person both planning and tracking information must be recorded. This information is
formalized by the partial function PERSON_INFO. The tuple lists a person's status, estimated
number of hours worked, and recorded number of hours worked.

PERSON_INFO = PERSON-NAME ⇸ PSTATUS × TIME × TIME

Information about personnel assigned to a task is formalized by the schema TASK-PERSONNEL.
The predicate for this schema states that if the task leader has been assigned then information
about the task leader must be present in the worker information function.

TASK-PERSONNEL ____________________
task-leader : PERSON-NAME
workers : PERSON_INFO
--------------------
task-leader ≠ ε ⇒ task-leader ∈ dom workers


The parts described above are assembled into a schema that defines the properties of a task.

TASK ____________________
TASK-PLAN
TASK-TRACK
TASK-PERSONNEL
--------------------
status = completed ⇒ goal
status ≠ inactive ⇒ task-leader ≠ ε


The first predicate states that the goal of a task must be satisfied before the task can be
considered completed. The second predicate states that if the task has been started (its status
is no longer inactive) then a task leader must have been assigned.
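
The predicates of TASK-PLAN, TASK-TRACK, TASK-PERSONNEL, and TASK can be collected into a single consistency check. The Python sketch below is one possible reading and rests on several assumptions: the unassigned value ε is modeled as None, AFTER is modeled as the ordering on datetime.date, goal is reduced to a boolean, an unassigned start date is treated as failing AFTER, and the class and message names are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional


@dataclass
class Task:
    planned_start: Optional[date] = None
    planned_stop: Optional[date] = None
    actual_start: Optional[date] = None
    actual_stop: Optional[date] = None
    status: str = "inactive"  # one of the TSTATUS values
    goal: bool = False        # assumption: the goal condition, already evaluated
    task_leader: Optional[str] = None
    workers: dict = field(default_factory=dict)

    def violations(self):
        """Return the list of schema predicates that this task state breaks."""
        def after(later, earlier):
            return earlier is not None and later > earlier

        found = []
        if self.planned_stop is not None and not after(self.planned_stop, self.planned_start):
            found.append("planned-stop not AFTER planned-start")
        if self.actual_stop is not None and not after(self.actual_stop, self.actual_start):
            found.append("actual-stop not AFTER actual-start")
        if (self.actual_stop is not None) != (self.status == "completed"):
            found.append("actual-stop must be set exactly when completed")
        if self.status != "inactive" and self.actual_start is None:
            found.append("started task lacks actual-start")
        if self.task_leader is not None and self.task_leader not in self.workers:
            found.append("task leader not in workers")
        if self.status == "completed" and not self.goal:
            found.append("completed without goal")
        if self.status != "inactive" and self.task_leader is None:
            found.append("started task lacks leader")
        return found
```

A freshly created task, with every field unassigned and status inactive, satisfies all of the predicates, which matches the intent of starting each task in an unassigned state.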

Operations  on  PERSON_INFO 

Before defining operations on tasks, a few operations on task personnel information must be defined.
Information about workers is modeled by a partial function that, given an employee's name, returns
a tuple of information. Operations for extracting information from such a tuple are:

•  Pstatus

•  Est_time

•  Rec_time

The Pstatus operation takes a personnel information tuple and returns the specified person's
status.

Pstatus ____________________
( s : PSTATUS, et : TIME, wt : TIME )?
status! : PSTATUS
--------------------
status! = s


The Est_time operation takes a personnel information tuple and returns the estimated work
hours for the specified person.

Est_time ____________________
( s : PSTATUS, et : TIME, wt : TIME )?
time! : TIME
--------------------
time! = et


The Rec_time operation takes a personnel information tuple and returns the recorded number
of work hours for the specified person.

Rec_time ____________________
( s : PSTATUS, et : TIME, wt : TIME )?
time! : TIME
--------------------
time! = wt
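
The three projection operations correspond to field access on a tuple. A minimal Python sketch, assuming the tuple is modeled as a NamedTuple whose field names (status, est_time, rec_time) are chosen here for illustration:

```python
from typing import NamedTuple


class PersonInfo(NamedTuple):
    """One PERSON_INFO tuple: (status, estimated hours, recorded hours)."""
    status: str
    est_time: int
    rec_time: int


def pstatus(info: PersonInfo) -> str:
    # Pstatus: first component of the tuple.
    return info.status


def est_time(info: PersonInfo) -> int:
    # Est_time: estimated work hours.
    return info.est_time


def rec_time(info: PersonInfo) -> int:
    # Rec_time: recorded work hours.
    return info.rec_time
```

A workers function would then be a dictionary from person names to PersonInfo values.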


Task  Operations 

There  are  a  number  of  necessary  and  useful  operations  that  can  be  performed  on  a  task.  Operations 
on  task  planning  information  include: 

•  Create-task 

•  Assign-task-dates 

•  Assign-est-time 

•  Assign-start-condition 

•  Assign- task-goal 

•  Assign-task-procedure 

•  Get-task-dates

•  Get-task-est-time

•  Get-task-start-cond

•  Get-task-goal

•  Get-task-procedure

•  Add-task-tool 

•  Delete-task-tool 

•  List-task-tools 

Operations  on  task  tracking  information  include: 

•  Start-task 

•  Stop-task 

•  Suspend-task 

•  Get-task-status 

Operations  on  task  personnel  information  include: 

•  Assign-task-leader 

•  Add-task-personnel 

•  Remove-task-personnel 

•  Set-task-personnel-status 

•  Get-task-personnel-info

•  Record-personnel-time 

•  Get-total-task-time 

•  List-task-personnel 

To denote that an operation on a TASK changes information about the TASK, the following
notation is used to denote which section of information is changed.

ΔTASK-PLAN = [ TASK-PLAN, TASK-PLAN' ]

ΔTASK-TRACK = [ TASK-TRACK, TASK-TRACK' ]

ΔTASK-PERSONNEL = [ TASK-PERSONNEL, TASK-PERSONNEL' ]

To denote that an operation on a TASK does not change the TASK information, the following
notation is used:

ΞTASK-PLAN = [ TASK-PLAN, TASK-PLAN' | TASK-PLAN' = TASK-PLAN ]

ΞTASK-TRACK = [ TASK-TRACK, TASK-TRACK' | TASK-TRACK' = TASK-TRACK ]

ΞTASK-PERSONNEL = [ TASK-PERSONNEL, TASK-PERSONNEL' | TASK-PERSONNEL' = TASK-PERSONNEL ]

The operation Create-task is used to create a new task. All task variables are set to specify that
they are initially unassigned.

Create-task ____________________
TASK
ΔTASK-PLAN
ΔTASK-TRACK
ΔTASK-PERSONNEL
--------------------
planned-start' = ε
planned-stop' = ε
planned-time' = 0
start-condition' = false
goal' = false
task-proc-name' = ε
tools' = ∅

actual-start' = ε
actual-stop' = ε
status' = inactive

task-leader' = ε
workers' = ∅


Each element in the TASK schema needs to be set during a project. Thus an "assign value"
operation is needed for each element. Only a few of these operations are given here since they
are all similar. The first operation of this type is Assign-task-dates. This operation is used when
project planning has determined the estimated starting and stopping times for a task.

Assign-task-dates ____________________
TASK
ΔTASK-PLAN
ΞTASK-TRACK
ΞTASK-PERSONNEL
start? : DATE
stop? : DATE
--------------------
planned-start' = start?
planned-stop' = stop?
planned-time' = planned-time
start-condition' = start-condition
goal' = goal
task-proc-name' = task-proc-name
tools' = tools


The operation Add-task-tool adds a new tool to the list of tools that can be used by a task.

Add-task-tool ____________________
TASK
ΔTASK-PLAN
ΞTASK-TRACK
ΞTASK-PERSONNEL
tool? : TOOL-NAME
--------------------
tools' = tools ∪ { tool? }
planned-start' = planned-start
planned-stop' = planned-stop
planned-time' = planned-time
start-condition' = start-condition
goal' = goal
task-proc-name' = task-proc-name


The next "assign value" operation is Start-task. This operation records the date on which the
task work started. It also verifies that the starting condition is satisfied. If everything is all right
the task status is upgraded to active. Error handling is not currently specified, i.e., what happens
if the start-condition is not satisfied.

_ Start-task _

TASK
Ξ TASK-PLAN
Δ TASK-TRACK
Ξ TASK-PERSONNEL
start? : DATE

start-condition ⇒ actual-start' = start? ∧ status' = in-progress
¬ start-condition ⇒ actual-start' = actual-start ∧ status' = status

actual-stop' = actual-stop
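
The two implications in Start-task amount to a guarded update of the tracking section. The sketch below is illustrative only: `TaskTrack` is a hypothetical record holding just the TASK-TRACK variables, dates are plain strings, and the start condition is passed in as a boolean because its evaluation is not specified here.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class TaskTrack:
    # Hypothetical stand-in for the TASK-TRACK section of a task.
    actual_start: Optional[str] = None
    actual_stop: Optional[str] = None
    status: str = "inactive"


def start_task(track: TaskTrack, start_condition: bool, start: str) -> TaskTrack:
    """Start-task: record the start date only if the start condition holds."""
    if start_condition:
        # actual-start' = start? and status' = in-progress
        return replace(track, actual_start=start, status="in-progress")
    # Condition not satisfied: actual-start and status are left unchanged.
    return track
```

Stop-task has the same shape, with `goal` as the guard, `actual_stop` as the updated field, and `completed` as the new status.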


The operation Assign-task-leader is used to specify the task leader. The person who is to be
the task leader must already be entered in the list of workers assigned to the task.

_ Assign-task-leader _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Δ TASK-PERSONNEL
leader? : PERSON-NAME

leader? ∈ dom workers ⇒ task-leader' = leader?
leader? ∉ dom workers ⇒ task-leader' = task-leader

workers' = workers


The final example of an "assign value" operation is Add-task-personnel. This operation is used
to assign a new person to a task. This operation cannot be used to change information about a
person already assigned to the task.

_ Add-task-personnel _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Δ TASK-PERSONNEL
person? : PERSON-NAME
status? : PSTATUS
hours? : ASSIGNED-HOURS

person? ∉ dom workers ⇒
    workers' = workers ∪ { person? ↦ ( status?, hours?, 0 ) }
person? ∈ dom workers ⇒ workers' = workers

task-leader' = task-leader


The operation Stop-task records the date on which the task work ends. A task can end when
its goal has been satisfied.

_ Stop-task _

TASK
Ξ TASK-PLAN
Δ TASK-TRACK
Ξ TASK-PERSONNEL
stop? : DATE

goal ⇒ actual-stop' = stop? ∧ status' = completed
¬ goal ⇒ actual-stop' = actual-stop ∧ status' = status

actual-start' = actual-start


The operation Remove-task-personnel is used to remove a person from a task. When a person
is taken off a task, any hours worked by that person on that task must be saved, so only the status
of the person is changed.

_ Remove-task-personnel _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Δ TASK-PERSONNEL
person? : PERSON-NAME

person? ∈ dom workers

workers' = workers ⊕ { person? ↦
    ( inactive, Est-time(workers(person?)), Rec-time(workers(person?)) ) }
task-leader' = task-leader


The operation Record-personnel-time is used to log the hours worked by a person on a task.

_ Record-personnel-time _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Δ TASK-PERSONNEL
person? : PERSON-NAME
newhours? : TIME

workers' = workers ⊕ { person? ↦
    ( Pstatus(workers(person?)), Est-hours(workers(person?)),
      Rec-time(workers(person?)) + newhours? ) }

task-leader' = task-leader
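
The three personnel operations above all rewrite a single entry of the `workers` function. A minimal sketch follows, assuming that `workers` is represented as a dictionary from person name to a `(status, estimated hours, recorded hours)` tuple. Raising `KeyError` for an unknown person is an assumption of this sketch, since the schemas leave error handling unspecified.

```python
from typing import Dict, Tuple

Worker = Tuple[str, float, float]  # (status, estimated hours, recorded hours)
Workers = Dict[str, Worker]


def add_task_personnel(workers: Workers, person: str, status: str, hours: float) -> Workers:
    """Add-task-personnel: add a new person; an existing entry is left untouched."""
    if person in workers:
        return dict(workers)
    return {**workers, person: (status, hours, 0.0)}


def remove_task_personnel(workers: Workers, person: str) -> Workers:
    """Remove-task-personnel: keep the hours, change only the status."""
    if person not in workers:
        raise KeyError(person)  # assumed handling of the precondition
    _status, est, rec = workers[person]
    return {**workers, person: ("inactive", est, rec)}


def record_personnel_time(workers: Workers, person: str, new_hours: float) -> Workers:
    """Record-personnel-time: add new_hours to the person's recorded time."""
    status, est, rec = workers[person]
    return {**workers, person: (status, est, rec + new_hours)}
```

Each function returns a new dictionary, which mirrors the before and after states (`workers` and `workers'`) of the schemas.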


The operation Get-task-procedure is used to retrieve information about the procedure associated
with a task.

_ Get-task-procedure _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Ξ TASK-PERSONNEL
Ξ PROJECT-PROCEDURES
task-procedure! : PROC-NAME

task-procedure! = task-proc-name


The operation Get-task-status is used to look up the status of a task.

_ Get-task-status _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Ξ TASK-PERSONNEL
state! : TSTATUS

state! = status


The operation Get-total-task-time is used to calculate the total number of hours that have been
worked on a task. This number is determined by summing the hours worked by each person who
has worked on the task.

_ Get-total-task-time _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Ξ TASK-PERSONNEL
total-time! : TIME

total-time! = Σ w ∈ dom workers • Rec-time(workers(w))
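
As a worked instance of the summation, assume again that `workers` is a dictionary whose values are `(status, estimated hours, recorded hours)` tuples. This representation is an assumption of the sketch and is not fixed by the specification.

```python
from typing import Dict, Tuple


def get_total_task_time(workers: Dict[str, Tuple[str, float, float]]) -> float:
    """Get-total-task-time: sum the recorded hours of everyone on the task."""
    return sum(rec for (_status, _est, rec) in workers.values())
```

People whose status is inactive still contribute, because Remove-task-personnel preserves their recorded hours.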


The operation List-task-personnel is used to show all personnel assigned to a task.

_ List-task-personnel _

TASK
Ξ TASK-PLAN
Ξ TASK-TRACK
Ξ TASK-PERSONNEL
personnel! : F PERSON-NAME

personnel! = dom workers

4.2.7 Project Plan

The  project  plan  defines  the  project  structure.  The  plan  specifies  how  work  will  be  conducted, 
determines  phases  of  the  project,  tasks  involved  in  each  phase,  and  how  the  tasks  are  to  be  done. 
The  plan  states  how  the  target  system  will  be  produced.  Defining  the  project  plan  is  a  major 
portion  of  the  project  planning  work. 

The  basic  unit  of  project  planning  is  the  task.  The  project  approach  defines  the  tasks  that 
must  be  done  to  reach  the  target  system.  The  concept  of  a  task  has  been  formalized  by  the  schema 
TASK. Each task is given a name. The set TASK-NAME denotes the set of all possible task names.

TASK-NAME = The set of all possible task names

TASK-NAME ⊆ NAME

The set of all tasks to be used within a project is denoted by PROJECT-TASK.

PROJECT-TASK : TASK-NAME ⤔ TASK

Notice that ⤔ denotes a one-to-one partial function. This states that each task in a project is
unique. Some tasks may occur several times in a project, such as document-program, but these tasks
will be performed on different data each time, making the different instances of a task unique.

Initially, there are no tasks planned for a project. The initial collection of project tasks is
denoted by INIT-PROJECT-TASK.

INIT-PROJECT-TASK = ∅

A large project is typically divided into phases. For re-engineering, typical phases include reverse
engineering, re-design, re-implementation, testing, system transition, and documentation. In
addition, a large system may be re-engineered in pieces, with one portion of the system re-engineered
before work begins on the next. An entire system may be re-engineered in increments, where each
increment produces a better version of the system and slowly converges toward the desired target
system.

A  phase  is  considered  to  be  a  set  of  tasks.  All  the  tasks  in  a  phase  are  presumed  to  work  towards 
a  common  phase  goal.  The  concept  of  a  project  phase  is  denoted  by  PHASE. 

PHASE ⊆ dom PROJECT-TASK

PHASE ≠ ∅

The project plan is modeled by a partial function from names to phases. This function is used
to name each phase. In addition, the name assigned to a phase matches the name of a task. The
task with this name is the phase task and the directions for performing the phase are contained
within the task information.

_ PROJECT-PLAN _

plan : TASK-NAME ⇸ PHASE

dom plan ⊆ dom PROJECT-TASK

When the project starts, the PROJECT-PLAN is already initialized. The plan specifies one
phase, the analysis phase, and only those tasks that are used in that phase. One result of defining
the project approach is that the project plan is expanded to specify the entire project. The issue
of how the initial project plan is defined is left outside the scope of the re-engineering process.

Project  Plan  Operations 

There are several operations on the project plan that can be defined. These are:

•  Create-plan-phase

•  Delete-plan-phase

•  Get-plan-phase

•  List-plan-phases

•  Add-phase-task

•  Delete-phase-task

•  List-phase-tasks

There are several operations that can be defined on the project tasks also. These are:

•  Create-proj-task

•  Delete-proj-task

•  Get-proj-task

•  List-proj-tasks

To denote that an operation changes information about the project tasks or the project plan
the following notation is used:

Δ PROJECT-TASK = [ PROJECT-TASK, PROJECT-TASK' ]

Δ PROJECT-PLAN = [ PROJECT-PLAN, PROJECT-PLAN' ]

To denote that an operation does not change information about the project tasks or the project
plan the following notation is used:

Ξ PROJECT-TASK = [ PROJECT-TASK, PROJECT-TASK' |
    PROJECT-TASK' = PROJECT-TASK ]

Ξ PROJECT-PLAN = [ PROJECT-PLAN, PROJECT-PLAN' |
    PROJECT-PLAN' = PROJECT-PLAN ]

The operation Create-plan-phase creates a new phase and adds it to the project plan. Remember
that each phase is a collection of task names and each phase is named after a major task. The task
that a phase is named after is part of the phase.

_ Create-plan-phase _

Δ PROJECT-PLAN
Ξ PROJECT-TASK
task? : TASK-NAME

task? ∈ dom PROJECT-TASK
task? ∉ dom plan

plan' = plan ∪ { task? ↦ {task?} }


The operation Delete-plan-phase deletes a phase from the project plan.

_ Delete-plan-phase _

Δ PROJECT-PLAN
phase? : TASK-NAME

phase? ∈ dom plan
plan' = {phase?} ⩤ plan


The operation Add-phase-task adds a task to a phase.

_ Add-phase-task _

Δ PROJECT-PLAN
Ξ PROJECT-TASK
phase? : TASK-NAME
task? : TASK-NAME

task? ∈ dom PROJECT-TASK
phase? ∈ dom plan
task? ∉ plan(phase?)

plan' = plan ⊕ { phase? ↦ plan(phase?) ∪ {task?} }
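
The plan operations above translate directly into updates of a name-to-phase mapping. The following is a sketch only: the plan is assumed to be a dictionary from phase name to a frozen set of task names, the set of known project task names is passed in explicitly, and violated preconditions raise `ValueError`, which is a choice made here because the schemas do not describe error handling.

```python
from typing import Dict, FrozenSet, Set

Plan = Dict[str, FrozenSet[str]]  # phase name -> task names in that phase


def create_plan_phase(plan: Plan, project_tasks: Set[str], task: str) -> Plan:
    """Create-plan-phase: the new phase is named after, and contains, its phase task."""
    if task not in project_tasks or task in plan:
        raise ValueError("task must be a project task that does not yet name a phase")
    return {**plan, task: frozenset({task})}


def delete_plan_phase(plan: Plan, phase: str) -> Plan:
    """Delete-plan-phase: remove the phase from the domain of the plan."""
    if phase not in plan:
        raise ValueError("unknown phase")
    return {name: tasks for name, tasks in plan.items() if name != phase}


def add_phase_task(plan: Plan, project_tasks: Set[str], phase: str, task: str) -> Plan:
    """Add-phase-task: override the phase entry with one that includes the new task."""
    if task not in project_tasks or phase not in plan or task in plan[phase]:
        raise ValueError("precondition of Add-phase-task violated")
    return {**plan, phase: plan[phase] | {task}}
```

The dictionary spread `{**plan, phase: ...}` plays the role of the override operator ⊕, and the comprehension in `delete_plan_phase` plays the role of domain subtraction ⩤.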


The operation Create-proj-task creates a new task and associates it with the list of project tasks.

_ Create-proj-task _

Δ PROJECT-TASK
task? : TASK-NAME

task? ∉ dom PROJECT-TASK

PROJECT-TASK' = PROJECT-TASK ∪ { task? ↦ Create-task }


The operation Delete-proj-task removes a task from the list of project tasks. A task cannot be
deleted while it is part of the project plan.

_ Delete-proj-task _

Δ PROJECT-TASK
Ξ PROJECT-PLAN
task? : TASK-NAME

task? ∈ dom PROJECT-TASK
task? ∉ dom PROJECT-PLAN

∀ p : TASK-NAME • p ∈ dom PROJECT-PLAN ∧ task? ∉ PROJECT-PLAN(p)
PROJECT-TASK' = {task?} ⩤ PROJECT-TASK


4.2.8  Acceptance  Criteria 

The  project  team  and  the  client  must  reach  an  agreement  that  specifies  the  conditions  under  which 
the  target  system  will  be  accepted.  These  conditions  are  often  called  the  acceptance  criteria.  It 
is  not  sufficient  to  state  that  the  target  system  will  successfully  pass  all  system  tests  or  that  the 
target  system  will  correctly  implement  the  system  requirements.  The  acceptance  criteria  should 
complement,  not  duplicate  the  internal  quality  assurance  procedures.  Acceptance  criteria  can 
specify  a  broad  range  of  conditions  that  must  be  satisfied  before  the  target  system  will  be  accepted. 
Possible  criteria  include: 

•  Functionality of the existing system that was not scheduled to be changed (i.e., the
requirements for the feature are the same in both the existing and the target specifications) will
operate identically in both the existing and target systems.

•  Each document for the existing system will either be rewritten completely or updated to
account for differences between the existing and target systems.

•  The test cases used will execute a specified percentage of the system source code.

•  Capability  xyz  of  the  target  system  will  execute  within  its  timing  constraints  for  the  following 
test  cases  ... 

•  The  target  system  will  have  a  reliability  level  such  that  it  only  fails  once  per  specified  unit  of 
time. 

•  Changes  to  the  user  interface  will  be  documented. 

The  set  of  all  possible  criteria  is  denoted  as  CRITERIA. 

CRITERIA = The set of all possible criteria

The concept of acceptance criteria is formalized as a one-to-one partial function from labels to
criteria. Each criterion in the acceptance criteria must have a unique label. The nature of a
criterion is vague. A criterion is a requirement or condition that must be satisfied by the system
and related products, such as documentation.

ACCEPT-CRITERIA : LABEL ⤔ CRITERIA

Initially, no acceptance criteria are specified. The initial acceptance criteria are denoted by
INIT-ACCEPT-CRITERIA.

INIT-ACCEPT-CRITERIA = ∅


Acceptance  Criteria  Operations 

There  are  several  basic  operations  that  can  operate  on  the  acceptance  criteria  information.  These 
operations  are: 

•  Add-accept-criterion 

•  Delete-accept-criterion 

•  Update-accept-crit-label

•  Update-accept-criterion

•  Get-accept-criterion

•  List-accept-criteria 

To denote that an operation changes the acceptance criteria information the Δ notation is used:

Δ ACCEPT-CRITERIA = [ ACCEPT-CRITERIA, ACCEPT-CRITERIA' ]

To denote that an operation does not change the acceptance criteria information the Ξ notation
is used:

Ξ ACCEPT-CRITERIA = [ ACCEPT-CRITERIA, ACCEPT-CRITERIA' |
    ACCEPT-CRITERIA' = ACCEPT-CRITERIA ]

The operation Add-accept-criterion is used to add a new criterion to the list of acceptance
criteria.

_ Add-accept-criterion _

Δ ACCEPT-CRITERIA
label? : LABEL
crit? : CRITERIA

label? ∉ dom ACCEPT-CRITERIA
crit? ∉ ran ACCEPT-CRITERIA

ACCEPT-CRITERIA' = ACCEPT-CRITERIA ∪ { label? ↦ crit? }


The operation Delete-accept-criterion is used to remove a criterion from the listed acceptance
criteria.

_ Delete-accept-criterion _

Δ ACCEPT-CRITERIA
label? : LABEL

label? ∈ dom ACCEPT-CRITERIA

ACCEPT-CRITERIA' = { label? } ⩤ ACCEPT-CRITERIA


The operation Update-accept-crit-label is used to change the label assigned to a criterion.

_ Update-accept-crit-label _

Δ ACCEPT-CRITERIA
old-label? : LABEL
new-label? : LABEL

old-label? ∈ dom ACCEPT-CRITERIA
new-label? ∉ dom ACCEPT-CRITERIA

ACCEPT-CRITERIA' = ( { old-label? } ⩤ ACCEPT-CRITERIA ) ∪
    { new-label? ↦ ACCEPT-CRITERIA(old-label?) }


The operation Update-accept-criterion is used to change the criterion associated with a label.
The idea is that a criterion has been updated, but not completely discarded and replaced.

_ Update-accept-criterion _

Δ ACCEPT-CRITERIA
label? : LABEL
crit? : CRITERIA

label? ∈ dom ACCEPT-CRITERIA
crit? ∉ ran ACCEPT-CRITERIA

ACCEPT-CRITERIA' = ACCEPT-CRITERIA ⊕ { label? ↦ crit? }


The operation Get-accept-criterion is used to retrieve the criterion associated with the specified
label.

_ Get-accept-criterion _

Ξ ACCEPT-CRITERIA
label? : LABEL
crit! : CRITERIA

label? ∈ dom ACCEPT-CRITERIA
crit! = ACCEPT-CRITERIA(label?)


The operation List-accept-criteria is used to obtain a list of all the acceptance criteria.

_ List-accept-criteria _

Ξ ACCEPT-CRITERIA
list! : F ( LABEL × CRITERIA )

list! = { l : LABEL; c : CRITERIA | ACCEPT-CRITERIA(l) = c }
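
Because ACCEPT-CRITERIA is a one-to-one partial function, every update has to keep both labels and criteria unique. A minimal sketch of three of the operations follows, assuming that criteria are represented as strings in a dictionary keyed by label and that a violated precondition raises `ValueError`. Both choices are assumptions of this sketch.

```python
from typing import Dict, Set, Tuple

Criteria = Dict[str, str]  # label -> criterion text


def add_accept_criterion(criteria: Criteria, label: str, crit: str) -> Criteria:
    """Add-accept-criterion: both the label and the criterion must be new."""
    if label in criteria or crit in criteria.values():
        raise ValueError("label or criterion already present")
    return {**criteria, label: crit}


def update_accept_crit_label(criteria: Criteria, old_label: str, new_label: str) -> Criteria:
    """Update-accept-crit-label: move a criterion from old_label to new_label."""
    if old_label not in criteria or new_label in criteria:
        raise ValueError("precondition of Update-accept-crit-label violated")
    renamed = {lab: c for lab, c in criteria.items() if lab != old_label}
    renamed[new_label] = criteria[old_label]
    return renamed


def list_accept_criteria(criteria: Criteria) -> Set[Tuple[str, str]]:
    """List-accept-criteria: the set of (label, criterion) pairs."""
    return set(criteria.items())
```

Checking `crit in criteria.values()` is what enforces the one-to-one property; a plain dictionary only guarantees unique labels.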


4.2.9  Project  Procedures 

One  result  of  project  planning  is  the  development  and/or  collection  of  procedures  that  are  to  be 
followed  during  a  project.  Many  software  organizations  typically  have  a  set  of  standard  procedures 
that  define  a  way  of  doing  things  within  the  organization.  Procedures  may  describe  the  correct  way 
to  extract  information  from  a  configuration  management  system,  provide  directions  about  how  to 
report  errors,  or  describe  how  to  set  up  and  conduct  a  review  session. 

Many re-engineering tasks are highly proceduralized and require little human judgement.
However, procedures may need fine-tuning to accommodate project-specific details. The specification
and use of project procedures helps to ensure consistency during the project. This can simplify the
burden of managing a project and is reflected in the consistency and overall quality of the target
system. Some typical procedures are listed below:

•  Inventory  collection 

•  Testing 

•  Project  monitoring 

•  Project  reviews 

•  Reverse  engineering  of  source  code 

•  Reverse  engineering  of  data  files 

•  Installing  target  system 

•  Change  control 

•  Error reporting

•  Quality  assurance 

•  Status  reporting 


Procedures are used throughout any project. An ideal procedure provides detailed step-by-step
instructions for performing a task. Such a procedure can be automated and incorporated into a
tool or operation used to support a project. Unfortunately, many procedures used during a project
can only be loosely stated and either require human guidance or must be done manually.

The general characteristics of procedures can be formally specified. A procedure is a sequence
of steps [...] other steps are completed. A procedure always has a step designated as the first
step. The first step is the step that is initiated when the procedure begins. For convenience it is
assumed that steps are labeled. Further, a step consists of a label, a pre-condition, an action, and
a post-condition. The concept of actions is formalized by the set ACTION. For now the exact
details of what constitutes an action will be left unspecified.

ACTION  =  The  set  of  all  possible  actions 

The  concept  of  a  procedure  step  is  formalized  by  the  set  STEP.  The  definition  of  STEP  says 
that  each  step  has  an  identifying  label.  Each  step  has  two  conditions.  The  first  condition  is  a 
pre-condition,  a  condition  that  must  hold  true  before  the  action  associated  with  the  step  can  be 
performed.  The  second  condition  is  a  post-condition,  a  condition  that  must  hold  true  after  the  step 
action  is  complete. 

STEP  =  LABEL  x  CONDITION  x  ACTION  x  CONDITION 

The  concept  of  a  procedure  is  formalized  by  the  schema  PROCEDURE.  A  procedure  consists 
of  a  finite  number  of  steps,  an  ordering  among  the  steps,  and  a  designated  first  step. 

_ PROCEDURE _

proc : F STEP
method : LABEL ↔ LABEL
first-step : LABEL

∀ l1, l2 : LABEL; a1, a2 : ACTION; c1, c2, p1, p2 : CONDITION •
    l1 method l2 ⇒ ( ∃ (l1, c1, a1, p1) ∈ proc ∧ ∃ (l2, c2, a2, p2) ∈ proc )

∀ l : LABEL; a : ACTION; c1, c2 : CONDITION •
    (l, c1, a, c2) ∈ proc ⇒ ( ∃ l1 : LABEL | l1 method l ∨ l method l1 )

∃ a : ACTION; c1, c2 : CONDITION • (first-step, c1, a, c2) ∈ proc


The  variable  method  is  a  relation  that  specifies  the  relationship  between  steps  in  the  procedure. 
This  relationship  gives  an  ordering  between  steps  that  specifies  those  steps  that  can  be  done  after 
any  given  step. 
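
The three predicates of the PROCEDURE schema can be read as a well-formedness check. The sketch below assumes that steps are `(label, pre-condition, action, post-condition)` tuples of strings and that `method` is a set of label pairs. The function name `is_well_formed` is hypothetical and does not appear in the specification.

```python
from typing import Set, Tuple

Step = Tuple[str, str, str, str]  # (label, pre-condition, action, post-condition)


def is_well_formed(proc: Set[Step], method: Set[Tuple[str, str]], first_step: str) -> bool:
    """Check the three PROCEDURE predicates for a candidate procedure."""
    labels = {label for (label, _pre, _action, _post) in proc}
    related = {label for pair in method for label in pair}
    # 1. method only relates labels that belong to steps of the procedure.
    if not related.issubset(labels):
        return False
    # 2. every step takes part in the ordering, as predecessor or successor.
    if not labels.issubset(related):
        return False
    # 3. the designated first step is one of the steps.
    return first_step in labels
```

Under this literal reading, a procedure with a single step is accepted only if `method` relates that step to itself, since the second predicate requires every step to appear in the ordering.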

Because it is not always possible to specify a procedure for every task, we define a special
procedure called the NULL-PROCEDURE.

_ NULL-PROCEDURE _

PROCEDURE

proc = ∅
method = ∅
first-step = ε


When collecting procedures to be used in a project it is convenient to name each procedure.
Each procedure should be well documented. Procedure documentation should explain the purpose
of a procedure, how it is used, considerations, possible problems that may be encountered and how
to solve them, etc. At this point, we will define the procedure document as a piece of text and
leave the details of structuring the document for later.

When a procedure is collected or developed for a project, the procedure must be named,
documented, and the procedure itself specified. A procedure with these three components is called a
complete procedure. The set of all possible procedure names is denoted by PROC-NAME.

PROC-NAME = The set of all possible procedure names
PROC-NAME ⊆ NAME

The set of all possible complete procedures is called COMPLETE-PROCEDURE.

COMPLETE-PROCEDURE = PROC-NAME × TEXT × PROCEDURE

The collection of all complete procedures to be used during a project is called
PROJECT-PROCEDURES and is denoted as:

PROJECT-PROCEDURES : F COMPLETE-PROCEDURE
( ε, ∅, NULL-PROCEDURE ) ∈ PROJECT-PROCEDURES

Initially, no procedures are associated with a project. The initial collection of procedures is
denoted by INIT-PROJ-PROCS.

INIT-PROJ-PROCS = { ( ε, ∅, NULL-PROCEDURE ) }


Procedure  Operations 

Since project procedures are merely a set of complete procedures, the only operations that are
necessary are:

•  Delete-proj-procedure

•  Get-proj-procedure

•  Update-proj-procedure

•  List-proj-procs

Several operations on complete procedures can also be defined. These are:

•  Create-complete-proc

•  Update-cproc-name

•  Update-cproc-text

•  Update-cprocedure

•  Get-cproc-name

•  Get-cproc-doc

•  Get-cprocedure

To denote that an operation changes information about a procedure the Δ notation is used:

Δ PROJECT-PROCEDURES = [ PROJECT-PROCEDURES,
    PROJECT-PROCEDURES' ]

To denote that an operation does not change information about a procedure the Ξ notation is used:

Ξ PROJECT-PROCEDURES = [ PROJECT-PROCEDURES,
    PROJECT-PROCEDURES' |
    PROJECT-PROCEDURES' = PROJECT-PROCEDURES ]

The operation Delete-proj-procedure deletes a complete procedure from the list of project
procedures.

_ Delete-proj-procedure _

Δ PROJECT-PROCEDURES
proc? : PROC-NAME

∃ t : TEXT; p : PROCEDURE • (proc?, t, p) ∈ PROJECT-PROCEDURES
PROJECT-PROCEDURES' = PROJECT-PROCEDURES - {(proc?, t, p)}


The operation List-proj-procs returns a list of procedure names that have been specified for a
project.

_ List-proj-procs _

Ξ PROJECT-PROCEDURES
names! : F PROC-NAME

names! = { n : PROC-NAME | ∃ t : TEXT; p : PROCEDURE •
    (n, t, p) ∈ PROJECT-PROCEDURES }


The operation Create-complete-proc is used to create a new complete procedure and enter it
into the list of project procedures.

_ Create-complete-proc _

Δ PROJECT-PROCEDURES
name? : PROC-NAME

¬ ∃ t : TEXT; p : PROCEDURE •
    (name?, t, p) ∈ PROJECT-PROCEDURES
PROJECT-PROCEDURES' = PROJECT-PROCEDURES ∪
    {(name?, ε, NULL-PROCEDURE)}

The operation Update-cprocedure changes the procedure associated with a complete procedure
name.

_ Update-cprocedure _

Δ PROJECT-PROCEDURES
name? : PROC-NAME
proc? : PROCEDURE

∃ t : TEXT; p : PROCEDURE • (name?, t, p) ∈ PROJECT-PROCEDURES
PROJECT-PROCEDURES' = ( PROJECT-PROCEDURES -
    {(name?, t, p)} ) ∪ {(name?, t, proc?)}


4.2.10  Project  Standards 

One result of project planning is the collection and/or definition of standards that are to be applied
to project products. Many software organizations typically have a set of standards that are adhered
to within the organization. In addition, new standards may be needed for a new project and some
existing standards may need to be tailored to satisfy project-specific requirements.

Standards define the accepted form of results for each task in the process. Standards place
constraints on what is and is not acceptable. Typical project standards include:

•  Naming standards (variables, functions, files)

•  Documentation standards

•  Coding standards

•  Test standards

Standards  tend  to  be  lists  of  rules  that  identify  some  item  or  property  of  an  object  and  then 
place  constraints  on  that  item.  Typical  constraints  limit  the  range  of  valid  values  or  the  forms  that 
may  be  used  to  represent  the  item.  We  formalize  the  concepts  of  items  and  constraints  by  denoting 
them  with  the  sets: 

ITEM  =  The  set  of  all  possible  items 
CONSTRAINT  =  The  set  of  all  constraints 

The set ITEM denotes the set of all possible items. The set CONSTRAINT denotes the set of all
possible constraints. For one item there may be several constraints; this is denoted by a constraint
list. A constraint list is a set of constraints.

CONSTRAINT-LIST : P CONSTRAINT

A rule identifies an item and specifies the constraints on that item. For convenience each rule is
given a name. The set of all possible rule names is RULE-NAME.

RULE-NAME = The set of all possible rule names

RULE-NAME ⊆ NAME

RULE : RULE-NAME × ITEM × CONSTRAINT-LIST

A standard is defined as a set of rules.

_ STANDARD _

standard : F RULE

∀ n1 : RULE-NAME; i1, i2 : ITEM; c1, c2 : CONSTRAINT-LIST •
    (n1, i1, c1) ∈ standard ⇒ ¬ ( ∃ (n1, i2, c2) ∈ standard ∧ i1 ≠ i2 ∧ c1 ≠ c2 )

The collection of project standards is given by PROJECT-STANDARDS.

PROJECT-STANDARDS : F STANDARD
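
The STANDARD predicate can be checked mechanically. The sketch below follows a literal reading of the predicate as it appears above and assumes that a rule is a `(name, item, constraints)` tuple whose constraint list is a frozen set of strings. The function name `is_standard` and the example rule names are hypothetical.

```python
from typing import FrozenSet, Set, Tuple

Rule = Tuple[str, str, FrozenSet[str]]  # (rule name, item, constraint list)


def is_standard(rules: Set[Rule]) -> bool:
    """Reject a rule set in which one name labels two rules that differ
    in both the item and the constraint list."""
    for (n1, i1, c1) in rules:
        for (n2, i2, c2) in rules:
            if n1 == n2 and i1 != i2 and c1 != c2:
                return False
    return True
```

For example, a set holding `("var-names", "variable", {"lowercase"})` and `("var-names", "function", {"verb-first"})` is rejected, because the name `var-names` is attached to two different items with different constraints.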

Standard  Operations 

Since the project standards are merely a set of standards, the only operations that are necessary are:

•  Delete-proj-standard

•  Get-proj-standard

•  Update-proj-standard

•  List-proj-standards

Several operations on standards can also be defined. These are:

•  Create-standard

•  Update-stand-name

•  Update-stand-item

•  Add-stand-constraint

•  Delete-stand-constraint

•  Get-rule

•  List-rules

These operations are not specified at this time.

4.2.11 Project Resources

The  software  system  being  re-engineered  may  execute  on  special  equipment,  or  special  equipment 
may  be  required  to  test  the  system,  or  even  to  aid  in  the  re-engineering  project.  Perhaps  a  laser 
printer  and  several  workstations  are  to  be  dedicated  to  the  project.  Any  resource  needed  by  a  project 
must  be  identified  and  arrangements  made  to  obtain  the  resource.  Typical  resources  include: 

•  Computers 

•  Printers 

•  Peripheral  devices 

•  Network  access 

•  Simulation  equipment  -  such  as  a  mock-up  cockpit 

•  Knowledge  sources  -  key  personnel  not  assigned  to  the  project.  This  can  include  a  system 
administrator,  a  developer  of  the  existing  system,  a  key  system  user,  a  person  with  hardware 
expertise,  etc. 

The  list  of  resources  is  modeled  by  a  partial  function.  Each  resource  has  a  name  and  an 
associated  set  of  properties.  Different  resources  have  different  properties. 

RES-NAME = The set of all possible resource names
RES-NAME ⊆ NAME

The resource information is denoted by the partial function RESOURCES, which maps the
name of the resource to its list of properties.

RESOURCES : RES-NAME ⇸ PROP-LIST

Initially,  the  resources  required  by  a  project  are  unknown.  The  initial  project  resources  are 
formalized  by  INIT-RESOURCES. 

INIT-RESOURCES  =  0 


Resource  Operations 

Several operations can be defined on resource information. These operations include:

•  Add-new-resource

•  Delete-resource

•  List-resources

•  List-resource-props

•  Add-resource-prop

•  Delete-resource-prop

•  Change-resource-prop


To denote that an operation changes the resource information the Δ notation is used:

Δ RESOURCES = [ RESOURCES, RESOURCES' ]

To denote that an operation does not change the resource information the Ξ notation is used:

Ξ RESOURCES = [ RESOURCES, RESOURCES' | RESOURCES' = RESOURCES ]

The operation Add-new-resource is used to associate a resource with the resource information.
The new resource should not already be listed.

_ Add-new-resource _

Δ RESOURCES
name? : RES-NAME

name? ∉ dom RESOURCES

RESOURCES' = RESOURCES ∪ { name? ↦ ∅ }


The operation Delete-resource is used to remove a resource from the list of resources. This may
be used when project emphasis shifts and previously needed equipment will no longer be used.

_ Delete-resource _

Δ RESOURCES
name? : RES-NAME

name? ∈ dom RESOURCES
RESOURCES' = { name? } ⩤ RESOURCES


The operation List-resources is used to generate a list of all resources needed by the project.

_ List-resources _

Ξ RESOURCES
list! : P RES-NAME

list! = dom RESOURCES


The operation List-resource-props is used to generate a list of properties associated with a
resource item.

_ List-resource-props _

Ξ RESOURCES
name? : RES-NAME
list! : P ( PROP-NAME × VALUE )

name? ∈ dom RESOURCES

List-properties[ RESOURCES(name?)/pl, RESOURCES'(name?)/pl' ]


The operation Add-resource-prop is used to add a new property to a listed resource.

_ Add-resource-prop _

Δ RESOURCES
name? : RES-NAME
prop? : PROP-NAME
val? : VALUE

name? ∈ dom RESOURCES
RESOURCES' = RESOURCES ⊕ { name? ↦
    Add-property[ RESOURCES(name?)/pl, RESOURCES'(name?)/pl',
        prop?/p?, val?/v? ] }


The operation Delete-resource-prop is used to remove a property from a listed resource.

_ Delete-resource-prop _

Δ RESOURCES
name? : RES-NAME
prop? : PROP-NAME

name? ∈ dom RESOURCES

RESOURCES' = RESOURCES ⊕ { name? ↦
    Delete-property[ RESOURCES(name?)/pl, RESOURCES'(name?)/pl',
        prop?/p? ] }
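
The resource operations nest one mapping inside another: RESOURCES maps a resource name to a property list, and the property operations rewrite that inner list. A minimal sketch follows, assuming that a property list is a dictionary from property name to a string value. The behaviour of Add-property and Delete-property is not shown in this section, so treating them as a plain dictionary insert and removal is an assumption, as is raising `ValueError` when a precondition fails.

```python
from typing import Dict

PropList = Dict[str, str]            # property name -> value (assumed layout)
Resources = Dict[str, PropList]      # resource name -> property list


def add_new_resource(resources: Resources, name: str) -> Resources:
    """Add-new-resource: a new resource starts with an empty property list."""
    if name in resources:
        raise ValueError("resource already listed")
    return {**resources, name: {}}


def delete_resource(resources: Resources, name: str) -> Resources:
    """Delete-resource: remove the resource from the domain of RESOURCES."""
    if name not in resources:
        raise ValueError("unknown resource")
    return {n: props for n, props in resources.items() if n != name}


def add_resource_prop(resources: Resources, name: str, prop: str, val: str) -> Resources:
    """Add-resource-prop: override the resource's entry with an extended property list."""
    if name not in resources:
        raise ValueError("unknown resource")
    return {**resources, name: {**resources[name], prop: val}}


def delete_resource_prop(resources: Resources, name: str, prop: str) -> Resources:
    """Delete-resource-prop: override the entry with the property removed."""
    if name not in resources:
        raise ValueError("unknown resource")
    props = {p: v for p, v in resources[name].items() if p != prop}
    return {**resources, name: props}
```

A value-changing operation such as Change-resource-prop would have the same shape as `add_resource_prop` in this representation, since a dictionary insert on an existing key replaces the value.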


The operation Change-resource-prop is used to change the value associated with a property of
a listed resource.

_ Change-resource-prop _

Δ RESOURCES
name? : RES-NAME
prop? : PROP-NAME
val? : VALUE

RESOURCES' = RESOURCES ⊕ { name? ↦
    Update-property[ RESOURCES(name?)/pl, RESOURCES'(name?)/pl',
        prop?/p?, val?/v? ] }

4.2.12 Project Tools


Tools  can  be  used  to  implement  tasks  that  are  suitable  for  automation.  Actions  that  are  repetitive 
are  candidates  for  automation  by  a  tool.  Tools  are  commonly  used  to  record  information  and 
operate on that information. The use of tools can reduce project time, ensure accuracy of results,
and promote consistency.

During the planning phase, tools that are needed on a project may be identified. Tools required
for a project may already exist in-house, may need to be purchased, or, as a last resort, may need
to be developed.
The  development  of  new  tools  for  a  project  is  often  discouraged  since  the  time  required  to  develop 
a  tool  can  delay  a  project.  Typical  project  tools  include: 

•  Language  compilers 

•  Text  and  code  editors 

•  Configuration  management  system 

•  Language  translators 

•  Software  engineering  environment 

•  Documentation  management  system 

•  Documentation  browsers 

•  Source  code  browsers 

•  File  converters 

• Test case generators

•  Test  coverage  monitors 

•  File  comparators 

•  Source  code  formatters 

•  Spreadsheets 

•  Schedule  handlers 

• Diagramming tools

For tools that will be used during a project, information about these tools must be collected.
This information can be used during the project to guide the use of those tools and to identify
tasks that will require them.

Each tool has a name that identifies it. The set of all possible tool names is denoted by
TOOL-NAME.

Each tool resides on a computer system and is stored at a location on that system. It is possible
for a tool to exist on more than one system. Here, each system that has a copy of the tool must be
noted. Each tool has an availability status. Typically, a tool is available full-time, but some
unique tools may be available only on a limited or scheduled basis. A tool may also be on order,
in development, or in-house but not installed. The availability of required tools can affect the
project schedule. Each tool is used by one or more project tasks.

Before  formalizing  the  concept  of  tool  information  it  is  necessary  to  define  several  sets.  The  set 
of  all  computers  in  an  organization  is  denoted  by  COMPUTER. 

COMPUTER  =  The  set  of  all  possible  organization  computers 

The  set  of  all  possible  directories  or  places  to  store  a  tool  on  these  computers  is  denoted  by  BIN. 

BIN  =  The  set  of  all  tool  locations 

The  set  of  all  possible  availability  statuses  for  a  tool  is  denoted  by  TOOL-STATUS: 

TOOL-STATUS = { available-full, available-scheduled, on-order,
    to-be-ordered, not-installed }

The  schema  below  formalizes  the  concept  of  tool  information. 

_ TOOL _

name : TOOL-NAME
location : 𝔽 (COMPUTER × BIN)
availability : TOOL-STATUS
used-by : 𝔽 TASK-NAME

used-by ⊆ dom PROJECT-TASK
used-by = { t : TASK-NAME | t ∈ dom PROJECT-TASK ∧
    name ∈ PROJECT-TASK(t).tools }


The collection of all tool information for a project is formalized by PROJECT-TOOLS, where:

PROJECT-TOOLS : ℙ TOOL

Initially, the tools to be used on a project are unknown. This is formalized by
INIT-PROJECT-TOOLS.

INIT-PROJECT-TOOLS = ∅
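
As an illustration of the TOOL schema, the sketch below models a tool record in Python and derives used-by from the project tasks, as the schema predicate requires. It is a sketch under stated assumptions: PROJECT-TASK is simplified to a mapping from task names to the set of tool names each task lists, and the tool, computer, and task names are invented for the example.

```python
from dataclasses import dataclass
from enum import Enum


class ToolStatus(Enum):
    AVAILABLE_FULL = "available-full"
    AVAILABLE_SCHEDULED = "available-scheduled"
    ON_ORDER = "on-order"
    TO_BE_ORDERED = "to-be-ordered"
    NOT_INSTALLED = "not-installed"


@dataclass(frozen=True)
class Tool:
    name: str
    location: frozenset  # set of (computer, bin) pairs
    availability: ToolStatus


def used_by(tool: Tool, project_task: dict) -> set:
    """Derive used-by: every task whose tool list names this tool."""
    return {task for task, tools in project_task.items() if tool.name in tools}


# Simplified stand-in for PROJECT-TASK: task name -> names of the tools it needs.
project_task = {
    "translate-source": {"translator", "editor"},
    "unit-test": {"coverage-monitor"},
    "document": {"editor"},
}

editor = Tool(
    name="editor",
    location=frozenset({("vax1", "/usr/local/bin"), ("sun3", "/usr/bin")}),
    availability=ToolStatus.AVAILABLE_FULL,
)

project_tools: set = set()  # INIT-PROJECT-TOOLS: no tools are known initially
project_tools.add(editor)
```

Because used-by is computed from the task information rather than stored, it cannot drift out of step with the tasks, which is the intent of the equality in the schema predicate.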


Operations  on  Tool  Information 

The following operations can be defined on tool information and PROJECT-TOOLS.

• Add-tool-location

• Delete-tool-location

• Set-tool-availability

• Get-tool-name

• List-tool-location

• Get-tool-availability

• Get-tool-task-list

The following operations can be defined on the list of project tools.

• Add-project-tool

• Get-project-tool

• Update-project-tool

• Delete-project-tool

• List-project-tools

The  specifications  for  these  operations  are  not  given. 

4.2.13  Test  Planning 

The testing of programs and software systems is done to show that system features work, to
search the system for faults, and to establish confidence that a system is reliable enough to be put
into use. Testing of a system occurs after the implementation phase, when coding is complete.
However,  preparations  for  testing  begin  much  earlier  in  a  project.  Test  planning  can  begin  during 
the  early  stages  of  a  project. 

The  result  of  test  planning  is  a  test  plan.  A  test  plan  specifies  the  tests  that  are  used  to  verify 
that  the  system  features  operate  without  errors.  Tests  specified  in  a  test  plan  can  be  developed 
using different strategies. Two common strategies are black-box and white-box testing. Black-box
testing  ignores  source  code  and  focuses  on  testing  system  features.  White-box  testing  focuses  on 
the  source  code.  Two  common  white-box  testing  strategies  are  branch  coverage  and  statement 
coverage.  Branch  coverage  tries  to  execute  all  branches  in  the  code.  Statement  coverage  tries  to 
execute  all  statements  in  the  code.  Typically  in  white-box  testing  a  certain  percentage  of  coverage 
is  required.  Regardless  of  the  strategy  or  method  used  to  create  test  cases,  the  basic  properties  of 
tests  and  test  plans  can  be  formalized. 
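
The difference between statement coverage and branch coverage can be seen on a very small routine. The Python sketch below is illustrative only; the function classify and its threshold are invented for the example.

```python
def classify(x: int) -> str:
    """Label a value as "small" or "large" (hypothetical routine under test)."""
    label = "small"
    if x > 10:
        label = "large"
    return label


# A single test with x = 11 executes every statement in classify, so it reaches
# full statement coverage, yet the `if` is only ever taken in its true direction.
only_true_branch = classify(11)

# Branch coverage also requires a test in which the condition is false,
# so that the path that skips the assignment is exercised as well.
false_branch = classify(5)
```

A test plan that demands branch coverage therefore needs at least two test cases for this routine, while one is enough for statement coverage.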

A test case consists of input data and expected output results. The input data may be used to
test a routine within a program, an entire program, or the interaction between programs (i.e., the
system). The output results are the results a test is expected to produce. The expected results
can be compared against the actual results to determine whether the test passed. Input data
may consist of keystrokes, and output results may consist of recorded screens. Either input data or
output results may consist of files, formatted text, the values assigned to specific source code
variables, signals sent to or received from other programs or devices, etc. These various forms of
information are formalized by the set:

TEST-FORM  =  The  set  of  all  possible  input  and  output  data 

A test case must also specify what is being tested by the test data. The item tested may be the
system, a program within the system, a routine within a program, a path through a program, a
program statement, a feature of a program or the system, etc. The set of all system items that can
be tested
is  denoted  by: 

TEST-ITEM  =  The  set  of  all  testable  items 

Finally, a test case should be documented. The test case documentation should explain the test,
how the test is done (e.g., using a test tool or manual entry), and any considerations relevant to
the test. Test case instructions are formalized as a COMPLETE-PROCEDURE. The set of all test
cases is formalized by:

_ TEST-CASE _

test : LABEL
item : TEST-ITEM
input : TEST-FORM
result : TEST-FORM
proc : PROC-NAME


The test element is a unique label identifying the specific test case.

It is common to group tests together. This is often done for a set of tests that have some
common thread, such as testing the same feature or routine. A test group should be named
for easy reference. A test group should be documented, explaining the purpose of the test grouping.
Finally, there is often an ordering among test cases in a group, where the ordering may specify the
order of test case execution. The group documentation and ordering information are formalized
as part of a test group procedure using COMPLETE-PROCEDURE. The procedure will specify
the order in which test cases are applied. The concept of a test group is formalized by the set
TEST-GROUP.

_ TEST-GROUP _

group-name : LABEL
group-proc : PROC-NAME
tests : 𝔽 TEST-CASE

¬ ∃ (l₁, i₁, in₁, out₁, p₁), (l₂, i₂, in₂, out₂, p₂) ∈ tests • l₁ = l₂ ∧
    (i₁ ≠ i₂ ∨ in₁ ≠ in₂ ∨ out₁ ≠ out₂ ∨ p₁ ≠ p₂)
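
The predicate of TEST-GROUP says that no two different test cases in a group may share a label. Since tests is a set, this amounts to requiring that the labels in the group are pairwise distinct. A minimal Python check of that invariant is sketched below; the class name TCase and the sample data are assumptions made for illustration.

```python
from typing import NamedTuple


class TCase(NamedTuple):
    """One test case, with the five elements of the TEST-CASE schema."""
    test: str    # unique label
    item: str
    input: str
    result: str
    proc: str


def labels_unique(tests: set) -> bool:
    """True when no two distinct test cases in the set carry the same label."""
    labels = [tc.test for tc in tests]
    return len(labels) == len(set(labels))


good = {
    TCase("t1", "login", "alice/secret", "menu screen", "run-login"),
    TCase("t2", "login", "alice/wrong", "error screen", "run-login"),
}
# Same label "t1" but a different expected result: this violates the invariant.
bad = good | {TCase("t1", "login", "alice/secret", "error screen", "run-login")}
```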


Finally, test groups themselves are often grouped together to form a test plan. There is typically
only one test plan for a system. The plan should be documented, explaining the plan and the purpose
of the tests. This should be high-level test information. Details can be recorded in the group or test
case documentation. Finally, an ordering between groups may be given, where the ordering specifies
which groups should be tested first. This ordering and test plan documentation are formalized by
a test plan procedure using COMPLETE-PROCEDURE. The concept of a test plan is formalized
by the set TEST-PLAN.


_ TEST-PLAN _

test-plan-proc : COMPLETE-PROCEDURE
groups : 𝔽 TEST-GROUP

¬ ∃ (l₁, p₁, ts₁), (l₂, p₂, ts₂) ∈ groups •
    l₁ = l₂ ∧ (p₁ ≠ p₂ ∨ ts₁ ≠ ts₂)


Initially  there  is  no  test  plan  for  a  re-engineering  project.  There  may  be  an  existing  test  plan  for 
the  existing  system,  but  such  a  plan  must  be  evaluated  before  a  decision  is  made  to  use  it  during  the 
re-engineering  project.  The  initial  test  plan  for  the  target  system  is  denoted  by  INIT_TEST_PLAN. 

_ INIT-TEST-PLAN _

TEST-PLAN

proc = (ε, ε, NULL-PROCEDURE)
groups = ∅


Test  Plan  Operations 

Operations on test cases, test groups, and test plans are used to define, update, delete, or
examine information. The need to create test information is clear. As the system evolves and
undergoes changes, existing tests may need to be modified or even deleted. To specify that an
operation changes information the Δ notation is used:

Δ TEST-CASE = [ TEST-CASE, TEST-CASE' ]

Δ TEST-GROUP = [ TEST-GROUP, TEST-GROUP' ]

Δ TEST-PLAN = [ TEST-PLAN, TEST-PLAN' ]

To specify that an operation does not change information the Ξ notation is used:

Ξ TEST-CASE = [ TEST-CASE, TEST-CASE' | TEST-CASE' = TEST-CASE ]

Ξ TEST-GROUP = [ TEST-GROUP, TEST-GROUP' | TEST-GROUP' = TEST-GROUP ]

Ξ TEST-PLAN = [ TEST-PLAN, TEST-PLAN' | TEST-PLAN' = TEST-PLAN ]


Possible  operations  on  test  cases  include: 


• Create-tcase

• Change-tcase-label

• Change-tcase-item

• Set-tcase-input

• Set-tcase-result

• Set-tcase-procedure

• Get-tcase-label

• Get-tcase-item

• Get-tcase-input

• Get-tcase-result

• Get-tcase-procedure

Possible  operations  on  test  groups  include: 

•  Create-tgroup 

•  Change-tgroup-label 

• Set-tgroup-procedure

• Add-tgroup-test

• Delete-tgroup-test

• Get-tgroup-label

• Get-tgroup-procedure

• Get-tgroup-test

•  List-tgroup-tests 

Possible  operations  on  a  test  plan  include: 

•  Create-tplan 

• Set-tplan-procedure

• Add-tplan-group

• Delete-tplan-group

• Get-tplan-group

• List-tplan-groups

Because the specifications for many of these operations are similar, only a few representative
operations are defined below. Several operations on each object are given.

The  operation  Create-tcase  is  used  to  create  a  new  test  case.  The  operation  only  accepts  a  name 
for  the  test  case  and  the  item  to  be  tested.  Other  information  about  the  test  case  can  be  added 
using  the  set  operations. 

_ Create-tcase _

Δ TEST-CASE
name? : LABEL
item? : TEST-ITEM

test' = name?
item' = item?
input' = ε
result' = ε
proc' = ε


The operation Change-tcase-label is used to change the label assigned to a test case.

_ Change-tcase-label _

Δ TEST-CASE
new-label? : LABEL

test' = new-label?
item' = item
input' = input
result' = result
proc' = proc


The  operation  Set-tcase-input  specifies  the  input  test  data  for  the  test  case. 

_ Set-tcase-input _

Δ TEST-CASE
idata? : TEST-FORM

input' = idata?
test' = test
item' = item
result' = result
proc' = proc


The operation Get-tcase-input is used to retrieve the input test data for a test case.

_ Get-tcase-input _

Ξ TEST-CASE
idata! : TEST-FORM

idata! = input
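
The test case operations follow one pattern: a Δ operation sets one element and explicitly carries every other element over unchanged, while a Ξ operation only reads. A hedged Python sketch of that pattern follows, using an immutable record so that each Δ operation returns a new state. The use of strings for LABEL, TEST-ITEM, and TEST-FORM values and of the empty string for ε are assumptions made for the example.

```python
from dataclasses import dataclass, replace

EMPTY = ""  # stands in for the empty value written ε in the schemas


@dataclass(frozen=True)
class TCase:
    test: str
    item: str
    input: str = EMPTY
    result: str = EMPTY
    proc: str = EMPTY


def create_tcase(name: str, item: str) -> TCase:
    """Create-tcase: only the label and the tested item are set."""
    return TCase(test=name, item=item)


def change_tcase_label(tc: TCase, new_label: str) -> TCase:
    """Change-tcase-label: the label changes, all other elements are kept."""
    return replace(tc, test=new_label)


def set_tcase_input(tc: TCase, idata: str) -> TCase:
    """Set-tcase-input: the input changes, all other elements are kept."""
    return replace(tc, input=idata)


def get_tcase_input(tc: TCase) -> str:
    """Get-tcase-input: a read-only operation; the test case is not changed."""
    return tc.input


tc0 = create_tcase("t1", "login screen")
tc1 = set_tcase_input(tc0, "alice<TAB>secret<RETURN>")
tc2 = change_tcase_label(tc1, "login-ok")
```

Here dataclasses.replace plays the role of the frame conditions (item' = item, result' = result, and so on): every field not named in the call keeps its previous value.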


The  operation  Create-tgroup  creates  a  new  test  group.  Only  the  name  of  the  group  is  specified. 
The  group  test  procedure  and  the  individual  test  cases  can  be  associated  with  the  group  later  using 
the  set  operations. 

_ Create-tgroup _

Δ TEST-GROUP
name? : LABEL

name' = name?
proc' = ε
tests' = ∅


The  operation  Add-tgroup-test  adds  a  test  case  to  the  group  of  test  cases. 

_ Add-tgroup-test _

Δ TEST-GROUP
tc? : TEST-CASE

tc? ∉ tests
tests' = tests ∪ { tc? }
name' = name
proc' = proc


The operation Delete-tgroup-test removes a test case from the group of test cases.

_ Delete-tgroup-test _

Δ TEST-GROUP
tc? : TEST-CASE

tc? ∈ tests
tests' = tests \ { tc? }
name' = name
proc' = proc


The operation Get-tgroup-test retrieves a test case from the specified test case group.

_ Get-tgroup-test _

Ξ TEST-GROUP
tname? : LABEL
test! : TEST-CASE

∃ (l : LABEL, i : TEST-ITEM, in : TEST-FORM, out : TEST-FORM,
    proc : PROC-NAME) • (l, i, in, out, proc) ∈ tests ∧ l = tname? ∧
    test! = (tname?, i, in, out, proc)


The  operation  List-tgroup-tests  returns  a  list  of  all  the  test  case  labels  for  a  test  group. 

_ List-tgroup-tests _

Ξ TEST-GROUP
cases! : 𝔽 LABEL

cases! = { l : LABEL | ∃ t : TEST-CASE • t ∈ tests ∧
    Get-tcase-label(t) = l }
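
The test group operations can be sketched in the same style. The Python below is a minimal illustration, assuming test cases are immutable tuples and a group holds them in a frozen set; the class and function names are hypothetical, and exceptions stand in for failed preconditions.

```python
from dataclasses import dataclass, replace
from typing import NamedTuple


class TCase(NamedTuple):
    test: str
    item: str
    input: str = ""
    result: str = ""
    proc: str = ""


@dataclass(frozen=True)
class TGroup:
    name: str
    proc: str = ""
    tests: frozenset = frozenset()


def add_tgroup_test(g: TGroup, tc: TCase) -> TGroup:
    """Add-tgroup-test: requires that tc is not yet in the group."""
    if tc in g.tests:
        raise ValueError("test case already in group")
    return replace(g, tests=g.tests | {tc})


def delete_tgroup_test(g: TGroup, tc: TCase) -> TGroup:
    """Delete-tgroup-test: requires that tc is in the group."""
    if tc not in g.tests:
        raise ValueError("test case not in group")
    return replace(g, tests=g.tests - {tc})


def get_tgroup_test(g: TGroup, tname: str) -> TCase:
    """Get-tgroup-test: return the test case whose label is tname."""
    for tc in g.tests:
        if tc.test == tname:
            return tc
    raise KeyError(tname)


def list_tgroup_tests(g: TGroup) -> frozenset:
    """List-tgroup-tests: the labels of all test cases in the group."""
    return frozenset(tc.test for tc in g.tests)


a = TCase("t1", "login")
b = TCase("t2", "logout")
group = add_tgroup_test(add_tgroup_test(TGroup("session"), a), b)
```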


4.2.14  Estimates 

Project planning requires the ability to accurately estimate the costs involved. A good
understanding of what influences the cost is necessary to monitor, control, and, in the future,
reduce costs. These influences are often referred to as factors. Typical factors include system
size, programming language(s), experience of the staff, and project methodology.

Estimation models make it possible to predict how much a project is likely to cost, the number
of people needed, and the amount of effort. An estimation model has five major elements [Yu90].

Estimation Target : Identifies exactly what is to be estimated, such as the project cost (measured
in dollars), project effort (measured in staff-months), or schedule or duration (measured
in months).

Estimation Formulas : A set of algorithms (formulas) that produce the estimation target values
by modeling the influence of various factors.

Estimation Process : The estimation process defines the standards and methods to assist with
estimation. The process links estimation with other management activities, directs the
collection of data used by the estimation model, and states when estimation formulas should be
used.

Historical Project Database : A collection of information from completed projects with
identified effort, staffing, and schedule results, as well as known environmental values.

Estimation Tools : An estimation model can be implemented in a tool that provides a user with
a friendly way of accessing the historical database and generating requested estimates.

Before formalizing the concept of an estimation model, several sets must be defined. The set of
estimation model targets is denoted by TARGET.

TARGET  =  {  cost,  effort,  duration,  ...  } 

The  formulas  used  by  an  estimation  model  are  equations.  The  set  of  all  possible  estimation  model 
equations  is  denoted  by  EQUATION. 

EQUATION  =  The  set  of  all  possible  estimation  model  equations 

The  schema  EST_MODEL  denotes  the  information  associated  with  an  estimation  model. 

_ EST-MODEL _

target : TARGET
formulas : 𝔽 EQUATION
process : 𝔽 (STANDARD-NAME ∪ PROCEDURE-NAME)
tools : 𝔽 TOOL-NAME

∀ n : NAME • n ∈ process ⇒ n ∈ dom PROJECT-STANDARDS ∨
    n ∈ dom PROJECT-PROCEDURES
tools ⊆ PROJECT-TOOLS

The collection of estimation models used in a project is denoted by PROJECT-MODELS.

PROJECT-MODELS : 𝔽 EST-MODEL

Initially, there are no project models associated with a project, though project planning is likely
to quickly select one or more for use during the planning phase. The initial lack of project models
is denoted by INIT-PROJECT-MODELS.

INIT-PROJECT-MODELS = ∅

Note  that  a  project  database  is  not  included  in  the  estimation  model  information.  There  is 
only  one  historical  project  database  and  all  estimation  models  use  it.  For  now  the  nature  of  the 
historical  project  database  will  not  be  specified.  The  historical  project  database  will  simply  be 
denoted by HIST_PROJ_DB.

One  element  of  the  historical  project  database  is  a  collection  of  estimates.  During  the  initial 
planning  phase  estimates  for  project  cost,  effort,  and  duration  are  made.  As  the  project  proceeds 
these  estimates  often  need  to  be  updated.  Also  estimates  may  be  broken  down  by  phase  or  even 
task.  This  estimation  information  can  be  formalized.  The  information  for  a  single  estimate  is 
denoted by ESTIMATE.

_ ESTIMATE _

type : TARGET
item : ITEM-NAME
date : DATE
value : ℤ

The item element specifies what the estimate was for. For example, if the value of type is "effort"
and item is "test system" then the estimate is for the amount of effort required to test the target
system. The set of all possible item names is denoted by ITEM-NAME.

ITEM-NAME = The set of all possible estimation item names
ITEM-NAME ⊆ NAME

The collection of project estimates maintained in the historical project database is denoted by
PROJECT-ESTIMATES.

PROJECT-ESTIMATES : 𝔽 ESTIMATE

The initial contents of the historical project database depend on other projects that have
previously been recorded. At the beginning of a re-engineering project there are no estimates for
the current project in the database. This is denoted by INIT-PROJECT-ESTIMATES.

INIT-PROJECT-ESTIMATES = ∅

Estimate  Operations 

Necessary  operations  on  estimation  information  include: 

• Get-est-type

• Get-est-item

• Get-est-date

• Get-est-value

Necessary  operations  on  project  estimates  include: 

•  Record-estimate 

• Get-estimate

• Get-latest-estimate

• List-all-estimates

• List-latest-estimates

Necessary  operations  on  estimation  models  include: 

• Define-est-model

• Add-est-formula

• Update-est-formula

•  Delete-est-formula 

•  Add-est-standard 

•  Delete-est-standard 

•  Add-est-procedure 

•  Delete-est-procedure 

•  Add-est-tool 

•  Delete-est-tool 

•  Get-est-target 

•  List-formulas 

•  List-est-standards 

•  List-est-procedures 

•  List-est-tools 

Necessary  operations  on  project  estimation  models  include: 

•  Add-est-model 

• Delete-est-model

• List-est-models

The specifications for these operations are not given.
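
Although the specifications are not given, the intent of two of the listed operations, Record-estimate and Get-latest-estimate, can be illustrated. The Python sketch below is one possible reading, not the specification: it assumes an estimate is identified by its type and item, that the latest estimate is the one with the most recent date, and that values are whole numbers as in the ESTIMATE schema.

```python
import datetime
from dataclasses import dataclass


@dataclass(frozen=True)
class Estimate:
    type: str            # a TARGET such as "cost", "effort", or "duration"
    item: str            # what the estimate is for
    date: datetime.date  # when the estimate was made
    value: int


def record_estimate(estimates: frozenset, e: Estimate) -> frozenset:
    """Record-estimate (assumed): earlier estimates are kept, never overwritten."""
    return estimates | {e}


def get_latest_estimate(estimates: frozenset, type_: str, item: str) -> Estimate:
    """Get-latest-estimate (assumed): the most recent estimate for a type and item."""
    matching = [e for e in estimates if e.type == type_ and e.item == item]
    if not matching:
        raise KeyError((type_, item))
    return max(matching, key=lambda e: e.date)


project_estimates: frozenset = frozenset()  # INIT-PROJECT-ESTIMATES
first = Estimate("effort", "test system", datetime.date(1991, 2, 1), 12)
revised = Estimate("effort", "test system", datetime.date(1991, 6, 15), 15)
project_estimates = record_estimate(project_estimates, first)
project_estimates = record_estimate(project_estimates, revised)
```

Keeping every recorded estimate, rather than replacing the previous one, is what allows estimates to be compared as the project proceeds.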

4.2.15  Project  Organizational  Structure 

Within a project there must be an organizational structure. At the top of the organizational
structure  is  the  project  manager.  Underneath  this  person  are  a  number  of  teams  each  with  an 
assigned  role  and  responsibilities  within  the  project.  There  may  also  be  several  people  that  serve 
as  links  between  teams,  or  between  teams  and  external  groups  such  as  the  project  client  and  users. 

Once  defined,  the  responsibility  of  the  project  organization  is  to  work  towards  and  achieve  the 
successful  planning,  re-engineering,  and  accomplishment  of  the  project  goals.  This  is  never  an  easy 
job  and  the  job  can  be  made  harder  or  easier  depending  on  the  characteristics  of  the  organizational 
structure  and  the  key  people  within  the  project. 

To formalize the concept of a project organizational structure, we begin by formalizing the
concept of a role. Each project member fills one, possibly several, roles within a project. Each
team also fills a role within a project. There are a number of different project roles, including
project manager, team leader, coordinator, specialty staff, programmer, DBMS administrator, test
group, support group, implementation group, etc. Each role has a title. Each role has an assigned
group of responsibilities. Each role has a level of authority associated with it. Each role has a set
of skills that a person or team fulfilling that role is expected to possess.

To  formalize  the  concept  of  project  roles  several  sets  must  be  defined.  The  set  of  all  possible  skills 
is  denoted  by  the  set  SKILL.  The  set  of  all  possible  authority  levels  is  denoted  by  AUTHORITY. 
The  set  of  all  possible  responsibilities  is  denoted  by  RESPONSIBILITY.  The  set  of  all  possible 
titles  is  denoted  by  TITLE. 

SKILL  =  The  set  of  all  possible  skills 
AUTHORITY  =  The  set  of  all  possible  authority  levels 
RESPONSIBILITY  =  The  set  of  all  possible  responsibilities 
TITLE  =  {  project-manager,  team-leader,  coordinator,  programmer, 
DBMS-administrator,  project-librarian,  task-leader,...  } 

A  project  role  is  denoted  by  the  schema  ROLE. 

_ ROLE _

title : TITLE
duties : 𝔽 RESPONSIBILITY
auth : 𝔽 AUTHORITY
skills : 𝔽 SKILL


At  the  lowest  level  of  detail  within  the  organizational  structure  information  about  project 
members  is  given.  Each  member  has  a  name.  Each  member  has  a  date  that  they  are  expected  to 
start  work  on  the  project  and  a  date  after  which  they  are  expected  to  be  re-assigned  off  the  project. 
Such  information  is  useful  for  scheduling  multiple  projects  within  an  organization.  Each  member 
has  one  or  more  tasks  to  which  they  are  assigned.  Each  member  is  assigned  to  a  team.  One  issue 
is  whether  a  person  can  serve  on  more  than  one  team.  Each  person  has  an  assigned  role  within  a 
team.  Thus  a  person  assigned  to  multiple  teams  may  have  multiple  roles  to  perform.  Information 
about  a  project  member  is  formalized  by  the  schema  MEMBER. 

_ MEMBER _

start-date : DATE
stop-date : DATE
assigned-to : TEAM-NAME ⇸ ROLE
assignments : 𝔽 TASK-NAME


When a new member is initially assigned to a project there may be no information about
that member yet. Therefore, a constant schema NEW-MEMBER is defined that represents initial
member information.

_ NEW-MEMBER _

MEMBER

start-date = ε
stop-date = ε
assigned-to = ∅
assignments = ∅


The  list  of  all  personnel  assigned  to  a  project  is  denoted  by  PROJECT-PERSONNEL. 

PROJECT-PERSONNEL : PERSON-NAME ⇸ MEMBER

Project  members  are  assigned  to  teams.  A  team  is  the  focal  point  for  achieving  task  goals. 
Each  team  is  assigned  one  or  more  tasks  and  is  responsible  for  accomplishing  those  tasks  efficiently, 
on-time,  and  according  to  the  task  procedure  (task  plan).  Teams  map  closely  to  tasks.  Each  team 
has a team leader who is also the leader of at least one task. Workers assigned to tasks led by this
person are drawn from this person's team. Note that it is possible to assign members of different
teams to a task. For example, a review task often requires people from different teams. The concept
of a team is denoted by TEAM.

_ TEAM _

leader : PERSON-NAME
members : 𝔽 PERSON-NAME
tasks : 𝔽 TASK-NAME
role : ROLE


Information about project teams is denoted by the function PROJECT-TEAMS.

PROJECT-TEAMS : TEAM-NAME ⇸ TEAM

where the set TEAM-NAME is defined as:

TEAM-NAME = The set of all possible team names
TEAM-NAME ⊆ NAME

Organizational  Structure  Operations 

There  are  several  operations  that  apply  to  role  information.  These  operations  include: 

• Create-role

• Add-role-duty

• Add-role-authority

• Add-role-skill

• Delete-role-duty

• Delete-role-authority

• Delete-role-skill

• List-role-duty

•  List-role-authority 

•  List-role-skill 

There  are  several  operations  that  can  be  defined  on  member  information.  These  include: 

•  Set-memb-start 

•  Set-memb-stop 

•  Assign-memb-team 

•  Assign-memb-task 

•  Remove-memb-team 

•  Remove-memb-task 

• List-memb-teams

• List-memb-roles

• List-memb-team-role

• List-memb-tasks

There are several operations that can be defined on project personnel information. These
include:

• Add-proj-member

• Delete-proj-member

• Get-proj-member

• Update-proj-member

• List-personnel

There are several operations that can be defined on team information. These include:

• Set-team-leader

• Add-team-member

• Add-team-task

• Delete-team-member

• Delete-team-task

• Get-team-leader

•  List-team-members 

•  List-team-tasks 

There  are  several  operations  that  can  be  defined  on  project  team  information.  These  include: 

•  Add-proj-team 

•  Delete-proj-team 

•  Get-proj-team 

• Update-proj-team

•  List-proj-teams 

To denote that an operation changes information the Δ notation is used.

Δ ROLE = [ ROLE, ROLE' ]

Δ MEMBER = [ MEMBER, MEMBER' ]

Δ PROJECT-PERSONNEL = [ PROJECT-PERSONNEL, PROJECT-PERSONNEL' ]

Δ PROJECT-TEAMS = [ PROJECT-TEAMS, PROJECT-TEAMS' ]

To denote that an operation does not change information the Ξ notation is used.

Ξ ROLE = [ ROLE, ROLE' | ROLE' = ROLE ]

Ξ MEMBER = [ MEMBER, MEMBER' | MEMBER' = MEMBER ]

Ξ PROJECT-PERSONNEL = [ PROJECT-PERSONNEL, PROJECT-PERSONNEL' |
    PROJECT-PERSONNEL' = PROJECT-PERSONNEL ]

Ξ PROJECT-TEAMS = [ PROJECT-TEAMS, PROJECT-TEAMS' |
    PROJECT-TEAMS' = PROJECT-TEAMS ]

The operation Create-role is used to create a new role object to hold information about
somebody or some team. At the time of creation only the role title is set. The remainder of the role
information must be filled in using the add-role operations.

_ Create-role _

Δ ROLE
title? : TITLE

title' = title?
duties' = ∅
auth' = ∅
skills' = ∅

The operation Add-role-duty is used to add a new responsibility to an existing role.


_ Add-role-duty _

Δ ROLE
new-duty? : RESPONSIBILITY

new-duty? ∉ duties
duties' = duties ∪ { new-duty? }
title' = title
auth' = auth
skills' = skills


The  operation  Delete-role-duty  is  used  to  remove  an  assigned  responsibility  from  a  role. 

_ Delete-role-duty _

Δ ROLE
ex-duty? : RESPONSIBILITY

ex-duty? ∈ duties
duties' = duties \ { ex-duty? }
title' = title
auth' = auth
skills' = skills


The  operation  List-role-duty  is  used  to  list  all  the  responsibilities  assigned  to  the  specified  role. 

_ List-role-duty _

Ξ ROLE
list! : 𝔽 RESPONSIBILITY

list! = duties
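
The role operations above can be sketched in Python as follows. This is an illustrative reading under stated assumptions: titles, responsibilities, authorities, and skills are represented as strings, failed preconditions raise exceptions, and the example role and duties are invented.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Role:
    title: str
    duties: frozenset = frozenset()
    auth: frozenset = frozenset()
    skills: frozenset = frozenset()


def create_role(title: str) -> Role:
    """Create-role: only the title is set; duties, auth, and skills start empty."""
    return Role(title=title)


def add_role_duty(role: Role, new_duty: str) -> Role:
    """Add-role-duty: requires that the duty is not already assigned."""
    if new_duty in role.duties:
        raise ValueError(f"duty already assigned: {new_duty}")
    return replace(role, duties=role.duties | {new_duty})


def delete_role_duty(role: Role, ex_duty: str) -> Role:
    """Delete-role-duty: requires that the duty is currently assigned."""
    if ex_duty not in role.duties:
        raise ValueError(f"duty not assigned: {ex_duty}")
    return replace(role, duties=role.duties - {ex_duty})


def list_role_duty(role: Role) -> frozenset:
    """List-role-duty: a read-only operation returning the assigned duties."""
    return role.duties


librarian = create_role("project-librarian")
librarian = add_role_duty(librarian, "maintain document archive")
librarian = add_role_duty(librarian, "control baselines")
trimmed = delete_role_duty(librarian, "control baselines")
```

The operations on authorities and skills, which the text notes are similar, would differ only in the field they update.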


The  other  operations  on  role  information  that  manipulate  the  role’s  authorities  and  skills  are 
similar  to  the  duty  operations.  These  other  operations  are  not  given  here. 

The  operation  Set-memb-start  is  used  to  set  the  starting  date  for  a  person  assigned  to  a  project. 
This  is  the  date  on  which  the  person  is  to  start  work  on  the  project. 

_ Set-memb-start _

Δ MEMBER
start? : DATE

start-date' = start?
stop-date' = stop-date
assigned-to' = assigned-to
assignments' = assignments

The operation Set-memb-stop specifies the date on which the person is to stop working on the
project.

_ Set-memb-stop _

Δ MEMBER
stop? : DATE

stop-date' = stop?
start-date' = start-date
assigned-to' = assigned-to
assignments' = assignments


The  operation  Assign-memb-team  is  used  to  assign  a  person  to  a  project  team.  Note  that  it  is 
possible  for  one  person  to  be  assigned  to  more  than  one  team.  Whether  this  is  a  good  idea  or  not 
is  up  to  management.  When  assigning  a  person  to  a  team  that  person  must  also  be  given  a  role  in 
the  team. 

_ Assign-memb-team _

Δ MEMBER
team? : TEAM-NAME
role? : ROLE
Ξ PROJECT-TEAMS

team? ∈ dom PROJECT-TEAMS
team? ∉ dom assigned-to
assigned-to' = assigned-to ∪ { team? ↦ role? }
start-date' = start-date
stop-date' = stop-date
assignments' = assignments


The  operation  Assign-memb-task  is  used  to  assign  a  person  to  a  task. 

_ Assign-memb-task _

Δ MEMBER
task? : TASK-NAME
Ξ PROJECT-TASK

task? ∈ dom PROJECT-TASK
task? ∉ assignments
assignments' = assignments ∪ { task? }
start-date' = start-date
stop-date' = stop-date
assigned-to' = assigned-to


The operation Remove-memb-team is used to take a person off a team to which the person is
assigned.

_ Remove-memb-team _

Δ MEMBER
team? : TEAM-NAME
Ξ PROJECT-TEAMS

team? ∈ dom PROJECT-TEAMS
team? ∈ dom assigned-to
assigned-to' = { team? } ⩤ assigned-to
start-date' = start-date
stop-date' = stop-date
assignments' = assignments


Other  operations  on  the  organizational  structure  information  are  not  given  at  this  time. 
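
The member operations above can be read as guarded updates on a small record. The following is a minimal Python sketch under that reading, not part of the specification: the class name Member, the function names, and the use of plain sets of team and task names in place of dom PROJECT-TEAMS and dom PROJECT-TASK are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Dict, Optional, Set


@dataclass
class Member:
    """State of one MEMBER: planned dates, team roles, and task assignments."""
    start_date: Optional[date] = None
    stop_date: Optional[date] = None
    assigned_to: Dict[str, str] = field(default_factory=dict)  # team name -> role
    assignments: Set[str] = field(default_factory=set)         # task names


def set_memb_start(member: Member, start: date) -> None:
    """Set-memb-start: only the start date changes."""
    member.start_date = start


def set_memb_stop(member: Member, stop: date) -> None:
    """Set-memb-stop: only the stop date changes."""
    member.stop_date = stop


def assign_memb_team(member: Member, team: str, role: str, team_names: Set[str]) -> None:
    """Assign-memb-team: the team must exist and the person must not already be on it."""
    if team not in team_names:
        raise ValueError(f"unknown team: {team}")
    if team in member.assigned_to:
        raise ValueError(f"already assigned to team: {team}")
    member.assigned_to[team] = role


def assign_memb_task(member: Member, task: str, task_names: Set[str]) -> None:
    """Assign-memb-task: the task must exist and must not already be assigned."""
    if task not in task_names:
        raise ValueError(f"unknown task: {task}")
    if task in member.assignments:
        raise ValueError(f"already assigned to task: {task}")
    member.assignments.add(task)


def remove_memb_team(member: Member, team: str, team_names: Set[str]) -> None:
    """Remove-memb-team: the person must currently be assigned to the team."""
    if team not in team_names:
        raise ValueError(f"unknown team: {team}")
    if team not in member.assigned_to:
        raise ValueError(f"not assigned to team: {team}")
    del member.assigned_to[team]
```

Each function changes exactly one component of the record and leaves the others alone, which mirrors the "unchanged" equalities that close each schema. A violated precondition is reported as an error here; the schemas themselves say nothing about what happens in that case.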

4.2.16  Scheduling 

Every  project  has  a  schedule  that  specifies  when  a  task  will  begin  and  when  it  is  expected  to  end. 
At  the  beginning  of  a  project  an  initial  schedule  is  developed  and  as  the  project  progresses  the 
schedule  must  often  be  changed.  The  schedule  is  closely  watched  by  project  management  and  a 
great  deal  of  importance  is  frequently  given  to  scheduling  tasks  such  that  the  promised  due  date 
can  be  met. 

There is no separate schedule object within this specification. The information used by a schedule
consists of task names and planned start and stop dates. The start and stop information for each task
is held within the task information as formalized by TASK. Defining a schedule therefore simply
updates the planning information within each task.

Schedule  Operations 

While  there  is  no  schedule  object  there  are  several  operations  on  the  schedule  information  that  are 
necessary.  These  operations  include: 

•  Set-sched-dates 

•  Change-sched-start 

•  Change-sched-stop 

•  List-schedule 

One disadvantage of the operations specified below is that when a new date is specified, the
currently specified date is lost. During project tracking the slippage of scheduled dates should be
noted and recorded. This shortcoming can be corrected by modifying the definition of the TASK
object and these operations. For now, however, the shortcoming is left uncorrected.

The operation Set-sched-dates is used to initially set the start and stop dates for a single specified
task.


_ Set-sched-dates _

Δ PROJECT-TASK
task? : TASK-NAME
start? : DATE
stop? : DATE

PROJECT-TASK' = PROJECT-TASK ⊕ { task? ↦
    Assign-task-dates( PROJECT-TASK(task?), start?, stop? ) }


The  operation  Change-sched-start  is  used  to  update  just  the  planned  starting  date  assigned  for 
a  task. 


_ Change-sched-start _

Δ PROJECT-TASK
task? : TASK-NAME
start? : DATE

PROJECT-TASK' = PROJECT-TASK ⊕ { task? ↦
    Replace-task-pstart( PROJECT-TASK(task?), start? ) }


The operation Change-sched-stop is used to update just the planned stopping date assigned to
a task.


_ Change-sched-stop _

Δ PROJECT-TASK
task? : TASK-NAME
stop? : DATE

PROJECT-TASK' = PROJECT-TASK ⊕ { task? ↦
    Replace-task-pstop( PROJECT-TASK(task?), stop? ) }


The  operation  List-schedule  is  used  to  generate  a  report  on  the  project  schedule.  Here  a  relation 
is  created  that  specifies  an  ordering  among  project  tasks. 

_ List-schedule _

Ξ PROJECT-TASK
schedule! : TASK-NAME ↔ TASK-NAME

schedule! = { n1, n2 : TASK-NAME | n1 ∈ dom PROJECT-TASK ∧
              n2 ∈ dom PROJECT-TASK ∧
              ( Get-task-pstart(n1) = Get-task-pstart(n2) ∨
                Get-task-pstart(n1) AFTER Get-task-pstart(n2) ) }
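
The schedule operations can be sketched in Python as functions that override one entry of a task mapping and leave the rest untouched. This is an illustrative sketch only: the Task record with just two planned dates, the function names, and the decision to skip tasks without a planned start when listing the schedule are assumptions, not part of the specification.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Dict, Optional, Set, Tuple


@dataclass(frozen=True)
class Task:
    """Only the planning fields of TASK that the schedule operations touch."""
    planned_start: Optional[date] = None
    planned_stop: Optional[date] = None


ProjectTasks = Dict[str, Task]


def set_sched_dates(tasks: ProjectTasks, name: str, start: date, stop: date) -> ProjectTasks:
    """Set-sched-dates: return a new mapping with one task's dates overridden."""
    return {**tasks, name: replace(tasks[name], planned_start=start, planned_stop=stop)}


def change_sched_start(tasks: ProjectTasks, name: str, start: date) -> ProjectTasks:
    """Change-sched-start: override only the planned start of one task."""
    return {**tasks, name: replace(tasks[name], planned_start=start)}


def change_sched_stop(tasks: ProjectTasks, name: str, stop: date) -> ProjectTasks:
    """Change-sched-stop: override only the planned stop of one task."""
    return {**tasks, name: replace(tasks[name], planned_stop=stop)}


def list_schedule(tasks: ProjectTasks) -> Set[Tuple[str, str]]:
    """List-schedule: pairs (n1, n2) where n1 starts at the same time as or after n2."""
    dated = {n: t.planned_start for n, t in tasks.items() if t.planned_start is not None}
    return {(n1, n2) for n1, s1 in dated.items() for n2, s2 in dated.items() if s1 >= s2}
```

Because each operation returns a new mapping, the previous mapping is still available to the caller. Keeping it would be one simple way to record slippage, which the operations as specified do not do.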

4.3  Tasks


There  are  many  tasks  that  must  be  done  to  prepare  a  re-engineering  project.  These  tasks  require 
input  from  the  outside  world.  These  tasks  in  turn  create  new  information  and  knowledge  that  is 
used  during  a  project.  The  information  objects  and  the  operations  defined  in  the  previous  section 
are  used  by  these  tasks. 

Much  of  the  input  to  the  Analysis  and  Planning  phase  is  human  knowledge  and  information. 
However,  there  are  several  other  inputs  as  well.  These  other  inputs  include: 

Input 

•  Existing  System  Source  Code 

•  Existing  System  Documentation 

•  Existing  System  Test  Plans 

•  PROJECT-MODELS 

•  HIST_PROJ_DB 

There  are  many  outputs  from  this  task  as  well.  Each  of  the  outputs  listed  below  is  created  by 
a  sub-task  of  the  Analysis  and  Planning  phase. 

Output 

•  ACCEPT_CRITERIA

•  DEFINITION 

•  ENVIRONMENTS 

•  INVENTORY 

•  RESOURCES 

•  PROJECT-ESTIMATES 

•  PROJECT-PERSONNEL 

•  PROJECT-PLAN 

•  PROJECT-PROCEDURES 

•  PROJECT-STANDARDS 

•  PROJECT-TASKS 

•  PROJECT-TEAMS 

•  PROJECT-TOOLS 

•  TEST-PLAN 

The sub-tasks of the Analysis and Planning phase are discussed in the following subsections.
Remember that the Analysis and Planning phase is itself a task. For each sub-task, a description of
the task and a list of its inputs and outputs are given, followed by a discussion of concerns or
processing within the task. These are not complete, in-depth descriptions of the sub-tasks; the
complete details of each task are too voluminous to be given here.

4.3.1  Determine  Motivations  and  Objectives

This  is  the  first  task  of  the  Analysis  and  Planning  phase.  The  purpose  of  this  task  is  to  identify 
the  motivations  for  re-engineering  the  existing  system  and  the  general  project  objectives.  This 
task  focuses  on  two  questions.  Why  does  the  system  need  to  be  re-engineered?  What  is  this  project 
supposed  to  achieve?  The  answers  to  these  questions  can  be  used  to  determine  the  scope  of  the 
project.  This  allows  the  project  planners  to  state  what  can  and  can  not  be  done  during  the  project. 

Input 

•  Application  Domain  Knowledge 

•  Environment  Knowledge 

•  Existing  System  Knowledge 

•  Programming  Language  Knowledge 

•  Software  Re-engineering  Knowledge 

Output 

•  DEFINITION 

DISCUSSION : 

The  first  step  in  this  task  is  to  identify  the  motivations  for  the  re-engineering  project.  Either 
there  are  problems  with  the  system  that  must  be  addressed  or  there  are  changes  in  the  system’s 
environment  that  affect  the  system.  Perhaps  there  are  new  requirements  that  need  to  be  addressed. 
Future  plans  for  the  system  should  also  be  considered  and  preparations  incorporated  into  the 
system. 

The second step is to identify the project objectives. The objectives describe what the
re-engineering project is to achieve and, in general terms, describe the desired characteristics of the
target system. For each project motivation there is an objective to satisfy that motivation. In
addition, once the decision to embark on a re-engineering project has been made, other objectives
that are not related to the original motivations are often included.

4.3.2  Analyze  Environments 

This  task  is  only  necessary  if  a  motivation  for  the  re-engineering  project  is  to  convert  the  system  to 
operate in a new environment. The new environment is called the target environment. In the lesser
case the target environment results from an upgrade or minor change in the existing environment;
in the more extreme case the existing system is to be ported to a new operating system or computer.
Here, software conversion [OSD/FSMSC89] becomes part of the re-engineering project.

The  goal  of  this  task  is  to  identify  differences  in  the  existing  and  target  environments  that  can 
affect  the  system.  For  example,  if  the  system  reads  binary  encoded  files  then  word  size,  byte  order, 
and  data  representations  must  be  examined.  There  may  be  differences  in  operating  system  utilities 
that  the  system  uses.  Any  difference  in  the  environments  that  affects  the  system  will  have  to  be 
handled during the re-engineering project. It is better to know about the differences in advance
than to stumble across them during re-implementation or testing.
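
The byte-order concern can be made concrete with a short Python sketch. It uses the standard struct module; the helper names write_word and read_word are hypothetical and exist only for this illustration.

```python
import struct


def write_word(value: int, byte_order: str) -> bytes:
    """Encode a 32-bit unsigned integer; byte_order is '>' (big) or '<' (little)."""
    return struct.pack(byte_order + "I", value)


def read_word(data: bytes, byte_order: str) -> int:
    """Decode a 32-bit unsigned integer using the given byte order."""
    return struct.unpack(byte_order + "I", data)[0]


# A word written by a big-endian environment ...
record = write_word(0x01020304, ">")

# ... is silently misread when the target environment assumes little-endian.
misread = read_word(record, "<")   # 0x04030201
correct = read_word(record, ">")   # 0x01020304
```

No error is raised in the misread case, which is why such differences are better found during environment analysis than during testing.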

Input


•  DEFINITION 

•  Environment  Knowledge 

•  Programming  Language  Knowledge 

•  Re-engineering  Knowledge 

•  System  Knowledge 

Output 

•  ENVIRONMENTS 

The  first  step  in  this  task  is  to  determine  if  the  project  is  motivated  by  a  change  in  the  existing 
environment.  If  the  existing  and  target  environments  are  the  same  then  this  task  is  not  necessary. 
If  there  have  been  (or  will  be)  changes  in  the  existing  environment  or  the  system  is  being  ported 
to  a  new  environment  then  this  step  is  necessary. 

If there is a change in the environment, the second step is to analyze the existing environment
and list its properties and characteristics. In practice, the system will be examined and the
environment properties that may affect its operation identified. This list is used to guide the
environment analysis. The list is unlikely to be complete, however, so other properties of the
environment should still be determined.

The third step is to analyze the target environment. Again in practice, the list of environment
properties that could affect the system will be used to guide the analysis. Experience in software
conversions will also help guide the analysis.

The fourth step is to compare the results of the two analyses and identify important differences.
The comparison looks for several types of differences. If the system uses a feature of the existing
environment and that feature is different in the target environment, then the difference must be
noted. If the system uses a feature of the existing environment and that feature is not present
in the target environment, that difference must be noted. This is actually the most difficult case,
since the missing feature must be developed on the target environment. Finally, any features in the
target environment that may be useful and that are not found in the existing environment should
be noted. This is particularly true when the target environment is an upgrade of the existing
environment.
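
The three kinds of difference found in the fourth step map directly onto set operations. The sketch below assumes, purely for illustration, that each environment analysis produces a mapping from feature name to a description, and that the set of features the system uses is known; the names EnvDiff and compare_environments and the sample feature names are hypothetical.

```python
from typing import Dict, NamedTuple, Set


class EnvDiff(NamedTuple):
    changed: Set[str]   # used by the system, present in both, but different
    missing: Set[str]   # used by the system, absent from the target
    new: Set[str]       # present only in the target; possibly useful


def compare_environments(existing: Dict[str, str],
                         target: Dict[str, str],
                         used: Set[str]) -> EnvDiff:
    """Compare two environment analyses with respect to the features the system uses."""
    changed = {f for f in used
               if f in existing and f in target and existing[f] != target[f]}
    missing = {f for f in used if f in existing and f not in target}
    new = set(target) - set(existing)
    return EnvDiff(changed, missing, new)


# Hypothetical analysis results for an existing and a target environment.
existing_env = {"word_size": "36-bit", "sort_utility": "v1", "tape_io": "supported"}
target_env = {"word_size": "32-bit", "sort_utility": "v1", "network": "supported"}
diff = compare_environments(existing_env, target_env,
                            used={"word_size", "sort_utility", "tape_io"})
```

The missing set deserves the most attention, since each entry represents a feature that must be developed on the target environment.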

4.3.3  Inventory  Task 

The  purpose  of  the  inventory  task  is  to  identify  all  objects  that  make  up  the  system  and  all  the 
supporting  objects  that  are  needed.  These  are  objects  that  will  either  be  re-engineered  or  used 
during  the  project.  A  system  is  composed  of  programs,  JCL  commands,  data  files,  and  databases. 
User created or vendor supplied libraries may also be used in system programs. In addition, there
may be system documentation, test plans, test cases, test data files, etc., that must be identified.

Input 
•  DEFINITION

•  Cost  Estimation  Knowledge 

•  Environment  Knowledge 

•  Existing  Documents 

•  Existing  System  Knowledge 

•  Programming  Language  Knowledge 

•  Re-engineering  Knowledge 

Output 

•  INVENTORY 

DISCUSSION : 

The  inventory  task  could  follow  this  sequence  of  steps. 

1.  Use  the  project  objectives  to  identify  the  system  being  re-engineered. 

2.  Use  the  system  knowledge  to  identify  all  objects  in  the  system.  This  includes  supporting 
objects. 

3.  Use  environment  knowledge  and  system  knowledge  to  locate  objects. 

4.  Use estimation knowledge, system knowledge, environment knowledge, programming language
knowledge, project objectives, and re-engineering knowledge to determine the information
to be collected for each object.

5.  Use system knowledge, environment knowledge, programming language knowledge, estimation
process knowledge, file names, and existing documentation to collect the required object
information.

The  first  problem  is  to  determine  what  is  needed.  If  the  entire  system  is  to  be  re-engineered 
then  information  about  the  entire  system  must  be  collected.  If  only  parts  of  the  system  are  to 
be  re-engineered  then  these  parts  must  be  identified  and  information  about  them  collected.  The 
project objectives will help clarify what is to be affected. If the entire system is not to be
re-engineered, then the parts of the system that will not be changed but are coupled with the parts
to be re-engineered must also be identified.

Another problem arises if multiple versions of the system exist. A re-engineering goal may be to
combine the different versions. In that case, it will be necessary to inventory the separate versions
of the system.

Documents  for  the  system  must  be  collected.  Some  documents  will  only  exist  as  hardcopies, 
others  will  have  parts  or  the  entire  text  stored  in  computer  files.  A  possible  project  goal  might  be 
to  update  or  replace  documents.  The  quality  of  the  existing  documents  must  be  determined. 

Besides identifying the system objects, it is necessary to collect information about each object.
The necessary information will differ from project to project and from organization to
organization. At the least, the information required to make estimates of project cost and effort
must be collected. Knowledge about the estimation models and their required inputs will help
determine the information to be collected.

Techniques are also required to collect information. The estimation process may specify
techniques and metrics to use to collect information.

The  inventory  task  uses  a  wide  variety  of  inputs.  Knowledge  about  the  system  is  necessary  to 
identify  its  components.  Operations  for  manipulating  the  inventory  information  have  been  specified. 
Operations  and  procedures  for  collecting  information,  and  for  identifying  objects  are  also  needed 
by  this  task. 
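
As an illustration of the collection step, the following Python sketch walks a directory tree and records a few facts about each file. It is only a sketch under stated assumptions: the InventoryItem fields, the suffix-to-kind table, and the choice of size and line count as the information collected are hypothetical, and a real project would take them from its estimation process.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List


@dataclass
class InventoryItem:
    path: str        # location relative to the system root
    kind: str        # program, jcl, data, document, or unknown
    size_bytes: int
    line_count: int


# Hypothetical mapping from file suffix to object kind.
KIND_BY_SUFFIX = {".cbl": "program", ".jcl": "jcl", ".dat": "data", ".txt": "document"}


def collect_inventory(root: str) -> List[InventoryItem]:
    """Locate every file under root and collect basic information about it."""
    items = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        data = path.read_bytes()
        items.append(InventoryItem(
            path=path.relative_to(root).as_posix(),
            kind=KIND_BY_SUFFIX.get(path.suffix.lower(), "unknown"),
            size_bytes=len(data),
            line_count=data.count(b"\n"),
        ))
    return items
```

Objects that exist only as hardcopy, or that live outside the file system, would still have to be added to the inventory by hand.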

4.3.4  Define  Approach 

This  task  is  an  important  and  time-consuming  task  within  the  Analysis  and  Planning  phase.  This 
task  focuses  on  one  question:  Given  the  current  software  system  and  the  properties  of  the  target 
system  how  can  the  target  system  be  created?  The  answer  to  this  question  is  the  project  plan. 

Input 

•  ENVIRONMENTS 

•  DEFINITION 

•  PROJECT-PROCEDURES 

•  Application  domain  knowledge 

•  Existing  system  knowledge 

•  Management  knowledge 

•  Re-engineering  knowledge 

•  Software  engineering  knowledge 

Output 

•  PROJECT-PLAN 

•  PROJECT-TASK 

DISCUSSION : 

The  major  phases  of  a  re-engineering  project  are  given  by  re-engineering  knowledge.  However, 
this  does  not  simplify  the  job  of  organizing  and  planning  for  each  phase.  One  difficult  question  that 
must be addressed is whether the system is to be re-engineered in one "lump sum," in fragments,
or incrementally.

Sometimes a system is just too large to re-engineer completely at one time. Doing so may
result in a long duration project. Sometimes it is better to divide the system into fragments, where
each fragment can be re-engineered separately. The problem with this approach is identifying
fragments that can be isolated from the rest of the system. An alternative is to re-engineer the
system incrementally. This involves setting a series of minor goals and re-engineering the system
to meet these goals one increment at a time. The result of each increment is an improved system
that  can  be  used  while  the  next  increment  is  under  work. 

A  combination  of  re-engineering  knowledge,  project  objectives,  and  existing  system  knowledge 
must  be  used  to  identify  how  the  project  will  be  conducted.  Besides  establishing  the  grand  or  macro 
plan  that  identifies  major  phases  and  tasks,  it  is  necessary  to  establish  micro  plans,  specifying  the 
details  of  each  major  task.  Re-engineering  knowledge  will  identify  useful  tasks,  project  objectives 
will  enable  the  selection  of  necessary  tasks,  and  existing  system  knowledge  will  determine  the  details 
of  how  the  system  can  be  re-engineered. 

4.3.5  Define  Acceptance  Criteria 

The  acceptance  criteria  are  used  during  acceptance  testing  to  verify  that  the  target  system  is 
acceptable  to  the  client.  The  acceptance  criteria  must  be  agreed  on  early  in  the  project,  typically 
before  the  technical  work  begins. 

Input 


•  DEFINITION 

•  Environment  knowledge 

•  Existing  system  knowledge 

•  Application  domain  knowledge 

Output 

•  ACCEPT_CRITERIA

DISCUSSION : 

The criteria must be specific, and each criterion must be verifiable. A criterion such as
"The system will operate correctly on valid inputs" is not a good criterion, since this typically
cannot be proven for a large system.

Acceptance criteria should complement, not duplicate, the internal project quality assurance
procedures. Of course, one criterion can be that the system successfully passes all unit and system
level tests.

4.3.6  Define  Project  Procedures 

The  purpose  of  the  project  procedures  task  is  to  assemble  the  set  of  procedures  to  be  used  during 
a re-engineering project. The organization may have a collection of standard procedures that must
be followed. Some of these procedures may need to be adjusted to fit specific characteristics of the
current project and its environment. Each project is different, so a certain amount of adjustment
of procedures is to be expected. New procedures may also need to be developed to handle tasks
unique to re-engineering or the current project.

Input


•  See  Below 

Output 

•  PROJECT-PROCEDURES 

DISCUSSION :

The input for this task was left unspecified. Collecting and tailoring existing procedures requires
knowledge of each procedure and its domain, as well as knowledge of the specific project details.
Developing new procedures again requires knowledge of the procedure domains. To collect, tailor,
and develop a complete and useful set of procedures for a project, complete knowledge of every
aspect of the project and the work required is necessary.

4.3.7  Define  Project  Standards 

The  purpose  of  the  standards  task  is  to  assemble  the  set  of  standards  that  will  be  enforced  during 
the  project. 

Input 


•  See  Below. 


Output 


•  PROJECT-STANDARDS

DISCUSSION :

The  input  for  this  task  was  left  unspecified.  The  reason  for  this  is  the  broad  range  of  knowledge 
sources  that  must  be  used  to  examine,  select,  define,  and  change  the  standards. 

This  task  will  typically  begin  by  collecting  existing  standards.  The  initial  project  objectives 
will  help  identify  useful  and/or  necessary  standards.  Some  of  these  may  need  to  be  modified  to 
conform with project specific requirements. An examination of the proposed project plan will also
be helpful in identifying standards that are needed but do not yet exist within the organization.
Such standards will have to be newly defined.

4.3.8  Identify  Resources 

Part of the task of planning a project is identifying any special resources that the project will require.
The  tasks  that  require  these  resources  should  be  noted,  since  the  availability  of  a  resource  can 
affect  the  project  schedule.  The  need  for  a  resource  may  be  typical  of  the  system  or  may  become 
apparent  as  particular  tasks  are  planned. 

Input 


•  Existing  System  Knowledge

•  Environment  Knowledge

•  Organization  Knowledge 

•  Re-engineering  Knowledge 

•  Tool  Knowledge 

Output 

•  RESOURCES

DISCUSSION :

While defining the re-engineering environment, special equipment needs may be identified. These
can range from computers used to re-engineer the code, such as workstations, to special
purpose hardware used by the system. The need for special purpose hardware can affect the system
tests and system transition plans.

Resources can be identified by considering several factors:

•  Does the system require special purpose hardware to operate? For example, the software
that drives a HUD display will need a HUD hardware device. Another example is the need
for a color graphics system to use a color display device.

•  Does  the  system  require  special  purpose  hardware  for  testing? 

•  Does  equipment  need  to  be  allocated  to  the  project  such  as  computers,  terminals,  or  printers? 

•  Does  the  system  require  special  capabilities  such  as  access  to  a  network? 

•  Are  there  key  people  associated  with  the  system  or  the  environment  who  will  not  be  assigned 
to  this  project,  but  whose  knowledge  and  expertise  is  important? 

The  answers  to  these  questions  and  others  will  identify  the  resources  required  by  a  project. 
Identifying  needed  resources  early  in  the  project  simplifies  the  task  of  obtaining  the  resources 
before  they  are  needed. 

4.3.9  Identify  Tools 

The  task  of  identifying  tools  to  be  used  during  a  project  is  likely  to  span  almost  the  entire  planning 
stage. As work proceeds in other planning tasks, tools will be identified and added to the list of
project  tools. 

Input 

•  PROJECT-PROCEDURES

•  PROJECT-TASKS 

•  Environment  knowledge

•  Management  knowledge

•  Re-engineering  knowledge 

•  Testing  knowledge 

•  Tool  knowledge 

Output 

•  PROJECT-TOOLS 

DISCUSSION  : 

The  selection  of  the  appropriate  tools  can  have  a  great  effect  on  the  cost  and  duration  of  a 
project.  Some  tools  are  taken  for  granted  such  as  text  editors  and  language  compilers.  Other 
tools such as configuration managers, document management systems, software engineering
environments, and test case generators are not used in all organizations. The value of some tools may
not  be  appreciated  by  those  who  have  not  used  them  before  or  by  those  who  have  not  confronted 
the  problems  such  tools  address. 

One  problem  that  a  project  may  face  is  that  a  desired  tool  does  not  exist  in-house.  The 
alternatives are to purchase the tool, develop it in-house, or do without. The pros and cons of
each  of  these  alternatives  must  be  judged  carefully  as  the  decision  can  have  an  effect  on  the  project 
schedule. 


4.3.10  Test  Planning 

This  task  focuses  on  preparing  the  test  plan  for  the  target  system.  It  may  not  be  possible  to 
specify  the  details  of  each  individual  test  during  the  planning  stage,  but  it  is  possible  to  plan  the 
testing  strategy.  Test  planning  for  a  re-engineering  project  differs  from  test  planning  for  software 
development  since  there  may  already  be  an  existing  test  plan  for  the  existing  system.  Work  on  the 
target  system  test  plan  can  begin  by  examining  the  existing  system  test  plan. 


Input 

•  INVENTORY 

•  DEFINITION 

•  ENVIRONMENTS 

•  Application  domain  knowledge 

•  Existing  system  knowledge 

•  Existing  test  plan 

•  Programming  language  knowledge 

•  Tool  knowledge 

Output 


•  TEST_PLAN

DISCUSSION :


The first step is to identify a testing strategy for the target system. Will white-box strategies,
black-box strategies, or both be used? What percentage of coverage will be required of white-box tests? Will
automatic  test  case  generators  be  used?  How  will  the  system  be  tested?  The  answers  to  these 
questions  will  be  used  to  guide  the  development  of  a  test  plan  for  the  target  system. 

The  second  step  is  to  analyze  the  existing  system  test  plan  and  determine  its  suitability  for 
testing  the  target  system.  The  focus  of  this  step  is  to  determine  which  parts  of  the  existing  test 
plan can be re-used without changes, which can be re-used with changes, and which must be deleted.
The existing test plan is compared against the planned changes to the target system; this can be
done using the project DEFINITION information. For example, if the existing test cases are based on
white-box testing and a project goal is to re-design and re-implement a portion of the system, then
the test cases that exercise that portion of the system cannot be re-used. Existing black-box test
cases must be modified if they exercise features of the existing system that will be changed during
the project.

The third step is to identify portions of the target system that no existing tests will exercise.
Examples are new features to be added to the target system, or code that will be replaced and is
currently tested by white-box based tests. Plans for testing these portions of the target system
must be made.
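
The second and third steps can be sketched as a classification of the existing test cases followed by a coverage check. The Python below is a minimal sketch under stated assumptions: each existing test is assumed to record its style and the portions or features it exercises, and the names ExistingTest, classify_tests, and untested_portions are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Set


@dataclass(frozen=True)
class ExistingTest:
    name: str
    style: str                 # "white-box" or "black-box"
    exercises: FrozenSet[str]  # portions or features of the system it covers


def classify_tests(tests: List[ExistingTest],
                   reimplemented: Set[str],
                   changed_features: Set[str]) -> Dict[str, List[str]]:
    """Sort existing tests into those to re-use, modify, or delete."""
    plan: Dict[str, List[str]] = {"reuse": [], "modify": [], "delete": []}
    for test in tests:
        if test.style == "white-box" and test.exercises & reimplemented:
            plan["delete"].append(test.name)   # tied to code that will be replaced
        elif test.style == "black-box" and test.exercises & changed_features:
            plan["modify"].append(test.name)   # feature behavior will change
        else:
            plan["reuse"].append(test.name)
    return plan


def untested_portions(tests: List[ExistingTest],
                      plan: Dict[str, List[str]],
                      target_portions: Set[str]) -> Set[str]:
    """Portions of the target system that no surviving test exercises."""
    kept = set(plan["reuse"]) | set(plan["modify"])
    covered: Set[str] = set()
    for test in tests:
        if test.name in kept:
            covered |= test.exercises
    return target_portions - covered
```

The set returned by untested_portions is the list of portions for which new test plans must be made, including both new features and re-implemented code whose white-box tests were deleted.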

Notice  that  tests  that  are  designed  to  cover  target  system  features  can  be  generated  during  the 
planning  stage  once  the  requirements  for  the  target  system  are  complete.  White-box  test  cases 
that  are  designed  to  cover  statements  or  branches  in  the  target  system  source  code  can  not  be 
generated  until  later  in  the  project  when  the  target  source  code  exists.  However,  plans  for  testing 
such  sections  of  the  system  can  be  made  during  the  planning  stage. 

The  product  of  this  task  is  a  test  plan  for  the  target  system.  The  test  plan  may  contain  test 
groups  for  unit  and  system  tests.  To  create  the  test  plan  an  understanding  of  system  testing,  the 
existing  system,  the  target  system,  and  changes  between  the  two  is  necessary.  To  specify  details  of 
the  test  cases  knowledge  about  how  the  tests  will  be  applied  is  also  required. 

4.3.11  Estimation 

The  key  question  behind  a  re-engineering  estimation  model  is:  Given  an  existing  software  system 
how  much  effort  will  it  take  to  generate  the  desired  target  system?  Thus,  estimation  models  for 
software re-engineering are characterized by the concept of change. A fundamental assumption
underlying software re-engineering is that an existing software system with undesirable properties
is altered to create a new system that is somehow better. To estimate the effort required to
re-engineer an existing software system it will be necessary to estimate the degree of change the
existing  system  must  undergo.  The  greater  the  expected  amount  of  change  the  greater  the  effort 
required  to  produce  the  change. 

Input 


•  DEFINITION 

•  ENVIRONMENTS

•  PROJECT-TOOL

•  PROJECT-MODELS 

•  HIST_PROJ_DB 

•  Re-engineering  knowledge 

•  Management  knowledge 

Output 

•  PROJECT-ESTIMATES

DISCUSSION :

The concept of change can be embodied in a top-down estimation model for software
re-engineering. A top-down model focuses on the re-engineering product, which is typically source
code  (it  can  also  be  documentation  or  test  plans).  A  top-down  model  must  include  at  least  three 
components.  The  first  component  is  some  measure  of  the  properties  of  the  existing  system.  The 
second  component  is  some  measure  of  the  properties  of  the  target  system.  These  two  measures  can 
be  used  to  estimate  the  expected  degree  of  change.  The  third  component  is  some  measure  of  the 
influence  exerted  by  various  factors. 
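
No formula for a top-down model is given here, so the following Python sketch is purely illustrative of how the three components could fit together. Every detail is an assumption: size per module as the measure of each system, the sum of absolute size differences as the degree of change, a single effort-per-unit rate, and multiplicative influence factors.

```python
from math import prod
from typing import Dict


def degree_of_change(existing: Dict[str, int], target: Dict[str, int]) -> float:
    """Fraction of the existing system's size that is added, removed, or resized."""
    total = sum(existing.values())
    if total == 0:
        raise ValueError("existing system has no measured size")
    names = set(existing) | set(target)
    changed = sum(abs(target.get(n, 0) - existing.get(n, 0)) for n in names)
    return changed / total


def estimate_effort(existing: Dict[str, int],
                    target: Dict[str, int],
                    effort_per_unit: float,
                    factors: Dict[str, float]) -> float:
    """Effort = changed size x rate x product of influence factors (all assumed)."""
    changed_units = degree_of_change(existing, target) * sum(existing.values())
    return changed_units * effort_per_unit * prod(factors.values())
```

A calibrated model would replace each of these assumptions with measures and factors drawn from historical project data, which is the role the HIST_PROJ_DB input plays in this task.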

The concept of change can also be embodied in a bottom-up estimation model for software
re-engineering. A bottom-up model focuses on the re-engineering process. Software re-engineering is
based  on  three  fundamental  principles:  abstraction,  alteration,  and  refinement.  These  principles  are 
implemented  in  the  re-engineering  process  by  the  technical  phases:  reverse  engineering,  re-design, 
and  re-implementation.  The  expected  degree  of  change  affects  each  of  these  major  phases.  The 
portions  of  the  system  affected  by  the  proposed  changes  must  be  reverse  engineered  to  a  suitable 
level  of  abstraction.  The  expected  degree  of  change  will  affect  the  amount  of  work  done  in  the 
re-design  phase.  Finally,  all  portions  of  the  system  that  undergo  change  must  be  re-implemented. 

4.3.12  Define  Organizational  Structure 

The  organizational  structure  of  a  project  must  be  assembled  carefully.  It  is  more  than  just  assigning 
the  right  people  to  the  right  jobs.  The  structure  will  define  the  flow  of  work  between  teams.  There 
are  several  possible  structures: 

Assembly  line  Divides  the  project  into  functional  teams.  There  is  a  management  team,  a  reverse 
engineering  team,  a  quality  assurance  team,  an  implementation  team,  etc.  Each  team  works 
only  in  its  functional  area. 

Parallel  teams  Each team performs similar tasks, but works on a different portion of the system.
Thus one team will handle tasks from many functional areas. This approach often results in
better project morale since a person's work assignments vary, but it involves
a longer learning curve.

Input

•  PROJECT-TASK

•  Management  knowledge 

•  Re-engineering  knowledge 

•  Organization  knowledge 

Output 

•  PROJECT-TEAMS 

•  PROJECT-PERSONNEL 

DISCUSSION : 

The organizational structure and the size of the teams may vary from phase to phase.
Factors that influence the structure include the staffing and resource requirements of the tasks, team
functions, number and type of tasks, and the skill level of the personnel.

Initially  the  number  of  people  assigned  to  a  project  will  be  small.  The  number  will  increase  as 
the planning phase finishes and the other phases begin. It is important that the key project people
be  assigned  to  the  project  at  the  beginning  or  as  early  as  possible.  These  people  must  have  the 
best  understanding  of  the  project  and  what  it  involves.  This  knowledge  can  best  be  gained  by 
participating  in  the  project  planning.  Another  important  point  is  that  project  personnel  must  be 
assigned  to  the  project  full  time.  Re-engineering  is  not  a  part-time  job. 

4.3.13  Schedule  Tasks 

Developing  a  good  project  schedule  means  solving  a  difficult  puzzle.  Estimating  how  long  a  task 
will  take  to  complete  is  difficult.  Typically,  a  project  schedule  will  be  revised  several  times  during 
a  project  to  reflect  slippage  in  the  schedule. 

Input 

•  PROJECT-TASK 

•  Management  knowledge 

•  Re-engineering  knowledge 

•  Tool  knowledge 

•  Organization  knowledge 

Output 

•  PROJECT-TASK' 

DISCUSSION : 

To schedule the project tasks adequately, an understanding of what each task involves is required.
The tools available for a task and the people assigned to a task are also factors that must be
considered. Personnel may have learning curves to climb in doing re-engineering work, in
using new tools, or in working in a new environment. This lengthens the task time.

Care  must  be  given  to  the  ordering  of  tasks.  Bottlenecks  must  be  detected  and  if  possible 
removed  so  that  portions  of  a  project  don’t  come  to  a  halt  while  waiting  for  other  tasks  to  finish. 
There  are  a  number  of  commercial  scheduling  tools  available.  While  these  tools  have  been  developed 
to  schedule  software  development  projects,  there  is  no  real  difference  between  the  scheduling  needs 
of  software  development  and  software  re-engineering,  though  the  technical  work  is  clearly  different. 
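
The ordering concern above can be illustrated with a small sketch. It is not part of the process model; it only assumes that each task has an estimated duration and a list of prerequisite tasks, and it computes the earliest start of every task plus the chain of tasks that determines the project length (the place where a bottleneck hurts most). The function name, task names and durations are hypothetical.

```python
from collections import deque

def earliest_schedule(durations, depends_on):
    """Return (earliest_start, critical_path) for a task graph.

    durations  -- dict: task name -> estimated duration (e.g. in days)
    depends_on -- dict: task name -> list of tasks that must finish first
    """
    # Build successor lists and prerequisite counts for a topological sort.
    successors = {t: [] for t in durations}
    waiting_on = {t: 0 for t in durations}
    for task, prereqs in depends_on.items():
        for p in prereqs:
            successors[p].append(task)
            waiting_on[task] += 1

    ready = deque(sorted(t for t in durations if waiting_on[t] == 0))
    start = {t: 0 for t in durations}
    held_up_by = {t: None for t in durations}  # prerequisite that finishes last
    done = 0
    while ready:
        t = ready.popleft()
        done += 1
        finish = start[t] + durations[t]
        for s in successors[t]:
            if finish > start[s]:
                start[s] = finish
                held_up_by[s] = t
            waiting_on[s] -= 1
            if waiting_on[s] == 0:
                ready.append(s)
    if done != len(durations):
        raise ValueError("task dependencies contain a cycle")

    # Walk back from the task that finishes last to recover the critical path.
    task = max(durations, key=lambda t: start[t] + durations[t])
    path = []
    while task is not None:
        path.append(task)
        task = held_up_by[task]
    return start, path[::-1]

# Hypothetical tasks and estimates, for illustration only.
durations = {"inventory": 3, "train_tools": 5, "reverse_engineer": 10,
             "test_plan": 4, "implement": 8}
depends_on = {"reverse_engineer": ["inventory", "train_tools"],
              "test_plan": ["inventory"],
              "implement": ["reverse_engineer", "test_plan"]}
start, critical_path = earliest_schedule(durations, depends_on)
```

In this example the learning-curve task `train_tools` sits on the critical path, so shortening it shortens the whole project, while `test_plan` has slack.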

4.3.14  Review  Project  Plans 

The final task of the Analysis and Planning phase is to review the information collected during this
phase. The project approach, procedures, objectives, acceptance criteria, etc., must be reviewed
and accepted before the next phase of the project can begin.

Input 

•  ACCEPT_CRITERIA 

•  DEFINITION 

•  ENVIRONMENTS 

•  INVENTORY 

•  RESOURCES 

•  PROJECT-ESTIMATES 

•  PROJECT-PERSONNEL 

•  PROJECT-PLAN 

•  PROJECT-PROCEDURES 

•  PROJECT-STANDARDS 

•  PROJECT-TASKS 

•  PROJECT-TEAMS 

•  PROJECT-TOOLS 

•  TEST-PLAN 

Output 

•  PROJECT-TASKS 


DISCUSSION  : 

Before the software re-engineering project can continue, the project plan must be reviewed and
accepted not only by project management, but by upper management as well. The support of
upper management is important to a re-engineering project, and upper management must be aware
of what such a project involves. Try to avoid surprises down the road.

If errors are found in the project information, or potential problems are spotted, it may be
necessary  to  go  back  and  repeat  one  or  more  analysis  and  planning  tasks  to  correct  the  problems. 
Potential  problems  should  not  be  passed  on  with  the  attitude  that  they  will  be  dealt  with  when 
they  arise.  It  is  better  to  resolve  problems  before  they  become  problems! 

When  the  project  plan  is  accepted,  authority  to  begin  the  next  phase  of  the  project  is  given. 
This  causes  an  update  in  the  status  of  the  project  which  is  why  PROJECT-TASKS  is  listed  as  an 
output  of  this  task.

Chapter  5

Conclusions 


An  issue  that  arises  while  modeling  and  describing  the  software  re-engineering  process  is  whether 
any  such  model  is  valid.  To  model  the  process  is  to  write  a  description  of  how  software  re-engineering 
is  done.  The  sources  of  information  used  to  create  a  process  model  must  be  evaluated.  One  source 
of  information  is  experience  taken  from  successful  and  unsuccessful  re-engineering  projects.  This 
results  in  a  description  of  how  the  process  can  be  done.  Another  source  is  conjecture  based  on  the 
modeler's  understanding  of  re-engineering.  This  results  in  a  description  of  how  the  process  could
be  done. 

A  process  modeler  faced  with  a  poorly  understood  portion  of  a  process  may  create  a  model 
that  seems  reasonable  but  is  not  based  on  re-engineering  experience.  Here,  the  modeler  creates  a 
possible  model  of  the  process.  This  is  a  plausible  process  model,  one  that  is  not  based  on  proven 
experience.  A  plausible  process  model  describes  an  untried  process.  A  plausible  model  may  not 
be  suitable  for  guiding  a  real  project,  i.e.  it  may  not  allow  valid  actions  within  the  process  or  may 
omit  necessary  actions. 

A  process  model  based  on  re-engineering  experience  has  more  validity,  but  only  so  far  as  it
captures  a  true  understanding  of  the  dynamics  of  the  re-engineering  process.  Experience  must  be 
accumulated  from  many  projects.  Each  project  is  different.  It  is  necessary  to  abstract  common 
process  elements  from  each  project.  When  reviewing  multiple  projects  it  is  easy  to  focus  on  some 
aspects  of  the  process  and  ignore  others.  This  leads  to  an  incomplete  process  model. 

A  high-level  process  model  is  emerging  from  reported  software  re-engineering  experience  and 
research.  Unfortunately,  reports  on  re-engineering  projects  tend  to  focus  on  the  technical  aspects 
of  the  projects.  This  has  resulted  in  a  lack  of  reported  information  about  other  aspects  of
re-engineering  projects  such  as  management  and  support  concerns.

One  of  the  contributions  of  this  work  is  a  more  complete  model  of  the  software  re-engineering 
process.  Chapter  3  presented  a  high-level  process  model  for  software  re-engineering  that  extends  the 
traditional  re-engineering  process  model  by  including  other  phases  that  should  occur  within  software 
re-engineering projects. The traditional model is derived from reported experience with software
re-engineering. The extended process model presented in Chapter 3 is a plausible process model. The
basis for the extensions derives from experience with software development and software conversion.
There is more to software re-engineering than just technical work, and the extended process model
addresses these previously ignored phases.

The  extensions  to  the  traditional  process  model  create  a  more  complete  model  of  the  software 
re-engineering  process.  It  is  apparent  from  studying  the  model  that  there  is  a  broad  range  of  issues 
and  concerns  that  must  be  addressed  to  adequately  capture  and  model  the  re-engineering  process. 
There  are  many  details  to  be  addressed  within  re-engineering,  far  more  than  just  the  technical 
concerns. 

The  specification  of  the  Analysis  and  Planning  phase  given  in  Chapter  4  demonstrates  the  range 
of  concerns  and  volume  of  information  that  must  be  modeled.  This  specification  focused  on  only 
one  phase,  and  within  that  only  focused  on  information  and  operations  on  this  information.  There 
are  numerous  details  about  this  phase  that  are  not  covered  in  the  specification.  The  Analysis  and 
Planning  phase  has  many  tasks  that  must  be  specified  in  greater  detail.  The  specification  of  this 
phase  is  a  good  start,  but  there  is  much  work  left  to  be  done.  The  goal  is  to  describe  the  process 
in  enough  detail  that  the  model  can  be  used  to  successfully  guide  a  re-engineering  project.

There are several difficulties that prevent the quick specification of software re-engineering
process details. First, software re-engineering has not received much attention until lately. Our
understanding of the problem and possible solutions is still growing. Software development has been
addressed for over 30 years and it is still not possible to completely specify the development process.
A detailed model of the software re-engineering process will take time, and more experience with
re-engineering must be gained first.

Software  re-engineering  project  details  must  be  reported.  Re-engineering  projects  must  be 
tracked  and  detailed  information  about  the  processes  used  within  a  project  must  be  recorded.  This 
information  must  be  shared  so  that  common  process  elements  and  approaches  can  be  identified.  A
good  high-level  model  of  the  re-engineering  process  is  emerging.  If  the  model  is  to  be  filled  out  to 
lower  levels  of  detail  then  experience  must  be  collected  and  shared. 

The  software  re-engineering  process  is  complex.  Given  the  range  of  concerns  and  issues  within 
re-engineering  it  may  be  better  to  focus  on  individual  tasks  within  the  process  and  specify  these. 
This  means  solving  the  problem  a  piece  at  a  time.  Eventually,  the  pieces  can  be  assembled  to  create 
a  complete  process  model. 

The  high-level  process  model  is  needed  to  provide  a  framework  for  understanding  and  describing 
the  process.  A  high-level  process,  such  as  the  one  presented  in  Chapter  3,  identifies  the  pieces  that 
must  be  investigated.  Even  the  specification  in  Chapter  4,  which  fills  in  many  details  about  one 
phase  of  the  process,  reveals  tasks  that  require  more  in-depth  research  and  modeling.

Some software re-engineering tasks are similar to tasks in other domains such as software
development and software conversion. There is a wealth of detailed experience in software development.
By identifying similarities between software development tasks and software re-engineering tasks,
additional process details can be identified. Task scheduling is one example: the schedule for a
re-engineering project is just as important as the schedule for a software development project. There
is a great deal of research, experience, and tooling that addresses scheduling software
development tasks, and much of this is applicable to software re-engineering as well.

The current approach to modeling the software re-engineering process is to continue to evolve
the high-level model and learn where more process details are necessary. By studying re-engineering
projects, additional process details will be detected. By comparing software re-engineering and
software development, experience from development can be reused to further describe the re-engineering
process. Finally, future research can investigate better means of re-engineering software, leading to
more useful models of the re-engineering process.

It must be realized that there is no one software re-engineering process model. Every project
is different. Just as different approaches are used to develop software for business, scientific, and
military applications, so different re-engineering approaches are needed. Re-engineering process
models must be able to address different re-engineering approaches.

One  re-engineering  approach  is  lump  sum  re-engineering.  Here,  an  entire  software  system 
is  re-engineered  at  the  same  time,  starting  with  the  existing  system  and  ending  with  the  desired 
target  system.  However,  this  approach  is  not  always  suitable.  One  reason  is  the  drain  on  resources 
to  re-engineer  a  large  system. 

Another  re-engineering  approach  is  incremental  re-engineering.  Here,  an  existing  system 
is  re-engineered  in  increments.  The  entire  system  is  re-engineered,  but  instead  of  producing  the 
desired  target  system,  one  or  more  intermediate  systems  are  produced.  Each  intermediate  system
has  better  properties  than  the  preceding  system  version  and  is  closer  to  the  desired  target  system.
This  approach  has  the  advantage  that  improved  versions  of  the  system  are  produced  more  quickly, 
but  the  length  of  time  required  to  produce  the  desired  target  system  may  be  longer  than  with  lump 
sum  re-engineering. 

A  third  re-engineering  approach  is  partial  re-engineering.  Here,  only  a  portion  of  a  system  is 
re-engineered  instead  of  the  entire  system.  This  places  constraints  on  the  resulting  target  system, 
since  the  re-engineered  code  must  integrate  with  the  existing  non  re-engineered  system  code.  The 
range  of  flexibility  to  redesign  the  system  is  reduced  with  this  approach. 

Another  issue  in  modeling  the  re-engineering  process  is  how  to  incorporate  requirement  changes 
into  a  software  system.  Software  re-engineering  is  a  change  in  the  form  of  a  software  system,  not 
a  change  in  functionality.  However,  a  re-engineering  project  is  often  seen  as  a  good  time  to  change 
the  system  requirements.  This  introduces  the  full  range  of  forward  engineering  activities  into  a 
re-engineering  project.  The  problem  is  that  two  logically  separate  tasks  are  being  blended  into  one. 

One solution is to first re-engineer a system without changing the functionality and then
upgrade the system by changing the functionality of the target system. This means two systems are
produced. The first is an intermediate target system that is functionally equivalent to the existing
system. This intermediate system is then upgraded to reflect the necessary requirement changes. The
problem is the amount of time required to create the final target system.

Another  solution  is  to  blend  forward  engineering  and  re-engineering  together  into  one  process.  This  will
work  if  reverse  engineering  can  be  used  to  reconstruct  a  specification  of  the  system.  Requirements 
analysis  can  be  done  and  the  reconstructed  specification  changed.  Then  normal  forward  engineering 
practices  can  be  used.  However,  at  this  time  we  really  are  not  able  to  reconstruct  a  specification 
from  source  code  for  large  real  world  systems.  We  are  better  at  reconstructing  designs.  Should  a 
reconstructed  design  be  modified  to  incorporate  requirement  changes? 

These are some of the issues that must be addressed as research into software re-engineering
continues. More knowledge gleaned from re-engineering projects is needed. An understanding of
different re-engineering approaches is required. The answers to these issues will lead to a fuller
understanding of the software re-engineering process.

Appendix  A


Summary  of  Z  Notation 

Z  notation  is  based  on  typed  set  theory  and  first-order  logic. 

Schemas 

Z provides a construct called a schema to describe a specification’s state and operations. A
schema groups together variable declarations with a list of predicates that constrain the variables'
possible values. The predicates also state relationships between variables.

There  are  three  types  of  schema. 

Vertical  Schema 

+-- SchemaName ------------------
|  Variable declarations
+--------------------------------
|  Predicates on variables
+--------------------------------


Horizontal  Schema 

Schema  Name  =  [  Variable  declarations  |  predicates  ] 

Axiomatic  Schema 

Declaration 

Predicates 

The  axiomatic  schema  is  used  to  define  global  functions  and  constants. 
Other  notation  used  with  schemas  is  shown  below.

p? : T    States that p of type T is an input to an operation denoted by a schema.

p! : T    States that p of type T is an output of an operation denoted by a schema.

p         Denotes the value of p before an operation begins.

p'        Denotes the value of p after an operation is complete.

[x / t]   Substitute all occurrences of variable t with variable x.

Definitions  and  Declarations 

L  =  R  Defines  L  to  be  equivalent  to  R. 

v  :  T  Declares  v  to  be  of  type  T. 

Sets

∅         The empty set.
ℙ X       The powerset of set X. The set of all subsets of X.
𝔽 X       The set of finite subsets of X.
S ⊂ T     The set S is a subset of the set T, and S is not equal to T.
S ⊆ T     The set S is a subset of the set T or equal to T.
x ∈ S     The item x is an element of the set S.
x ∉ S     The item x is not an element of the set S.
S ∪ T     The union of sets S and T.
S ∩ T     The intersection of sets S and T.

Relations

x ↦ y     (x,y) is an element of a relation.
dom R     The domain of a relation.
ran R     The range of a relation.
S ◁ R     Domain restriction. The domain of relation R is restricted to the elements of set S.
S ⩤ R     Domain subtraction. The domain of relation R is restricted such that it does not
          contain any element from set S.
R₁ ⊕ R₂   Overriding. Given R₁, R₂ : X ↔ Y, R₁ ⊕ R₂ = (dom R₂ ⩤ R₁) ∪ R₂.

Functions

X ⇸ Y     The set of partial functions from X to Y.
X ⤔ Y     The set of one-to-one partial functions from X to Y.
          No two elements of X map to the same element of Y.
f(x)      The function f applied to the value x.
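
As an illustration of the relational operators summarized above, the following sketch models a relation as a Python set of (x, y) pairs. The function names are hypothetical and chosen here only for readability; the definition of overriding follows the identity given in the Relations table.

```python
def dom(r):
    """Domain of a relation: the set of first components."""
    return {x for x, _ in r}

def ran(r):
    """Range of a relation: the set of second components."""
    return {y for _, y in r}

def domain_restrict(s, r):
    """Keep only the pairs of r whose first component is in s."""
    return {(x, y) for x, y in r if x in s}

def domain_subtract(s, r):
    """Drop every pair of r whose first component is in s."""
    return {(x, y) for x, y in r if x not in s}

def override(r1, r2):
    """r1 overridden by r2: (dom r2 subtracted from r1) union r2."""
    return domain_subtract(dom(r2), r1) | r2

# Example: r2 replaces the mapping for "b" and adds one for "c".
r1 = {("a", 1), ("b", 2)}
r2 = {("b", 3), ("c", 4)}
updated = override(r1, r2)
```

Overriding is the usual way a Z operation schema expresses "update this one entry and leave the rest unchanged".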

PROBABILITY  MODELING  IN
AUTOMATIC  TARGET  RECOGNITION 


Dr.  R.  H.  Coffer 


1.0  BACKGROUND 

This is the latest in a series [7-11] of research efforts developing the theory for high performance
ATR systems. It is based on a theoretically provable, optimally accurate approach
to ATR - that of Bayesian model matching. The optimality of the Bayesian approach lies
in its theoretical ability to accumulate and rigorously fuse all-source evidence: incoming
multi-sensor imagery; target, scene, and sensor models; and collateral information.

Florida Institute of Technology (FIT) is primarily working on the advanced theoretical
issues shown in Figure 1. The Target Recognition Group (AARA) of Wright Laboratories
is studying the practical implementation issues.

The 1989 SFRP period successfully laid the keystone block of Bayesian probability
decomposition [7]. This continues to permit attack of individual subproblems with assurance
that the results can be fitted together. Chief of these subproblems is the determination of
joint probability density functions of targets, since these functions permit the most accurate
utilization of image evidence for automatic target recognition (ATR).

Reference  11  documents  very  successful  results  achieved  under  the  1989-90  RIP  effort 
toward  solving  the  LADAR  target  probability  characterization  problem. 

The  1990  SFRP  program  [10]  developed  considerable  insight  into  robustness  of  the  Bayesian
approach  to  modeling  errors.  It  also  developed  insights  into  the  IR  ATR  problem  including 
the  concepts  of  underlying  and  local  homogeneity  of  target  temperatures. 


2.0  THE  CURRENT  PROBLEM  BEING  RESEARCHED 

High  performance  recognition  of  targets  in  IR  imagery  continues  to  be  a  strong  goal  of 
AARA  and  thus  the  subject  of  this  current  research. 

The key to higher performance recognition is a detailed understanding of just how the
target can appear in the image [7], which necessitates careful modeling of the scene formation
and sensing processes.

An IR image is formed as a result of imaging the temperatures, Ts, of a scene (Figure 2). If
the target in the scene were to have constant temperature throughout, then its visual detail
within the IR image would disappear and the target would thus become impossible to recognize.

[Figure 1. Long Term Research Strategy. Panel labels: Bayesian ATR Development; Practice]


Fortunately, flows of heat to and from targets are continuously varying due to the diurnal
thermal cycle and the random vagaries of wind, cloud cover, air temperature, rain and
the temperature of the target’s surroundings. These external heat flows set up heat flows
within the target in accordance with the conduction equation


\[
\frac{\partial T}{\partial t} = \alpha \, \frac{\partial^2 T}{\partial z^2} \qquad \text{(EQ 1)}
\]


The resulting heat flows internal to the target cause corresponding variations of temperature
within the target. It is these variations of target temperature which provide recognizable
target detail in IR imagery.

Unfortunately,  much  of  the  flow  of  heat  to  and  from  the  target  is  random,  causing  random 
variations  in  target  temperature  and  thus  random  variation  of  the  visual  detail  of  targets 
within  IR  imagery.  This  results  in  a  problem  for  ATR.  Since  one  cannot  know  just  how
the  target  will  appear  in  the  IR  image,  how  can  it  be  recognized  with  any  certainty?  The 
answer,  rather  reasonably,  is  that  the  target  should  be  recognized  as  residing  in  any  IR 
image  that  itself  has  high  probability  of  arising  from  the  target. 
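
One way to make this decision rule concrete is Bayes' rule. The sketch below is a generic statement, not taken from the appendices of this report. The symbols are notation assumed here for illustration: $I$ is the observed IR image and $H_k$ is the hypothesis that target model $k$ is present (with $H_0$ meaning no target).

```latex
% Posterior probability of hypothesis H_k given the observed image I
P(H_k \mid I) = \frac{p(I \mid H_k)\, P(H_k)}{\sum_{j} p(I \mid H_j)\, P(H_j)}

% Decision: declare the hypothesis with the largest posterior
\hat{k} = \arg\max_{k} \; p(I \mid H_k)\, P(H_k)
```

The likelihood $p(I \mid H_k)$ is where the probability function of the target's surface temperatures enters: it scores how probable the observed image is if target $k$ produced it.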

The problem for current research now reduces to “How can the critical probability
function of the target’s surface temperatures be obtained?” Given the years of IR target modeling to
date, this is surprisingly still an open question. Study shows, however, that previous IR
image modeling efforts have concentrated on finding a target's IR appearance given full
knowledge of its thermal history. Since one will never have ‘full knowledge of the thermal
history’ of enemy targets on the battlefield, the utility of these prior efforts for ATR is
questionable.


Figure  2.  The  IR  Sensing  Process 


3.0  THE  RESEARCH  TASKING 

The  following  tasks  for  the  current  research  effort  were  originally  proposed  in  accordance 
with  guidance  from  AARA: 

•  Task  1  -  Determination  of  Underlying  Temperatures  of  IR  Targets 

Description:  Use  all  available  means  to  move  toward  meaningful  theoretical  modeling 
characterization  of  the  major  underlying  temperatures  of  IR  targets. 

•  Task  2  -  Characterization  of  Near  BIOT  Conditioning 

Description:  Theoretically  consider  the  development  of  the  probabilities  of  the  surface 
temperature  homogeneity  of  target  parts  under  near  BIOT  conditioning. 

•  Task  3  -  Develop  LADAR  Target  Detection  and  Recognition  Bounds 

Description:  Develop  ROC  and  recognition  confusion  matrix  bounds  on  LADAR  target 
detection  and  recognition,  respectively. 

As  the  current  research  effort  began,  AARA  had  already  begun  to  develop  code  for  ROC 
and  recognition  bounds  for  LADAR  targets  as  a  result  of  earlier  FIT  theory.  AARA  thus 
determined  that  FIT  would  act  as  a  consultant  on  demand  against  the  scope  of  task  3, 
while  placing  the  maximum  effort  against  tasks  1  and  2. 

As originally conceived, tasks 1 and 2 contained both experimental and theoretical
considerations. Since there was no reasonable avenue for the Government to provide the
necessary equipment for timely experimental investigations, it was jointly agreed that the
current effort at FIT should be totally theoretical in nature.


A multi-step theoretical approach was then undertaken which has produced much greater
success than could reasonably have been expected, given the distributed thermal memory of an IR
target. It has become possible to develop techniques for determining the joint probability
density function of IR target regions given only the statistical characterization of their
thermal environmental drivers. There is no recourse to Gaussian approximations or
restriction to first and second moments. Determination of these probability functions for
targets will find wide use in the field of IR ATR.

4.0  TECHNICAL  RESULTS 

The results of this research are sets of mathematical results gathered into appendices:
processes which can be used in a variety of situations to gain insight and derive probabilities
of target presence over a wide range of IR sensing and environmental situations.

The first set of results, Appendix A, transforms the thermal problem in solids, such as
metal, to a form more easily manipulated: that of resistor-capacitor passive electrical
circuits - either in distributed or lumped form. This action, for electrical engineers at least,
allows more intuitive consideration.
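
The analogy can be sketched for one-dimensional conduction as follows. This is the standard textbook correspondence, written in notation assumed here rather than taken from Appendix A: a slab of cross-sectional area $A$ and thickness $\Delta z$, with thermal conductivity $k$, density $\rho$ and specific heat $c_p$.

```latex
% Correspondence: temperature T <-> voltage, heat flow rate <-> current
R = \frac{\Delta z}{k A}, \qquad C = \rho\, c_p\, A\, \Delta z

% One lumped stage therefore has the time constant
R C = \frac{\rho\, c_p}{k}\, (\Delta z)^2 = \frac{(\Delta z)^2}{\alpha},
\qquad \alpha = \frac{k}{\rho\, c_p}

% Letting \Delta z \to 0 gives the distributed RC line, on which T obeys
\frac{\partial T}{\partial t} = \alpha\, \frac{\partial^2 T}{\partial z^2}
```

The lumped form is a chain of such $R$ and $C$ stages; the distributed form is the limit of infinitely many infinitesimal stages, which recovers the one-dimensional conduction equation.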

The second set of results, Appendix B, uses the tools of modern signal analysis to determine
target thermal responses to planar environmental drivers. Distributed system
responses to sinusoidal, step, and impulse inputs are found. These results enable immediate
use of Fourier and convolution methodologies to develop exact target thermal
responses to arbitrarily complex input planar drivers without recourse to finite element
approximations. Quantitative expressions for the speed of thermal propagation and attenuation
of the traveling signal in target solids are given. It is seen that the thermal wave dies
out quite rapidly, with an attenuation of 55 dB per wavelength. The wavelength of the thermal
wave is seen to be a function of the specific heat of the solid and the frequency of the
externally applied thermal driver. Examples are given which quantify target potentials for
signal propagation within targets over frequencies ranging from the daily diurnal cycle
to brief wind gusting.
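
The quoted attenuation can be checked against the classical solution for a semi-infinite solid whose surface temperature varies sinusoidally, in which the amplitude decays as exp(-z/d) with penetration depth d = sqrt(2*alpha/omega) and the wavelength is 2*pi*d. The sketch below assumes that classical solution (Appendix B's own derivation is not reproduced here); the diffusivity value and the 10 s gust period are illustrative assumptions, not figures from the report.

```python
import math

def thermal_wave(alpha, period_s):
    """Penetration depth (m), wavelength (m) and phase speed (m/s) of a
    sinusoidal thermal wave in a semi-infinite solid.

    alpha    -- thermal diffusivity k / (rho * c_p) in m^2/s
    period_s -- period of the surface temperature driver in seconds
    """
    omega = 2.0 * math.pi / period_s
    depth = math.sqrt(2.0 * alpha / omega)   # amplitude falls by 1/e per depth
    wavelength = 2.0 * math.pi * depth       # phase lags by 2*pi per wavelength
    speed = wavelength / period_s            # equals sqrt(2 * alpha * omega)
    return depth, wavelength, speed

# The amplitude ratio across one wavelength is exp(-2*pi) for any material.
ATTENUATION_DB_PER_WAVELENGTH = 20.0 * math.log10(math.exp(2.0 * math.pi))

ALPHA_ASSUMED = 1.2e-5  # m^2/s, assumed value of the order found for steels

for label, period in (("diurnal cycle", 86400.0), ("10 s wind gust", 10.0)):
    depth, wavelength, speed = thermal_wave(ALPHA_ASSUMED, period)
    print(f"{label}: wavelength = {wavelength:.4f} m, speed = {speed:.2e} m/s")
print(f"attenuation = {ATTENUATION_DB_PER_WAVELENGTH:.1f} dB per wavelength")
```

The attenuation evaluates to about 54.6 dB per wavelength, in line with the 55 dB figure quoted above. The material enters only through the diffusivity, which contains the specific heat.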

The third set of results, Appendix C, considers the thermal accuracy of lumped constant
modeling of solids, Appendix E, with respect to the exact results obtained in Appendix B.
Here it is seen that one lumped stage per 2.6 cm of target distance at brief wind gusting
frequencies, and one per 1/2 meter at diurnal frequencies, is more than sufficient to maintain
reasonable signal accuracy. These results are important for the remaining appendices, which
utilize lumped rather than distributed modeling.

The fourth set of results, Appendix D, is concerned with extension to spatially distributed
drivers over the target body, i.e. a thermal bath. This is achieved by spatial superposition
and by using the voltages (temperatures) of the capacitor analogs as state variables. This allows state
equation representation of arbitrarily large target solids while removing restrictions on
target homogeneity. Specific attention is given to developing a straightforward and mechanical
approach for reducing arbitrarily complex target solid models to the state equation
formulation.
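
A minimal sketch of such a state equation formulation is given below. It is an illustration under stated assumptions, not the procedure of Appendix D: a uniform chain of n lumped stages, each with resistance r and capacitance c, driven at one end by a single temperature source u(t) and insulated at the other end. The capacitor temperatures are the state variables, giving dx/dt = A x + b u. The function names and the forward-Euler integrator are choices made here for brevity.

```python
def ladder_state_matrices(n, r, c):
    """Return (A, b) for dx/dt = A x + b u on a uniform RC ladder.

    x[i] is the temperature (voltage analog) of capacitor i; the driver u
    feeds node 0 through one resistor and the last node is insulated.
    """
    tau = r * c
    a = [[0.0] * n for _ in range(n)]
    b = [0.0] * n
    for i in range(n):
        if i == 0:
            b[0] = 1.0 / tau               # heat flows in from the driver
        else:
            a[i][i - 1] = 1.0 / tau        # heat flows in from the left node
        a[i][i] -= 1.0 / tau               # matching loss toward the left
        if i < n - 1:
            a[i][i + 1] = 1.0 / tau        # exchange with the right node
            a[i][i] -= 1.0 / tau
    return a, b

def simulate(a, b, u, t_end, dt):
    """Integrate the state equations from x = 0 with forward Euler steps."""
    n = len(b)
    x = [0.0] * n
    for k in range(int(round(t_end / dt))):
        uk = u(k * dt)
        dx = [sum(a[i][j] * x[j] for j in range(n)) + b[i] * uk
              for i in range(n)]
        x = [x[i] + dt * dx[i] for i in range(n)]
    return x

# Unit step driver on a three-stage ladder with r = c = 1 (arbitrary units).
A, B = ladder_state_matrices(3, 1.0, 1.0)
early = simulate(A, B, lambda t: 1.0, 1.0, 0.1)    # shortly after the step
late = simulate(A, B, lambda t: 1.0, 100.0, 0.1)   # long after the step
```

Shortly after the step the nodes nearest the driver are warmest, and eventually every node settles at the driver temperature. Several drivers acting at different nodes would replace the vector b by a matrix with one column per driver.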

The fifth set of results, Appendix E, is devoted to obtaining the impulse response of the
target solid as represented by the state equations while responding to a spatially and
temporally diverse set of thermal drivers. The impulse response is found to be in matrix
form and is used to develop the matrix convolution equation capable of giving any joint
set of thermal outputs from the target solid in response to arbitrary thermal baths. This is a
key result, as analytic expressions of joint sets of outputs are required by the next two
appendices.

The sixth set of results, Appendix F, provides the key results of the current study: the
derivation of means of determining the joint probability density functions of target regions in
IR imagery, given only statistical knowledge of the thermal environmental drivers. This
is accomplished for all three cases: equal numbers of thermal drivers and IR target pixels,
more thermal drivers than IR target pixels, and fewer thermal drivers than IR
target pixels. The question, “Does one need to know the full statistical behavior of the
environmental driver set, or is some law of large numbers at work alleviating the need for
full statistical knowledge?” is answered for the first case. For equal numbers of thermal
drivers and IR target pixels, Appendix F shows one indeed needs the full statistical
knowledge. Appendix F then shows that for more thermal drivers than IR target pixels there is a “law of
projection” at work which includes the law of large numbers as a special case. Given the
“law of projection,” it is shown that the complexity of the joint probability
density function of the IR target pixels can be greatly reduced from that of the environmental drivers. Appendix
F also shows how to solve the case of more IR target pixels than environmental
drivers. Finally, this appendix shows extensions to the theory that are also useful in the design of ATR
systems, including incorporation of low sensor resolution and characterization of thermal
spatial gradients.
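
The first two cases can be illustrated with the standard change-of-variables result for a linear map. This is a simplified sketch under assumptions made here, not the derivation of Appendix F: the $m$ pixel temperatures $y$ are taken to be an instantaneous linear function $y = Hx$ of $n$ driver values $x$, whereas the actual target responds through a convolution over its thermal history. The symbols $H$, $G$ and $z$ are illustrative.

```latex
% Assumed linear map from n driver values x to m pixel temperatures y
y = H x, \qquad x \in \mathbb{R}^{n},\; y \in \mathbb{R}^{m}

% Case m = n with H invertible: change of variables
p_Y(y) = \frac{p_X\!\left(H^{-1} y\right)}{\lvert \det H \rvert}

% Case n > m: append n - m auxiliary outputs z so that G (whose first m
% rows are H) is invertible, then integrate the auxiliary outputs out
\begin{pmatrix} y \\ z \end{pmatrix} = G x, \qquad
p_Y(y) = \int p_X\!\left(G^{-1}\begin{pmatrix} y \\ z \end{pmatrix}\right)
         \frac{dz}{\lvert \det G \rvert}
```

In the square case every feature of $p_X$ carries over to $p_Y$, which is consistent with the need for full statistical knowledge of the drivers. In the $n > m$ case the integration averages over the directions the pixels do not observe, which is one way to read the reduction in complexity attributed above to the “law of projection.”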

The seventh and final set of results, Appendix G, considers the effects of nonlinear
coupling between the target and its surrounding environment on the above techniques for
deriving joint probability density functions. It gives means for deriving the IR target joint
probability density function under nonlinear environmental coupling. These new
techniques will require additional mathematical steps, but each step will be shorter and
potentially less complex.


5.0  CONCLUSIONS 

The current research, reported here, represents a significant step toward solving IR ATR
problems. Specifically, it overcomes the problem to date in IR model matching that the
complete thermal environment history is needed to derive a synthetic view of the target
model for ATR. Given only the statistics of the thermal environment drivers acting on the
target, techniques are shown that will mathematically give the joint probability of the
target IR region as imaged and as conditioned on specific target modeling and indexing.
Additionally, the mathematical techniques which are presented are theoretically capable
of solving nonlinear couplings to the environment. All of the resulting techniques avoid
the need to develop sample statistics via Monte Carlo simulation. This is fortunate since
Monte Carlo methods are inappropriate due to the high dimensionality of the problem.


6.0  RECOMMENDATIONS 

Recommendations based on the current research lie in three areas: extension of the theory,
application of the theory, and ATR systems implementation. Other strongly related
recommendations will be found in prior reports of this series.⁷,¹¹

Recommended  extensions  to  the  currently  developed  theory  include: 

•  Developing  statistics  of  the  thermal  environment.  Specifically  needed  are  the  spatial 
distribution  and  temporal  independencies  of  thermal  drivers. 

•  Further amplification of the concept of the “projection law” for joint probability
distribution functions. There is vanishingly little discussion in the statistical literature of
the degree of complexity that joint probability density functions can achieve in nature and
how this complexity is compressed out in moving to lower order signal spaces, yet it is
of first order importance to the ATR problem.

•  Further study of the entire concept of linearizing the problem under nonlinear
environmental couplings. Under what conditions does it make sense, how can the concept be
expanded, etc.?

•  Further  mathematics  should  be  developed  which  will  lead  to  algorithmic  methods  for 
handling  nonlinear  environmental  couplings  with  less  effort  and  more  clarity. 

•  Use of modern circuit synthesis techniques that provide accuracy with fewer “circuit”
components.

•  Conversion of all results to the distributed case by passing to the infinitesimal limit.

Recommended  applications  of  the  theory  include: 

•  Using the joint probability density function of IR target regions as features within
various formulations of ATR systems, including final evidence accumulation.

•  Using  the  joint  probability  density  functions  of  IR  target  regions  to  develop  bounds  on 
recognition  accuracy  achievable  by  any  ATR  system. 

•  Using  the  joint  probability  density  functions  of  IR  target  regions  to  effect  target-decoy 
discriminations. 

•  Continuing research toward a truly optimal IR ATR system based on Bayesian minimum
error of recognition concepts by researching the IR statistical characteristics of
background.

•  Merging the results of the current research with those of past efforts in this series to
better allow pixelwise sensor fusion.

•  Use  of  joint  probability  density  functions  to  aid  in  detecting  target  index  mismatch. 




•  Use of Biot inspired macro modeling to determine joint statistics of the underlying joint
temperature distributions of major target components.


Recommendations  for  further  research  into  system  implementation  issues  include: 

•  Determination of the crossover point, if any, at which the complexity of learning the
joint probability density function of target regions becomes less than the calculation
from first principles developed here.

•  Determination of the likely complexities of the joint probability density function of
target regions and their impact on ATR algorithmic accuracy vs. storage and training
requirements.

•  Study of efficiency gains due to approximation of the probability density function of
target regions at various levels of IR resolution.

Appendix  A

ELECTRICAL  ANALOGS  OF  THERMAL 

CONDUCTION 


A1.0  INTRODUCTION 


The experimental law for conductive heat transfer through solids is due to Fourier. The
law states that the amount of heat flowing through a surface per unit time is proportional to
the temperature gradient normal to the surface

$$\frac{dQ}{dt} = -k\,\Delta A\,\frac{d\tau}{dx} \tag{A1}$$

where Q is the quantity of heat in joules, t is time in seconds, k is conductivity in watts/m-C,
$\tau$ is the temperature in degrees C and the geometry is shown in Figure A1.


Figure A1 An Infinitesimal Volume Of A Solid


A2.0  CONVERSION  TO  AN  ELECTRICAL  CURRENT 


In order to convert the thermal conduction problem to one of electrical analogies, it is
convenient to express heat as a vector rate of flow per unit area, q

$$\mathbf{q} = \lim_{\Delta A \to 0} \frac{1}{(\Delta A)^2}\,\frac{dQ}{dt}\,\Delta\mathbf{A} \tag{A2}$$

This is a flux density corresponding to electrical current density.


A3.0  CONVERSION  TO  A  RESISTOR 


If a plane wave of heat transfer is assumed, so only one component of the vector, q, is
needed, then Eq. (A1) becomes

$$q = -k\,\frac{d\tau}{dx} \tag{A3}$$

or when rewritten

$$\frac{d\tau}{dx} = -\frac{1}{k}\,q \tag{A4}$$

one can make the conversion

$$R = \frac{v}{i} \tag{A5}$$

to give the electrical analog of a resistor, Figure A2.


Figure A2 Notation Relating To A Resistor


A4.0  CONVERSION  TO  A  CAPACITOR 

Consider Figure A1. The heat flowing in from the left is

$$\frac{d}{dt}Q_1(x) = q(x)\,\Delta A \tag{A6}$$

and the heat flowing out from the right is

$$\frac{d}{dt}Q_2(x+\Delta x) = q(x+\Delta x)\,\Delta A \tag{A7}$$

The difference between these two values represents the rate at which heat is being removed
from the volume, $\Delta V$, which is $S\rho\,\Delta V$ times the rate of change of temperature.
S is termed the specific heat per unit mass, i.e. a constant giving the ratio of the change in
heat stored within a given mass to the change in temperature, and $\rho$ is the density in
kilograms per cubic meter of the mass

$$\frac{d}{dt}Q(x+\Delta x) = q(x+\Delta x)\,\Delta A - q(x)\,\Delta A = -S\rho\,\Delta V\,\frac{d\tau}{dt} \tag{A8}$$

In passing to the limit

$$\frac{\partial q}{\partial x} = -S\rho\,\frac{\partial \tau}{\partial t} \tag{A9}$$

and upon rewriting

$$i = C\,\dot{v} \tag{A10}$$

one sees the relation to the electrical analog of a capacitor, Figure A3.


Figure A3 Notation Pertaining To A Capacitor


A5.0  CONVERSION  TO  CIRCUIT  ELEMENTS 

Eqs. (A5) and (A10) taken together give the low-pass RC filter, Figure A4, which is
equivalent to the thermal diagram of Figure A1.


Figure A4 The RC Filter Analog

To  develop  more  complex  circuits,  divide  the  solid  into  subvolumes  placing  a  grounded 
capacitor  in  each.  Then  place  a  resistor  between  capacitors  of  adjoining  subvolumes. 
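
The subdivision rule above can be sketched numerically. The following is a minimal sketch, assuming a plate of uniform material cut into equal subvolumes; the function names, the `area` parameter and the material values are illustrative assumptions and do not come from the report. It uses R = 1/k per unit length and C = Sρ per unit volume, as in Eqs. (A4) and (A9).

```python
def rc_stage(k, S, rho, dx, area=1.0):
    """Resistance and capacitance of one subvolume of thickness dx.

    k    : thermal conductivity, W/(m*C)
    S    : specific heat per unit mass, J/(kg*C)
    rho  : density, kg/m^3
    area : cross-sectional area normal to the heat flow, m^2 (assumed parameter)
    """
    R = dx / (k * area)        # series resistor between adjoining subvolumes
    C = S * rho * area * dx    # grounded capacitor of the subvolume
    return R, C


def rc_ladder(k, S, rho, thickness, n_stages, area=1.0):
    """Divide a plate into n_stages equal subvolumes; return a list of (R, C)."""
    dx = thickness / n_stages
    return [rc_stage(k, S, rho, dx, area) for _ in range(n_stages)]


# Illustrative, assumed steel-like values (not taken from the report).
ladder = rc_ladder(k=50.0, S=500.0, rho=7800.0, thickness=0.1, n_stages=10)
R, C = ladder[0]
print(len(ladder), R, C, R * C)  # stage count, ohm-analog, farad-analog, time constant in s
```

Each (R, C) pair corresponds to one stage of the cascade in Figure A5; nonuniform materials would simply use different property values per subvolume.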

As an example, consider planar heat flow through a plate. The equivalent circuit is shown
in Figure A5.

Figure A5 Example Conversion To The RC Filter Analog

Appendix  B

SIGNAL  ANALYSIS  OF  THERMAL 
CONDUCTION 

B1.0  INTRODUCTION 

Appendix A gave the electrical analog of thermal conduction in solids. As a first step to
the understanding of probabilistic heat flows, this appendix uses modern signal analysis
techniques to derive further understanding of the conducting heat flow signal itself.


B2.0  RESPONSE  TO  STEADY-STATE  ALTERNATING 
TEMPERATURES 

The target in the field is subjected to various alternating environmental drivers, the chief
of which is the diurnal thermal cycle. As a first step toward understanding, this section
considers the response of a flat target plate to an alternating sine wave of temperature, $T_1$,
in Figure A5. Since temperature cannot go negative, all signals of this section are with
respect to some average temperature.

Combining Eqs. (A5) and (A10) gives

$$\frac{\partial^2 v}{\partial x^2} = RC\,\frac{\partial v}{\partial t} \tag{B1}$$

As usual, let the instantaneous values of steady state alternating voltage and current be

$$v = \mathrm{Re}(V e^{j\omega t}) = V\cos\omega t \tag{B2}$$

$$i = \mathrm{Re}(I e^{j\omega t}) = I\cos\omega t \tag{B3}$$

which when substituted into Eq. (B1) gives at any angular frequency, $\omega$, the harmonic
equation

$$\frac{d^2 v}{dx^2} = j\omega RC\,v = \Gamma^2 v \tag{B4}$$

where $\Gamma$ is called the propagation constant. In terms of heat quantities $\Gamma$ is

$$\Gamma = \sqrt{\frac{j\omega S\rho}{k}} = \sqrt{\frac{\omega S\rho}{2k}}\,(1+j) \tag{B5}$$

So

$$\Gamma = \sqrt{\frac{\omega}{2D}}\,(1+j) \tag{B6}$$

where D is $k/(S\rho)$, the thermal diffusivity.

The well-known solution¹ to the harmonic equation is

$$v = A e^{-\Gamma x} + B e^{\Gamma x} \tag{B7}$$

Now, solving for i from Eq. (A4)

$$i = -\frac{1}{R}\frac{dv}{dx} = \frac{1}{R}\left(A\Gamma e^{-\Gamma x} - B\Gamma e^{\Gamma x}\right) = \sqrt{\frac{j\omega C}{R}}\left(A e^{-\Gamma x} - B e^{\Gamma x}\right) = \frac{1}{Z_0}\left(A e^{-\Gamma x} - B e^{\Gamma x}\right)$$

where $Z_0$ is the impedance of the solid. In thermal terms

$$Z_0 = \frac{1}{\sqrt{j\omega k S\rho}} = \frac{1-j}{\sqrt{2\omega k S\rho}} \tag{B8}$$

which shows that the flow of heat is always 45 degrees ahead of the temperature.

Assuming the solid plate is of infinite length, then B must be zero in Eq. (B7), otherwise
the $e^{\Gamma x}$ term would grow to infinity as $x \to \infty$. The $e^{\Gamma x}$ term comes
into play for finite plate length only in terms of signal reflections. Signal reflections will be
of little interest in the thermal case for reasons seen in Subsection B2.2.


Since B must be zero valued

$$v = A e^{-\Gamma x} \tag{B9}$$

$$i = \frac{A}{Z_0}\,e^{-\Gamma x} \tag{B10}$$

and of course

$$Z_0 = \frac{v}{i} \tag{B11}$$

Let $\Gamma$ be represented as

$$\Gamma = \alpha + j\beta \tag{B12}$$

where $\alpha$ is termed the attenuation constant and $\beta$ is the phase constant.
In electrical terms

$$\alpha = \beta = \sqrt{\frac{\omega RC}{2}} \tag{B13}$$

In ordinary circuit analysis, the time factor is frequently neglected (as above). Since in a
solid there are both time and space factors, it is best to include both explicitly. So from
Eqs. (B9) and (B11)

$$v e^{j\omega t} = A\,e^{-\alpha x} e^{j(\omega t - \beta x)}, \qquad i e^{j\omega t} = \frac{A}{Z_0}\,e^{-\alpha x} e^{j(\omega t - \beta x)} \tag{B14}$$

or when expressed in real terms, as a function of x and t

$$v(x,t) = A e^{-\alpha x}\cos(\omega t - \beta x), \qquad i(x,t) = \frac{A}{|Z_0|}\,e^{-\alpha x}\cos(\omega t - \beta x - \theta) \tag{B15}$$

where

$$Z_0 = |Z_0|\,e^{j\theta} \tag{B16}$$

1. An alternate solution form in terms of hyperbolic functions can be given,
$v = C\cosh\Gamma x + D\sinh\Gamma x$. This alternate formulation tends to destroy the meaning
for the thermal problem, and so will be avoided in this work.


B2.1  Propagation  Of  The  Thermal  Wave

To see the meaning of the time/distance relationship of the signal in the solid, concentrate
on the term $\cos(\omega t - \beta x)$ of Eq. (B15). Figure B1a shows a plot of the voltage
(temperature) across the normalized length, $\beta x$, of the solid for three different
normalized times, $\omega t = 0$, $\omega t = \pi/2$ and $\omega t = \pi$.

Figure B1b shows a plot of the voltage (temperature) as a function of normalized time,
$\omega t$, for three different normalized positions on the solid. Comparison of the two sets of
plots reveals that they are the same in shape.¹ Figure B1 is thus a picture of a traveling wave
of temperature launched across the solid. The wavelength in time is $2\pi/\omega$ and the
wavelength in space is $2\pi/\beta$.


Figure B1 Spatial and Temporal Snapshots Of The Traveling Wave: (a) voltage (temperature)
vs. normalized distance; (b) voltage (temperature) vs. normalized time

1. Actually this is a partial result of the use of an even function, the sinusoid. In general the
space and time plots are mirror images. This will be better seen when dealing with transients in
following sections.


The space wavelength in electrical terms is

$$\lambda = \frac{2\pi}{\beta} = 2\pi\sqrt{\frac{2}{\omega RC}} \tag{B17}$$

In thermal terms

$$\lambda = 2\pi\sqrt{\frac{2k}{\omega S\rho}} = 2\pi\sqrt{\frac{2D}{\omega}} \tag{B18}$$

The most important characteristic of this equation is that it shows the wavelength of thermal
propagation in solids to be a direct function of the driving (time) frequency. The
higher the driving frequency the faster the propagation and the shorter the wavelength.


The phase velocity, $v_p$, is the velocity at which the thermal wave is moving. It can be
found from setting $\omega t - \beta x$ in Eq. (B15) to zero giving

$$v_p = \frac{dx}{dt} = \frac{\omega}{\beta} \tag{B19}$$

In electrical terms

$$v_p = \sqrt{\frac{2\omega}{RC}} \tag{B20}$$

and in thermal terms

$$v_p = \sqrt{\frac{2\omega k}{S\rho}} = \sqrt{2\omega D} \tag{B21}$$
B2.2  Attenuation  Of  The  Thermal  Wave 

In Eq. (B15), the envelope term of the thermal wave is $e^{-\alpha x}$. To see how much the
thermal wave is reduced in amplitude as it travels across the solid, consider that at x the
voltage is

$$v_x = c\,e^{-\alpha x} \tag{B22}$$

and at x + l, it is

$$v_{x+l} = c\,e^{-\alpha (x+l)} \tag{B23}$$

The ratio of these two envelopes is

$$\frac{v_{x+l}}{v_x} = e^{-\alpha l} \tag{B24}$$

for any x. This gives an attenuation of

$$\alpha l = \log_e\frac{v_x}{v_{x+l}} \tag{B25}$$

in nepers per length l. The equivalent attenuation of the traveling wave in dB is

$$20\log_{10}\frac{v_x}{v_{x+l}} = (20\log_{10}e)\,\alpha l \tag{B26}$$

Taking l to be one wavelength gives

$$\alpha l = \alpha\lambda = 2\pi \tag{B27}$$

which is a very large attenuation: $2\pi$ nepers or 55 dB per wavelength. Thus in just one
wavelength the absolute attenuation is a factor of about 535 ($e^{2\pi}$). Even at one half
wavelength the absolute attenuation is quite high, a factor of about 23 ($e^{\pi}$). As shown in
Eq. (B18), the absolute distances for these levels of attenuation depend on the driving (time)
frequency and the thermal diffusivity of the solid. Figure B2 gives representative values for
one wavelength of distance.
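
The wavelength and attenuation relations of this section are easy to check numerically. The following is a minimal sketch, assuming Eqs. (B18), (B19), (B26) and (B27) as stated above; the function names and the diffusivity value used for steel are illustrative assumptions (the value is chosen only because it roughly reproduces the one-day steel entry of Figure B2).

```python
import math


def thermal_wave(D, period_s):
    """Wavelength (m) and phase velocity (m/s) of a thermal wave.

    D        : thermal diffusivity k/(S*rho), m^2/s
    period_s : period of the sinusoidal driver, s
    """
    omega = 2.0 * math.pi / period_s
    beta = math.sqrt(omega / (2.0 * D))   # phase constant; alpha equals beta
    return 2.0 * math.pi / beta, omega / beta


def attenuation(alpha_l):
    """Convert an attenuation in nepers to (dB, absolute amplitude ratio)."""
    return 20.0 * math.log10(math.e) * alpha_l, math.exp(alpha_l)


D_STEEL = 1.0e-5  # assumed diffusivity for steel, m^2/s (not stated in the report)
wavelength, v_p = thermal_wave(D_STEEL, 24 * 3600.0)
db_full, ratio_full = attenuation(2.0 * math.pi)   # one wavelength
db_half, ratio_half = attenuation(math.pi)         # one half wavelength

print(f"steel, 1-day driver: wavelength = {wavelength:.2f} m, v_p = {v_p:.2e} m/s")
print(f"one wavelength:  {db_full:.1f} dB, factor {ratio_full:.0f}")
print(f"half wavelength: {db_half:.1f} dB, factor {ratio_half:.1f}")
```

Because the attenuation per wavelength is fixed at 2π nepers, only the wavelength itself depends on the material and the driving frequency.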


Type Solid    Time Wavelength
              1 Day     1 Hour    1 Minute   1 Second
Copper        10.4 m    2.1 m     27 cm      3.5 cm
Steel         3.3 m     67 cm     8.6 cm     1.1 cm
Concrete      1 m       21 cm     2.7 cm     0.35 cm

Figure B2 One Wavelength In Various Solids And At Various Driving Frequencies

Appendix  C

ACCURACY  CONSIDERATIONS  IN 
LUMPED  CONSTANT  MODELING 

C1.0  INTRODUCTION 

Appendix  A  developed  electrical  analogs  to  the  thermal  conduction  process  that  can  be 
utilized  in  the  distributed  or  discrete  (lumped  constant)  case.  Appendix  B  developed  signal 
responses  to  a  variety  of  environmental  drivers  based  on  the  distributed  analog  model. 
These responses are therefore exactly those of the thermal conduction processes since no
approximations have been made. On the other hand, Appendix D through Appendix G are
based on the discrete (lumped constant) form of the analog model. This raises accuracy
considerations.  The  lumped  constant  model  must  be  made  fine  grained  enough  to  prevent 
significant  error.  This  appendix  develops  the  necessary  theory  to  make  direct  comparisons 
against  the  exact  results  of  Appendix  B. 


C2.0  THE  APPROACH 

A  lumped-constant  analog  model  for  a  planar  traveling  wave  in  the  solid  of  Figure  A5  is 
seen  to  be  an  iterative  structure,  i.e.  a  cascade  of  identical  stages.  The  solid  has  the  same 
characteristic  impedance,  given  by  Eq.  (B8)  at  every  point  along  its  length.  The  iterative 
analog  in  Figure  A5  also  has  a  constant  repetitive  characteristic  impedance  at  the  output  of 
each  of  its  stages. 

As the number of stages of the analog rises per unit length of the modeled solid, the
characteristic impedance and other parameters of the analog converge to those of the solid
itself. Thus a reasonable goal is to determine the number of iterative stages required per
unit length to achieve a desired fidelity to the actual parameters of the solid itself.


C3.0  PARAMETERS  OF  THE  ITERATED  STRUCTURE 

In order to determine the values of the parameters of the iterated analog structure of Figure
A5, it is easier to pass to the T form, Figure C1, of the RC filter section prior to performing
calculations. R can be taken as the resistance per meter of the solid divided by the number
of iterative stages per meter. C can similarly be taken as the capacitance per meter of the
solid divided by the number of iterative stages per meter.

C3.1  The  Iterative  Characteristic  Constant 

When the T filter is properly terminated, its input impedance, $Z_0'$, is equal to its load
impedance. Solving for this case gives the iterative characteristic impedance, $Z_0'$, of the
stage.

Figure C1 Equivalence Of The L And T Cascade: (b) The T form of the cascaded RC filter

C3.2  The  Iterative  Propagation  Constant 

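$\Gamma'$, the propagation constant of each iterative stage, can with sufficient manipulation
be found as

$$\Gamma' = \log_e\!\left(1 + \frac{R}{2Z_C} + \frac{Z_0'}{Z_C}\right) \tag{C3}$$

where $Z_C = 1/(j\omega C)$ denotes the impedance of the shunt capacitor.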


C4.0  A  FIRST  ORDER  ACCURACY  ANALYSIS 

The thermal wave attenuates by 55 dB in one wavelength in a solid (Appendix B) and
becomes insignificant after that point. A sufficient number of discrete lumped constant
stages should then be used in the first wavelength in the electrical analog to ensure good
fidelity to attenuation and phase of the true traveling thermal wave of the actual solid. A
first order analysis, carried out here, indicates that relatively few stages per unit length are
sufficient.

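$Z_0'$ can be converted to the form

$$Z_0' = Z_0\sqrt{1 + \frac{(\Gamma\Delta x)^2}{4}} \tag{C4}$$

where $\Gamma$ is the actual propagation constant for the solid and $\Delta x$ is the number of
meters per iterative analog T filter stage. Since $Z_0'$ should be close in value to $Z_0$,
$(\Gamma\Delta x)^2/4$ should be selected to be much less in value than 1, or

$$\Delta x \ll \frac{2}{|\Gamma|} = 2\sqrt{\frac{D}{\omega}} \tag{C5}$$

Assuming this is so, then expanding the square root in Eq. (C4) by the binomial expansion,
and retaining the first two significant terms is sufficient

$$Z_0' = Z_0\left(1 + \frac{(\Gamma\Delta x)^2}{8}\right) \tag{C6}$$

Substituting Eq. (C6) into Eq. (C3), using the significant terms of a series expansion for
the natural logarithm, and after considerable further manipulation

$$\Gamma' = \Gamma\Delta x - \frac{1}{24}(\Gamma\Delta x)^3 + \cdots \tag{C7}$$

which reduces with sufficient accuracy to

$$\Gamma' = (\Gamma\Delta x)\left(1 - \frac{1}{24}(\Gamma\Delta x)^2 + \cdots\right) \tag{C8}$$

Since $\Gamma'$ is the propagation constant of an iterative stage, let $\Gamma''$ be the
propagation constant for n sections of the iterative cascade of stages

$$\Gamma'' = n\Gamma' \tag{C9}$$

and so

$$\Gamma'' = \Gamma n\Delta x\left(1 - \frac{(\Gamma\Delta x)^2}{24} + \cdots\right) \tag{C10}$$

Since from Eq. (B12)

$$\Gamma = \alpha + j\beta = \sqrt{\frac{\omega RC}{2}}\,(1+j) \tag{C11}$$

then

$$\alpha'' = n\Delta x\,\alpha\;\mathrm{Re}\!\left((1+j)\left(1 - \frac{(\Delta x\,\alpha)^2(1+j)^2}{24} + \cdots\right)\right) \tag{C12}$$

$$\beta'' = n\Delta x\,\alpha\;\mathrm{Im}\!\left((1+j)\left(1 - \frac{(\Delta x\,\alpha)^2(1+j)^2}{24} + \cdots\right)\right)$$

This reduces to

$$\alpha'' = n\Delta x\,\alpha\left(1 + \frac{(\Delta x\,\alpha)^2}{12}\right) \tag{C13}$$

$$\beta'' = n\Delta x\,\beta\left(1 - \frac{(\Delta x\,\beta)^2}{12}\right)$$

Error in attenuation over the n sections is best expressed as a ratio

$$\frac{\alpha''}{n\Delta x\,\alpha} = 1 + \frac{(\Delta x\,\alpha)^2}{12} \tag{C16}$$

and error in radian phase over the n sections is best expressed by subtraction

$$\beta'' - n\Delta x\,\beta = -\frac{n(\Delta x\,\beta)^3}{12} \tag{C17}$$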


C4.1  Interpretation 

Reflection on Eqs. (C4), (C16) and (C17) will show that it is harder to maintain reasonable
phase error than reasonable attenuation or characteristic impedance error. Since the
traveling thermal wave will have decayed to -55 dB over a single wavelength, there is little
need to consider the phase error over longer distances. A quite conservative allowable
phase error is 36 degrees over this single wavelength, thus

$$\frac{n(\Delta x\,\beta)^3}{12} = 36\cdot\frac{2\pi}{360} \tag{C18}$$

Setting the number of filter stages per wavelength

$$\lambda = n\Delta x \tag{C19}$$

Eq. (C18) reduces to

$$\lambda(\Delta x)^2\beta^3 = 7.54 \tag{C20}$$

Since

$$\beta = \frac{2\pi}{\lambda} = \sqrt{\frac{\pi f}{D}} \tag{C21}$$

where $f = \omega/2\pi$ is the driving frequency in cycles per second, Eq. (C20) further
reduces to

$$\Delta x = 0.618\sqrt{\frac{D}{f}} \tag{C22}$$

For a target of mild steel, then a conservative lower length per iterative stage, in meters, is

$$\Delta x = \frac{0.002}{\sqrt{f}} \tag{C23}$$
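
The chain from Eq. (C18) to Eq. (C23) can be reproduced in a few lines. The following is a minimal sketch, assuming the phase-error budget of Eq. (C18) with nΔx equal to one wavelength; the function name and the diffusivity value for mild steel are assumptions (the value is back-figured so that Eq. (C22) gives roughly the 0.002 coefficient of Eq. (C23)).

```python
import math


def stage_length(D, f_hz, max_phase_error_deg=36.0):
    """Length per iterative stage (m) for a given phase-error budget.

    Uses n*(dx*beta)**3/12 = error with n*dx = 2*pi/beta, so that
    (dx*beta)**2 = 12*error/(2*pi), together with beta = sqrt(pi*f/D).
    """
    error_rad = math.radians(max_phase_error_deg)
    dx_beta = math.sqrt(12.0 * error_rad / (2.0 * math.pi))
    beta = math.sqrt(math.pi * f_hz / D)
    return dx_beta / beta


D_MILD_STEEL = 1.05e-5  # assumed diffusivity, m^2/s (not stated in the report)

dx_3min = stage_length(D_MILD_STEEL, 1.0 / 180.0)     # one cycle per three minutes
dx_day = stage_length(D_MILD_STEEL, 1.0 / 86400.0)    # one cycle per day
coefficient = stage_length(1.0, 1.0)                  # the 0.618 of Eq. (C22)

print(f"coefficient = {coefficient:.3f}")
print(f"3-minute driver: {100 * dx_3min:.1f} cm per stage")
print(f"1-day driver:    {dx_day:.2f} m per stage")
```

Tightening `max_phase_error_deg` shortens the stage length as the square root of the allowed error, so the result is not very sensitive to the exact budget chosen.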


C4.2  Conclusions 

If the highest frequency environmental driver of interest is on the order of one cycle per
three minutes, then for mild steel there is no need to insert an iterative stage more often
than one per 2.6 cm. Were the highest frequency of interest on the order of one cycle per day,
for mild steel there would be no need to insert an iterative stage more often than one per
half meter. These figures are quite conservative. Further analysis would most likely double
or triple the minimum required spacings.

Appendix  D

STATE  REPRESENTATION  OF  THE 
HEAT  CONDUCTION  NETWORK 

D1.0  INTRODUCTION 

Appendix  B  determined  the  thermal  traveling  wave  responses  of  an  infinite  length  solid  to 
a  single  planar  deterministic  thermal  driver.  This  appendix  is  devoted  to  a  formulation 
which  is  to  be  used  in  Appendix  E  to  allow  determination  of  the  target’s  response  to  an 
arbitrarily  large  number  of  arbitrarily  placed  and  complex  deterministic  thermal  drivers. 

This  appendix  also  provides  a  straightforward  and  mechanical  method  of  converting  the 
RC  electrical  analogs  of  target  thermal  conduction  to  the  new  formulation. 


D2.0  THEORY 

The general form of the state equations of a continuous deterministic system can be given
as:

$$x(t) = f(x(t_0), v(t_0,t)) \tag{D1}$$

$$y(t) = g(x(t_0), v(t_0,t)) \tag{D2}$$

In those cases where this continuous deterministic system can be given as a set of linear
ordinary differential equations, e.g. thermal conduction, the state equations become:

$$\dot{x}(t) = A(t)x(t) + B(t)v(t) \tag{D3}$$

$$y(t) = C(t)x(t) + D(t)v(t) \tag{D4}$$

Finally when the system is time-invariant, i.e. has constant thermal properties, the state
equations can be given as:

$$\dot{x}(t) = Ax(t) + Bv(t) \tag{D5}$$

$$y(t) = Cx(t) + Dv(t) \tag{D6}$$

as illustrated in Figure D1.

Figure D1 The General State Representation


D3.0  REDUCTION  OF  HEAT  CONDUCTION  NETWORKS 
TO  STATE  MATRIX  REPRESENTATIONS 

As discussed in Appendix A, the heat conduction network is equivalent to an electrical
resistor capacitor network. Without loss of generality, this resistor capacitor network can
be represented as shown in Figure D2.

Let 


x(t)  = 


v(r) 


*/+i 
*/  +  2 


(D7) 


If all voltage and current sources are set to zero, i.e. respectively short-circuited and
open-circuited, and a unit voltage is supplied in series with the j'th capacitor at time $t_0$,
then let the current at $t_0^+$ in capacitor i be noted as $i_{Cij}(t_0^+)$. Since capacitors
initially appear as short-circuits, $i_{Cij}(t_0^+)$ can be computed as shown in Figure D3.
Since $i_{Cij}(t_0^+)$ is a current through the i'th capacitor, then
$i_{Cij}(t_0^+) = C_i\,\dot{x}_i(t_0^+)$.

Under these conditions, Eq. (D5) reduces to

$$\dot{x}_i(t) = \sum_{j=1} a_{ij}x_j \tag{D8}$$

and further to

$$\dot{x}_i(t) = a_{ij}x_j \tag{D9}$$

giving

$$a_{ij} = \frac{i_{Cij}(t_0^+)}{C_i} \tag{D10}$$


Note that computation of the $a_{ij}$'s requires only straightforward and mechanical analysis
of the purely resistive network. This leads to ease of conceptualizing, but for large
problems other more efficient methods may be desired.


Computation of the B matrix is conducted along similar lines. For each voltage source, $v_j$,
let its value be 1, open its shorting switch, and leave all other switches in the position of
Figure D3. Then

$$b_{ij} = \frac{i_{Cij}(t_0^+)}{C_i} \tag{D11}$$

For each current source, $i_j$, let its value be 1, close its opening switch, and leave all other
switches in the position of Figure D3. Then

$$b_{ij} = \frac{i_{Cij}(t_0^+)}{C_i} \tag{D12}$$


The C matrix is the observation matrix of temperatures of the conducting material. Each
pixel, y, of Y can be considered to be measuring a summation of target surface temperatures,
x. Thus the C matrix can be found by elementary analysis of the sensing environment.


The D matrix is the observation matrix of the set of thermal drivers. One can safely set it
to 0 for the far IR case. For the near IR case, it is an analyzable function of the thermal
drivers.


D4.0  AN  EXAMPLE 

Assume that a thermally conducting target can be represented by the resistor-capacitor
circuit of Figure D4, where the v's are thermal flows from the environment to the target.
Then Eq. (D5) takes a form in which the A matrix is Toeplitz and the B matrix is the
identity matrix.

Figure D4 An Example Thermal Network


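Now assume that each far IR imagery sensor pixel, $y_i$, covers $x_{2i-1}$, $x_{2i}$ and
$x_{2i+1}$ with weights 0.2, 0.6 and 0.2 respectively. Then Eq. (D6) becomes

$$\begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix} =
\begin{bmatrix}
0.2 & 0.6 & 0.2 & 0 & 0 & 0 \\
0 & 0 & 0.2 & 0.6 & 0.2 & 0 \\
0 & 0 & 0 & 0 & 0.2 & 0.6
\end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5 \\ x_6 \end{bmatrix} \tag{D14}$$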
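
The observation matrix of Eq. (D14) follows mechanically from the stated pixel footprint. The following is a minimal sketch, assuming the 0.2, 0.6, 0.2 weighting over $x_{2i-1}$, $x_{2i}$, $x_{2i+1}$ given above and simply dropping any weight that falls past the last state; the function names and the sample temperatures are illustrative assumptions.

```python
def pixel_matrix(n_pixels, n_states, weights=(0.2, 0.6, 0.2)):
    """Build the C matrix: pixel i covers states 2i-1, 2i, 2i+1 (1-based)."""
    C = [[0.0] * n_states for _ in range(n_pixels)]
    for i in range(1, n_pixels + 1):
        for offset, w in zip((-1, 0, 1), weights):
            col = 2 * i + offset              # 1-based state index
            if 1 <= col <= n_states:          # weights beyond the last state are dropped
                C[i - 1][col - 1] = w
    return C


def observe(C, x):
    """Evaluate y = C x for a state (temperature) vector x."""
    return [sum(c * xi for c, xi in zip(row, x)) for row in C]


C = pixel_matrix(3, 6)
for row in C:
    print(row)

# Hypothetical uniform temperature offset of 10 degrees on every node.
y = observe(C, [10.0] * 6)
print(y)
```

The last pixel sees only 0.8 of the uniform temperature because its third footprint element lies beyond $x_6$, which is exactly the truncated third row of Eq. (D14).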


The D matrix is the 0 matrix, since in the far IR there is no direct pass-through of the
environmental thermal drivers.

Appendix  E

THE  THERMAL  IMPULSE  RESPONSE 

FUNCTION 


E1.0  RATIONALE

Appendix D developed the state differential equations for heat conduction networks,
implicitly providing a solution for the output, Y.

For cases where the environmental driving thermal forces are not functions of the state
variables, X, it is possible to derive the solution for the output, Y, directly in terms of
the input driver vector, V

$$Y(t) = C(t)\Phi(t,t_0)X(t_0) + \int_{t_0}^{t}\left[C(t)\Phi(t,\tau)B(\tau) + D(t)\delta(t-\tau)\right]V(\tau)\,d\tau \tag{E1}$$

where

$$\Phi(t) = e^{At} = I + At + \frac{1}{2!}A^2t^2 + \frac{1}{3!}A^3t^3 + \cdots \tag{E2}$$

This formulation is important, since Appendix F uses it as the key to developing the
probability of thermal responses to collections of random environmental drivers.

The section below provides further simplification of Eq. (E1) under reasonable ATR target
conditions.

E2.0  SIMPLIFICATIONS

$\Phi(t)$ is known in the literature as the fundamental matrix or the transition matrix. For
known values of t, it can be calculated to any required precision by computer matrix
operations.¹ Other faster numeric methods are based on Sylvester’s theorem and the
Cayley-Hamilton technique.² Laplace transform based symbolic methods include the
frequency-domain and the transfer function method.² Symbolic mathematics methods include
manipulation of the underlying differential equations, and recognition of infinite sums.³
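
The direct numerical route, summing the series of Eq. (E2) with matrix operations, can be sketched as follows. This is a minimal illustration in plain Python, not an efficient or numerically robust method for large or stiff networks; the function names and the number of retained terms are assumptions. The 2 x 2 matrix is the one that appears in the example of Appendix F, and the comparison value uses the closed form that this particular matrix admits.

```python
import math


def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def transition_matrix(A, t, terms=40):
    """Truncated power series I + At + (At)^2/2! + ... for the matrix exponential."""
    n = len(A)
    phi = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in phi]
    for k in range(1, terms):
        term = mat_mul(term, A)
        term = [[t * a / k for a in row] for row in term]   # now A^k t^k / k!
        phi = [[p + q for p, q in zip(pr, tr)] for pr, tr in zip(phi, term)]
    return phi


A = [[-0.1, 0.1], [0.1, -0.1]]
phi = transition_matrix(A, 5.0)

# For this A the off-diagonal element has the closed form (1 - exp(-0.2 t)) / 2.
expected = 0.5 * (1.0 - math.exp(-0.2 * 5.0))
print(phi[0][1], expected)
```

With the far IR choice of C as an identity matrix, the impulse response matrix H(t) = CΦ(t) is the same array.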

In the case of far IR, D(t) is a 0 matrix, giving

$$Y(t) = C(t)\Phi(t,t_0)X(t_0) + \int_{t_0}^{t} C(t)\Phi(t,\tau)B(\tau)V(\tau)\,d\tau \tag{E3}$$

For heat conduction networks with time-invariant parameters, e.g. metal, this further
reduces to

$$Y(t) = C\Phi(t-t_0)X(t_0) + \int_{t_0}^{t} C\Phi(t-\tau)B(\tau)V(\tau)\,d\tau \tag{E4}$$

Under normal conditions, there is sufficient thermal interaction between targets and their
surrounding environments for $t \gg t_0$ to allow

$$Y(t) = \int_{t_0}^{t} C\Phi(t-\tau)B(\tau)V(\tau)\,d\tau \tag{E5}$$

Letting $H(t) = C\Phi(t)$, termed the impulse response matrix of the network, gives

$$Y(t) = \int_{t_0}^{t} H(t-\tau)B(\tau)V(\tau)\,d\tau \tag{E6}$$

the matrix convolution solution for the network output, Y(t), in terms of the network input,
V(t).

The discrete time version of Eq. (E6) is

$$Y_i = \sum_{j=1}^{n} H_{i-j}B_jV_j \tag{E7}$$

Appendix  F

DERIVING  THE  JOINT  PROBABILITY 
DENSITY  FUNCTION  FOR  THERMAL 

NETWORKS 

F1.0  INTRODUCTION

The work of Appendix A through Appendix E culminates in Eq. (E1). Eq. (E1) gives any
desired set of thermal conduction outputs as a weighted summation of the amplitude of the
thermal input drivers to the target. This appendix extends this deterministic characterization
into the ATR probabilistic realm where the exact history of the target's thermal
environment cannot be known.

F2.0  NOTATION 

First, the notation must be simplified. For our purposes, let i be the discrete time index
corresponding to the time that the IR sensor takes its image. Since it is well understood, we
can drop it from Eq. (E7) to give

$$Y = \sum_{j=1}^{n} H_{-j}B_jV_j \tag{F1}$$

Letting $G_j = H_{-j}B_j$ gives

$$Y = \sum_{j=1}^{n} G_jV_j \tag{F2}$$

Thus Y is seen to be a weighted linear summation of all previous environmental input
drivers; that is, all of the individual elements of all $V_j$ vectors.

To make this more obvious and to prepare for further work, let $k = (j-1)m + i$, where
m is the size of the V vector. Thus let $v_k$ be the i'th element of $V_j$ of Eq. (F2). This
action makes a single input vector out of the many time sequenced input vectors. Upon proper
substitution of the $g_{ik}$, this gives the set of equations

$$y_i = \sum_{k=1}^{mn} g_{ik}v_k \tag{F3}$$

for $1 \le i \le l$, where l is the size of the output vector, Y.

Each output thermal response of a solid is now easily seen to be a weighted summation of
all of the instantaneous drivers operating over the body of the target.

Expressing Eq. (F3) in matrix form gives

$$Y = GV \tag{F4}$$


F3.0  A  FIRST  SOLUTION 


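Let

$$l = mn \tag{F5}$$

Then

$$V = G^{-1}Y \tag{F6}$$

i.e. the inputs are linear functions of the outputs. Let each element of V be written as

$$v_i = g_i(Y) \tag{F7}$$

If the joint probability density function of the set of input thermal drivers,
$v_1, v_2, \ldots, v_{mn}$, conditioned on a set of target conditions, H, is

$$p(v_1, v_2, \ldots, v_{mn} \mid H) \tag{F8}$$

then the joint probability density function of the set of output target observables, Y, is

$$|J|\,p(g_1(Y), g_2(Y), \ldots, g_{mn}(Y) \mid H) \tag{F9}$$

where J is the Jacobian of the transformation of V to Y, i.e. the determinant

$$J = \begin{vmatrix}
\dfrac{\partial v_1}{\partial y_1} & \dfrac{\partial v_1}{\partial y_2} & \cdots & \dfrac{\partial v_1}{\partial y_{mn}} \\
\dfrac{\partial v_2}{\partial y_1} & \dfrac{\partial v_2}{\partial y_2} & \cdots & \dfrac{\partial v_2}{\partial y_{mn}} \\
\vdots & \vdots & \ddots & \vdots \\
\dfrac{\partial v_{mn}}{\partial y_1} & \dfrac{\partial v_{mn}}{\partial y_2} & \cdots & \dfrac{\partial v_{mn}}{\partial y_{mn}}
\end{vmatrix} \tag{F10}$$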


F3.1  An  Interpretation 

Eqs. (F9) and (F10) form the first truly important results of this research. When the number
of time sequenced random environmental driver variables is equal to the number of
pixels being observed, then the probability density of the imaged pixels is a simple
distortion of the joint pdf of the inputs:

•  Each axis of the original probability density function is linearly rescaled. As the input
variables recede in time from the time of imaging, their axis lengths are reduced.

•  Each value of the original probability density function is rescaled by the Jacobian, J, to
ensure that the output pdf integrates to 1.

F3.2  An  Example 

For illustrative purposes, consider a small scale problem. Let the target region be the
thermal conduction network shown in Figure F1.


Figure F1 A Small Example Conduction Network


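Then the state equation, Eq. (D5),

$$\dot{x}(t) = A x(t) + B v(t)$$

can be given as

$$\begin{bmatrix} \dot{x}_1(t) \\ \dot{x}_2(t) \end{bmatrix} =
\begin{bmatrix} -0.1 & 0.1 \\ 0.1 & -0.1 \end{bmatrix}
\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} +
\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}
\begin{bmatrix} v_1 \\ v_2 \end{bmatrix}$$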


where the v's are environmental driver flows, i.e. currents rather than voltages in the
electrical analog of Figure F1.

Taking an IR image at time T, let the two pixels on the target be y1(T) and y2(T). Then the
state observation equation, Eq. (D6), can be written in discrete time form as

$$y(t) = C x(t) + D v(t)$$

and given as

$$\begin{bmatrix} y_1(t) \\ y_2(t) \end{bmatrix} =
\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}$$


This leads to the impulse response matrix

$$\Phi(t) = \begin{bmatrix} 1 - c(t) & c(t) \\ c(t) & 1 - c(t) \end{bmatrix} \qquad \text{(F15)}$$

where

$$c(t) = 0.1t - 0.01t^2 + 0.000666t^3 - 0.0000333t^4 + \cdots \qquad \text{(F16)}$$

c(t) is plotted in Figure F2.
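
As a quick numerical check of Eq. (F16), the sketch below compares the four printed series terms with the closed form (1 - exp(-0.2t))/2. That closed form is not stated in this appendix; it is derived here from the eigenvalues 0 and -0.2 of the example A matrix and should be read as an assumption of this sketch.

```python
import math


def c_series(t):
    """First four terms of the series printed in Eq. (F16)."""
    return 0.1 * t - 0.01 * t**2 + 0.000666 * t**3 - 0.0000333 * t**4


def c_closed(t):
    """Assumed closed form of c(t) for the example network: (1 - exp(-0.2 t)) / 2."""
    return 0.5 * (1.0 - math.exp(-0.2 * t))


def phi(t):
    """Impulse response matrix of Eq. (F15), built from the closed-form c(t)."""
    c = c_closed(t)
    return [[1.0 - c, c], [c, 1.0 - c]]


for t in (0.5, 1.0, 2.0):
    print(t, c_series(t), c_closed(t))
```

The closed form also shows the long-time behaviour used later in the example: c(t) tends to 0.5, so both rows of the impulse response matrix tend to [0.5, 0.5].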


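Now assume that there are discrete heat inputs at times T1 and T2.

$$v(t) = \begin{bmatrix} u_1 \delta(t - T_1) \\ 0 \end{bmatrix} +
\begin{bmatrix} 0 \\ u_2 \delta(t - T_2) \end{bmatrix} \qquad \text{(F17)}$$

Making appropriate substitutions in the above equations

$$\begin{bmatrix} y_1(T) \\ y_2(T) \end{bmatrix} =
\Phi(T - T_1) \begin{bmatrix} u_1 \\ 0 \end{bmatrix} +
\Phi(T - T_2) \begin{bmatrix} 0 \\ u_2 \end{bmatrix} \qquad \text{(F18)}$$

Simplifying

$$y_1(T) = (1 - c(T - T_1)) u_1 + c(T - T_2) u_2 \qquad \text{(F19)}$$

$$y_2(T) = c(T - T_1) u_1 + (1 - c(T - T_2)) u_2 \qquad \text{(F20)}$$

Solving for u1 and u2 gives

$$u_1 = \frac{(1 - c(T - T_2)) y_1 - c(T - T_2) y_2}{1 - c(T - T_1) - c(T - T_2)} \qquad \text{(F21)}$$

$$u_2 = \frac{-c(T - T_1) y_1 + (1 - c(T - T_1)) y_2}{1 - c(T - T_1) - c(T - T_2)} \qquad \text{(F22)}$$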
From Eq. (F9), the joint probability function of y1 and y2 conditioned on H, the event that
the IR system is looking at the target of this example, is

$$p(y_1, y_2 \mid H) = |J| \, p\!\left(
\frac{(1 - c(T - T_2)) y_1 - c(T - T_2) y_2}{1 - c(T - T_1) - c(T - T_2)},\;
\frac{-c(T - T_1) y_1 + (1 - c(T - T_1)) y_2}{1 - c(T - T_1) - c(T - T_2)}
\,\middle|\, H \right) \qquad \text{(F23)}$$

where |J| is the Jacobian of the transformation

$$|J| = \frac{1}{1 - c(T - T_1) - c(T - T_2)} \qquad \text{(F24)}$$
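
The chain from Eqs. (F19) and (F20) through Eqs. (F21) to (F24) can be exercised numerically. The sketch below is a minimal illustration: the function names are hypothetical, and `c` uses the closed form (1 - exp(-0.2t))/2, which is an assumption of this sketch rather than a formula given in the text.

```python
import math


def c(t):
    """Assumed closed form for the series of Eq. (F16)."""
    return 0.5 * (1.0 - math.exp(-0.2 * t))


def forward(u1, u2, T, T1, T2):
    """Eqs. (F19)-(F20): pixel values at imaging time T from impulses u1 and u2."""
    c1, c2 = c(T - T1), c(T - T2)
    y1 = (1.0 - c1) * u1 + c2 * u2
    y2 = c1 * u1 + (1.0 - c2) * u2
    return y1, y2


def inverse(y1, y2, T, T1, T2):
    """Eqs. (F21)-(F22): recover the impulses from the observed pixel values."""
    c1, c2 = c(T - T1), c(T - T2)
    det = 1.0 - c1 - c2
    u1 = ((1.0 - c2) * y1 - c2 * y2) / det
    u2 = (-c1 * y1 + (1.0 - c1) * y2) / det
    return u1, u2


def jacobian(T, T1, T2):
    """Eq. (F24)."""
    return 1.0 / (1.0 - c(T - T1) - c(T - T2))


def output_pdf(input_pdf, y1, y2, T, T1, T2):
    """Eq. (F23): p(y1, y2 | H) = |J| * p(u1(y), u2(y) | H)."""
    u1, u2 = inverse(y1, y2, T, T1, T2)
    return jacobian(T, T1, T2) * input_pdf(u1, u2)


y1, y2 = forward(2.0, 3.0, T=10.0, T1=8.0, T2=5.0)
print(inverse(y1, y2, T=10.0, T1=8.0, T2=5.0))
print(jacobian(10.0, 8.0, 5.0))
```

Under this assumed c(t), the Jacobian equals 1 when both inputs occur at the imaging time and grows as the inputs recede into the past, since the denominator 1 - c(T-T1) - c(T-T2) shrinks toward zero.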

Visually, Eq. (F23) can be illustrated by Figure F3. The original joint pdf space of u1 and
u2 is shown to the left. The joint pdf space of y1 and y2 is shown to the right. The thermal
conduction transformation from the input of u1 and u2 to the output of y1 and y2 moves
and distorts the pdf space of u1 and u2 into the pdf space of y1 and y2 as shown. For
purposes of illustration, set T2 < T1 < T.


Figure F3 The pdf Space Transformation (left: pdf space of u1 and u2; right: pdf space of
y1 and y2, containing the distorted pdf space of u1 and u2, with curved arrows marking
decreasing values of T1 and T2)

As T1 and T2 approach the value of T, the u1 axis in the y1 and y2 pdf space to the right of
Figure F3 approaches the y1 axis and the u2 axis approaches the y2 axis.

As T1 and T2 recede into the past away from the value of T, the u1 and the u2 axes move in
the directions shown by the curved arrows of Figure F3. Ultimately the u1 and u2 axes will
meet on the y1 = y2 line, which implies that y1 and y2 will then be totally dependent, i.e. y1
= y2. This is caused by the fact that given sufficient time with no further thermal inputs,
the example target of Figure F1 will reach a constant temperature throughout.

Since the size of the u1 and u2 pdf space changes as it is distorted into the pdf space of y1
and y2, the Jacobian of Eq. (F23) serves to renormalize the pdf of y1 and y2 so that it
integrates to 1.
The foregoing results are independent of the specific shape of the conditional joint
probability density function of u1 and u2. However, to make the concepts more concrete,
arbitrarily choose a uniform pdf for u1 and u2 in the shaded region to the left of Figure F4.
This is transformed into the uniform pdf for y1 and y2, as shown to the right.


Figure F4 Transform of pdf Spaces (left: pdf space of u1 and u2; right: pdf space of y1 and
y2, containing the distorted pdf space of u1 and u2)


F3.3 Conclusions

One  of  the  major  questions  which  this  research  into  probability  modeling  of  IR  target 
imagery  sought  to  answer  is,  “Must  one  know  the  joint  pdf  for  the  environmental  drivers 
in  order  to  determine  the  joint  pdf  of  the  pixels  of  an  IR  target?”  This  appendix  shows  that 
for  certain  cases  where  the  number  of  pixels  equals  the  number  of  environmental  drivers 
the answer is “Affirmative.” Under these conditions the target is taking a ‘motion picture’
of its thermal environment. This ‘motion picture’ continuously decays in amplitude and
spatially blurs with time. The pdf of the IR image is thus equivalent to the pdf of this
‘motion picture.’ This equivalency makes the IR ATR problem both easier and harder.

• The problem becomes harder since one must now probabilistically understand the thermal
environment. This will necessitate further research into the time and spatial statistics of
thermal  drivers  in  the  battlefield  (Reference  4  is  a  first  step).  It  also  guarantees  that  field 
gathering  of  statistical  parameters  of  thermal  drivers  can  be  used  to  improve  the  IR 
ATR  target  detection/recognition  rate.  Example  possibilities  are  statistical  collection  by 
the  airframe  immediately  prior  to  its  collection  of  the  IR  image  of  radiant  flux  levels 
under  the  cloud  cover  and  of  ground  temperatures. 

•  The  problem  becomes  easier  since  large  regions  of  the  target  are  normally  affected  by 
the  same  driver  values,  e.g.  solar  loading.  The  state  equation  formulation  of  Appendix 
D  lends  itself  well  to  incorporation  of  single  drivers  acting  over  regions  as  well  as  for 
points of the target. Also, the theory provides a clear theoretical path for considering
discrete  “large  part”  modeling  of  IR  targets  along  the  line  of  Mr.  Foley’s  reasoning  at 
AARA. 
F4.0 THE CASE OF MORE THERMAL DRIVERS THAN
TARGET PIXELS

The  section  above  considered  a  special  subcase  in  that  the  number  of  thermal  drivers,  i.e. 
inputs  to  the  conductive  network,  was  equal  to  the  number  of  target  pixels,  i.e.  the  number 
of  outputs  from  the  conductive  network.  Since  the  thermal  environment  extends  arbitrarily 
far  back  into  the  past,  the  more  usual  case  is  the  one  where  the  number  of  thermal  drivers 
exceeds the number of target pixels. Drawing from intuition, one sees also that the
significance of thermal drivers recedes as the drivers recede in time. This section is devoted to
quantitatively  developing  the  necessary  mathematical  detail  and  further  insight  into  the 
more  usual  case. 

In general, the input joint probability density function conditioned on a set of
environmental/target conditions, H, is

$$p(v_1, v_2, \ldots, v_{mn} \mid H) \qquad \text{(F25)}$$

Now assume l, the size of the output vector, Y, is less than mn, the number of input
drivers. Temporarily augment or lengthen the Y vector by setting

$$Y_i = v_i, \quad l < i \le mn \qquad \text{(F26)}$$

Then by the technique of the section above

$$p(Y) = |J| \, p(g_1(Y_1), g_2(Y_2), \ldots, g_{mn}(Y_{mn}) \mid H) \qquad \text{(F27)}$$

or, equally

$$p(Y) = |J| \, p(g_1(Y_1), \ldots, g_l(Y_l), v_{l+1}, \ldots, v_{mn} \mid H) \qquad \text{(F28)}$$

Now reduce the size of the Y vector by integrating out the temporarily introduced
variables

$$p(Y) = \int \cdots \int |J| \, p(g_1(Y_1), \ldots, g_l(Y_l), v_{l+1}, \ldots, v_{mn} \mid H) \, dv_{l+1} \cdots dv_{mn} \qquad \text{(F29)}$$

Of the set of mn drivers, accuracy will be best served by integrating out the time weighted
drivers of the smallest amplitude at the time of imaging. These will generally be the
oldest.
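
The integration step of Eq. (F29) can be sketched numerically for the smallest possible case, one retained output and one temporarily introduced variable. The joint pdf `example_joint`, the integration limits, and the midpoint rule are all illustrative assumptions of this sketch.

```python
def marginalize_last(joint_pdf, y, lower, upper, steps=1000):
    """Integrate joint_pdf(y, v) over v in [lower, upper] using the midpoint rule."""
    width = (upper - lower) / steps
    total = 0.0
    for k in range(steps):
        v = lower + (k + 0.5) * width   # midpoint of the k-th slice
        total += joint_pdf(y, v)
    return total * width


def example_joint(y, v):
    """Hypothetical joint pdf on the unit square, p(y, v) = y + v (integrates to 1)."""
    return y + v if 0.0 <= y <= 1.0 and 0.0 <= v <= 1.0 else 0.0


# Marginal density of the retained output at y = 0.25; analytically y + 0.5.
print(marginalize_last(example_joint, 0.25, 0.0, 1.0))
```

For mn - l variables the same loop would be nested once per integrated variable, which is where the computational cost noted in the next subsection comes from.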

F4.1  An  Interpretation 

Eq. (F29) takes the projection of the multi-dimensional pdf from an mn dimensioned space
to an l dimensioned space. While this can be computationally complex, the insight is clear.
The characteristics of the pdf will stay the same (usually an unlikely situation) or become
simpler. This is an answer to the second question of this research, “Is there a law of large
numbers at work which makes complete knowledge of the environmental statistics less
important?” The example of the next subsection makes the concept more concrete.


F4.2 An Example

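Assume that the example given above in Section F3.2 is modified by adding another driver
at time T3

$$v(t) = \begin{bmatrix} u_1 \delta(t - T_1) \\ 0 \end{bmatrix} +
\begin{bmatrix} 0 \\ u_2 \delta(t - T_2) \end{bmatrix} +
\begin{bmatrix} 0 \\ u_3 \delta(t - T_3) \end{bmatrix} \qquad \text{(F30)}$$

where T3 < T2 < T1.

Working through the mathematics, this gives

$$y_1(T) = (1 - c(T - T_1)) u_1 + c(T - T_2) u_2 + c(T - T_3) u_3 \qquad \text{(F31)}$$

and

$$y_2(T) = c(T - T_1) u_1 + (1 - c(T - T_2)) u_2 \qquad \text{(F32)}$$

Now l = 2 < mn = 3 and we augment the output Y vector by

$$y_3(T) = (1 - c(T - T_3)) u_3 \qquad \text{(F33)}$$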

Let  the  input  pdf  be  given  by  Figure  F5. 


Figure F5 An Example Input pdf
This input 3 dimensional pdf is then projected into the 3 dimensional Y space as shown in
Figure F6, where the input U space is distorted into the output Y space. The u1
and u2 axes reside in the y1, y2 plane. The u3 axis resides in the y1, y3 plane.


Figure F6 The Remapping Of The Input pdf: (a) The u axis system in Y space; (b) The
remapped pdf in Y space


When the y3 axis is integrated out of Figure F6, the distorted pdf function is orthogonally
projected onto the y1, y2 plane, with the projection shown in Figure F7.


Figure F7 The Projection Of The Input pdf Into The Output Space


The key detail illustrated here is that the complexity of the pdf goes down by the
projection process (the pdf of Figure F7 is simpler than that of Figure F5).




F4.3 Conclusions


When  there  are  more  environmental  drivers  than  pixels  on  the  IR  target  region,  the  joint 
environmental driver pdf is projected down to a lower dimensional space. This reduces the pdf
complexity by eliminating any pdf variability along the axes eliminated. If the original pdf
function  was  fractal  with  dimension  d  and  if  e  axes  were  eliminated,  then  in  many  cases 
the  projected  pdf  will  have  the  fractal  dimension  d-e.  It  is  strongly  hypothesized  that  this 
reduction  of  complexity  is  the  enabling  mechanism  behind  whatever  success  is  enjoyed  by 
various  ad  hoc  IR  ATR  algorithms. 


F5.0  THE  CASE  OF  FEWER  THERMAL  DRIVERS  THAN 
TARGET  PIXELS 

This  case  was  not  considered  during  this  research  period  since  it  should  seldom  occur  in 
practice.  If  it  occurs  there  are  simple  extensions  to  take  care  of  it.  One  approach  can  be 
based  on  adding  sufficient  deterministic  0  or  nearly  0  valued  random  drivers  to  make  the 
number of target pixels and thermal drivers equal. In any case, random variables in the IR
sensor  would  most  likely  take  up  any  available  freedom. 


F6.0  OTHER  EXTENSIONS  OF  THE  THEORY 

• Often there will be poor IR resolution. This can be handled in the state equation
approach by setting up an observation vector, Y, that is a weighted sum of appropriate x
values.

• The ATR system may need to know the spatial thermal gradient on the target at a
particular point on its surface. This can be accommodated in the state equation approach by
first setting up two x nodes, x1 and x2, on the target surface Δd apart along some
direction, d. Then form an output y1 value

$$y_1 = \frac{x_2 - x_1}{\Delta d} \qquad \text{(F34)}$$

as an approximation to the spatial partial derivative $\partial x / \partial d$. By use of the
pdf remapping approach, one can then derive the pdf of $\partial x / \partial d$. This approach
can be carried further to find total derivatives, etc.
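
Both extensions amount to choosing a row of the observation matrix C. The sketch below assumes a forward-difference form for the gradient output; the function names, node indices, spacing, and temperatures are hypothetical.

```python
def gradient_row(n_states, i, j, delta_d):
    """Row of an observation matrix C that returns (x_j - x_i) / delta_d."""
    row = [0.0] * n_states
    row[i] = -1.0 / delta_d
    row[j] = 1.0 / delta_d
    return row


def observe(row, x):
    """One output value as a weighted sum of the state (node temperature) values."""
    return sum(w * xi for w, xi in zip(row, x))


# Hypothetical three-node target, with nodes 0 and 1 spaced 0.5 apart along d.
row = gradient_row(3, 0, 1, 0.5)
print(observe(row, [300.0, 302.0, 310.0]))
```

Because the output is still a linear function of the states, it fits the same pdf remapping used for ordinary pixel outputs. A low resolution pixel would use a row of positive averaging weights instead.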
Appendix G

DERIVING THE JOINT PROBABILITY
DENSITY FUNCTION FOR THERMAL
CONDUCTION NETWORKS WITH
NONLINEAR ENVIRONMENTAL
COUPLINGS

G1.0 INTRODUCTION

Appendix E considered a thermal conduction network with linear couplings to the
environment. Obviously, it may be possible to linearize nonlinear couplings, i.e. radiation and
convection,  to  the  environment  over  the  region  of  interest  and  continue  to  use  these 
results.  Here  the  theory  is  extended  to  include  nonlinear  couplings  which  are  so  strong  as 
to  preclude  linearization. 

Nonlinear  effects  add  another  dimension  of  difficulty.  Indeed,  many  authors  consider  that 
nonlinear  differential  equations  pose  an  insuperable  mathematical  problem.  The  problem 
of  nonlinear  probabilistic  differential  equations  is  even  harder.  This  appendix  sidesteps 
these  difficulties  by  sufficiently  discretizing  the  underlying  problem  to  the  point  that  it  can 
be  solved. 

G2.0  THEORY 

It  is  reasonable  to  assume  that  the  environment  is  not  a  function  of  the  target’s  thermal 
condition. This makes sense in the ATR case since the environment is much larger than the
target.  Alternatively,  the  case  where  the  environment  is  a  function  of  the  target’s  thermal 
condition  can  be  solved  by  extending  the  target  to  include  the  affected  portions  of  the 
environment. 

Next, assume that the coupling between the environment and the target is a monotonically
increasing function of the difference of specific environmental temperatures and target
surface temperatures. (To assume otherwise would be equivalent to allowing negative
resistances in the electrical resistor-capacitor analog, a condition that increases energy
transfer as differential environment/target temperatures get smaller.)

Nonlinear couplings are equivalent to inserting nonlinear resistances in the state
representation of circuits such as Figure D4. These nonlinear resistances cannot be handled by the
linear state formulation of Eqs. (D5) and (D6) for the reason that they nonlinearly couple
the effects of previous inputs. As a result, the nonlinear couplings void the condition of
linearity on which Appendix D is built, requiring an extended approach to the problem.
The approach which is chosen here is discrete time unwarping. Time unwarping is the
process of converting a system with memory and sequential inputs into an equivalent system
with no memory and paralleled inputs, Figure G1.

Figure G1 The Concept Of Time Unwarping: (a) Representation of a system with memory;
(b) Equivalent representation by a time unwarped cascade of memoryless systems


As  shown,  time  is  divided  into  n  finite  but  appropriately  small  increments.  One  could 
potentially  pass  to  the  limit  as  infinitesimal  time  increments  are  chosen,  although  this  case 
is  not  considered  here  as  it  is  not  needed  in  the  real  world  of  ATR  design. 

Figure G1b has simplified the mathematics by separating the problems incurred by use of
memory and nonlinearity through elimination of the consideration of memory. Now one
can solve the problem by working through each nonlinear stage of the cascade from left to
right in Figure G1 on a one by one basis. Consider any individual stage, Figure G2. This
nonlinear stage contains all nonlinear environment-target couplings present for time i.
These couplings are functions of environmental drivers, U(i), and target surface
temperatures, X(i), only. Being thus without memory, the couplings, although nonlinear, are easily
written down in mathematical form once properly modeled. The output, Z(i), then is
passed into the linear system box representing the conduction network of the target. The
equations for this linear box can be written in the discrete state equation form discussed
earlier in Appendix D.


Figure  G2  Decomposition  Of  A  Stage  Of  The  Cascade 


If the problem were only to get the values of the output, Y(i+1), then the problem would be
comparatively easy. The problem, however, is to obtain the pdf of Y(n), where n represents
the time of IR imaging. Various subcases are discussed below.

G2.1 The Case of Independent U(i) Over i

If the U(i) inputs are jointly independent over i, the pdf calculation problem is simplified.
It is then straightforward to show that for any i, X(i+1) is statistically independent of U(i).
This allows the calculation of any stage to proceed as follows. From the pdf of U(i),
develop the pdf of Z(i) via very slight extensions of the pdf remapping processes of
Appendix F. The extensions required are predominantly to take care of the many to one
mappings caused by the nonlinearities.

Now form the joint pdf of X(i) and Z(i)

$$p(X(i), Z(i)) = p(X(i)) \, p(Z(i)) \qquad \text{(G1)}$$

The pdf determination of X(i+1) can then proceed exactly as determined in Appendix F,
using X(i) and Z(i) as environmental drivers to the linear system.
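
The stage-by-stage structure can be illustrated with a sampling sketch. This is a Monte Carlo substitute for the analytic pdf remapping described above, not the remapping itself, and every model choice in it is hypothetical: the cubic coupling, the one-node linear step, the Gaussian drivers, and all parameter values.

```python
import random


def coupling(u, x, h=0.5, r=1e-4):
    """Hypothetical memoryless coupling Z = f(U, X), monotonically increasing in (u - x)."""
    d = u - x
    return h * d + r * d ** 3


def linear_step(x, z, b=0.5):
    """Hypothetical one-node stand-in for the discrete linear state equation."""
    return x + b * z


def simulate(sample_driver, sample_x0, n_stages, n_samples, seed=0):
    """Draw samples of X(n) by passing each sample through the unwarped cascade."""
    rng = random.Random(seed)
    outputs = []
    for _ in range(n_samples):
        x = sample_x0(rng)
        for i in range(n_stages):
            u = sample_driver(rng, i)          # drawn independently for every stage i
            x = linear_step(x, coupling(u, x))
        outputs.append(x)
    return outputs


samples = simulate(lambda rng, i: rng.gauss(300.0, 5.0), lambda rng: 290.0,
                   n_stages=20, n_samples=1000)
print(min(samples), max(samples))
```

A histogram of `samples` gives an empirical estimate of the pdf of X(n) for the assumed model. The analytic route of this section would yield that density directly, stage by stage.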

G2.2 The Case of Dependent U(i)

The case where U(i) is dependent over i is a case more likely to physically occur. This is
the case where the environment maintains some decreasing correlation over time. The
engineering literature seldom treats any case other than the Gaussian, or provides more than
the first and second moments of more complicated cases. Again, a wider solution is broadly
considered to be intractably difficult. On the other hand, the full probabilistic solution is
needed here for the IR ATR problem.


This more difficult case of non-independent environmental drivers may actually be solved
as follows. First one needs the joint pdf

$$p(X(1), U(1), U(2), \ldots, U(n-1)) \qquad \text{(G2)}$$

Obviously, without a prior thermal characterization of the target it will be difficult to
factor in the effects of X(1), since it depends on its thermal environment up to that time.
Appendix B shows, however, that X(1)'s effect deteriorates with time, so initial assumption
errors are reduced as n grows larger. Thus one approach is to choose large n.

Other techniques can also be used to reduce the initial assumption error. Since the absolute
time of IR imaging corresponding to n of Eq. (G2) will be known, one can substitute the
marginal statistics for X(1) at the absolute time corresponding to indexed time 1, based on time
of day thermal averages. Prior climate, weather, and cloud cover data can be factored in as
well.

In a minor extension of the pdf mapping techniques of Appendix F, one can next transform
the pdf of Eq. (G2) to that of

$$p(X(1), Z(2), U(2), \ldots, U(n-1)) \qquad \text{(G3)}$$

by considering the first nonlinear environmental couplings of Figure G2 at time 1.

Then again by the pdf mapping techniques of Appendix F, one can transform the pdf of
Eq. (G3) to that of

$$p(X(2), U(2), \ldots, U(n-1)) \qquad \text{(G4)}$$

thus removing consideration of stage 1 from the cascade of Figure G1. The mathematical
process loops from here on until all stages are removed from the cascade. At this point, the
output pdf of Y(n) is found from the pdf of X(n).

G2.3 Partial Independence of The U(i)

It is unthinkable that there is no partial independence between the U(i), since total
dependence would imply a very high order of dependence of natural events between the various
processes of the environment. Markov chains, for instance, consider only pairwise
dependence. The syntax of spoken English is thought to be only of 3- or 4-wise dependence.

If partial independence of the U(i) is observed, say i-wise independence, then the pdf of
Eq. (G2) can be factored. Each factoring will require only part of the pdf of Eq. (G2) to be
subjected to mathematical remapping at any particular stage of Figure G1. This provides
corresponding increases in computational efficiency in the mathematical algorithm
outlined in Section G2.2 above.
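
The pairwise (first-order Markov) case gives the simplest picture of such a factoring. The sketch below uses a hypothetical two-level discrete driver; the state names and probabilities are invented for illustration only.

```python
from itertools import product


def markov_joint(sequence, initial, transition):
    """p(U(1), ..., U(n)) = p(U(1)) * product over i of p(U(i) | U(i-1))."""
    prob = initial[sequence[0]]
    for prev, cur in zip(sequence, sequence[1:]):
        prob *= transition[prev][cur]
    return prob


# Hypothetical two-level environmental driver.
initial = {"clear": 0.6, "cloudy": 0.4}
transition = {
    "clear": {"clear": 0.8, "cloudy": 0.2},
    "cloudy": {"clear": 0.3, "cloudy": 0.7},
}

# The factored form still defines a proper joint distribution over all sequences.
total = sum(markov_joint(seq, initial, transition)
            for seq in product(initial, repeat=4))
print(round(total, 12))
```

With k driver levels and n time steps, the unfactored joint needs k^n entries, while the pairwise factoring needs only the k initial probabilities and a k by k transition table. That difference is the source of the computational saving described above.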
Abstract

This  report  covers  research  into  the  use  of  connectionist  models  to  identify  airplanes 
from  their  radar  signature.  The  radar  returns  used  have  been  generated  by  the  SRCRCS 
simulator.  In  addition  a  small  study  has  been  done  on  the  actual  returns  from  two 
airplanes. The problem is specified as follows: an unknown aircraft appears on radar and is
compared by a connectionist model to known aircraft radar signatures. The study has
primarily  examined  recognizing  planes  as  their  aspect  angle  changes.  A  90  degree  aspect 
angle spectrum has been studied. The ideal situation is to have the aspect angles in
the training set far apart, so that a smaller training set suffices. The major problem becomes one
of  representing  the  data  to  the  connectionist  network.  The  proper  representation,  or 
feature  extraction  from  the  data,  allows  good  recognition  to  occur.  Several  methods 
of  encoding  the  radar  returns  for  recognition  are  studied.  Two  types  of  connectionist 
networks are used in this study, a feedforward backpropagation network and an
instance-based learning hybrid connectionist network (SC-net). It is shown that the use of
self-configuring forms of backpropagation networks is needed in this type of problem.
Recognition  rates  of  better  than  90%  with  training  at  every  5°  of  aspect  angle  have 
been  achieved.  Aspect  angle  recovery  to  within  an  average  of  3°  of  the  actual  aspect 
angle  has  been  achieved  on  correctly  recognized  planes. 


1  Introduction 

The  research  described  here  involves  recognizing  airplanes  from  their  radar  signatures.  The 
goal  is  to  use  air  to  air  radar  because  of  its  long  range  potential  and  all-weather  capability. 
Test  results  have  shown  that  the  returns  from  a  high  range  resolution  (HRR)  radar  can 
uniquely  describe  targets.  Returns  from  HRR  radar  represent  illuminated  targets  as  range 
profiles  along  the  line  of  sight  of  the  radar.  These  range  profiles  often  vary  appreciably  over 
small  changes  in  aspect  angle.  The  changes  can  be  attributed  to  the  illumination  of  different 
scatterers  and/or  a  compression  of  inter-scatter  distances  due  to  a  change  in  the  geometry  of 
the  problem,  though  the  latter  is  a  slight  effect.  Since  the  HRR  data  is  so  aspect-dependent, 
target  recognition  with  all  necessary  aspects  currently  requires  a  very  large  database.  The 
recognition  task  is  further  complicated  by  the  fact  that  the  radar  returns  must  be  aligned 
with the stored data for comparison. The issues discussed above motivate the need to develop
an efficient/compact method of representing and comparing HRR data.

Supervised  connectionist  techniques  are  used  as  the  means  to  recognize  the  planes  from 
radar  signatures.  Their  generalization  capabilities  are  exploited  to  limit  the  training  set  to  a 
manageable size. Feedforward backpropagation networks in the form of Quickprop [5] were
tried without success. The instance-based connectionist learning system embodied
by  the  hybrid  symbolic,  connectionist  SC-net  system  is  used  primarily.  Several  other  learning 
systems  are  explored  briefly  in  this  research.  They  consist  of  two  connectionist  models  that 
grow  their  own  structure,  Cascade  Correlation  [6]  and  Divide  and  Conquer  Networks  [20]. 

There  were  seven  airplanes  studied  in  this  research.  They  consisted  of  models  of  the  F4, 
F14,  F15,  F16,  F18,  T38  and  Lear,  as  generated  by  the  SRCRCS  tool  developed  by  the 
Syracuse  Research  Corp.  The  data  simulates  a  wide  band  width  and  high  range  resolution 
radar  (HRR)  with  a  20  meter  range  window,  consisting  of  256  discrete  points.  The  magnitude 
of  the  return  at  each  point  corresponds  to  the  radar  cross  section  at  a  specific  range  on  the 
aircraft. The planes were studied at aspect angles from 0 to 90 degrees, where 0 degrees
indicates a plane coming directly at the viewer. This range was felt to be sufficient to
determine whether our approach would be viable for actual planes. It covers the space of
views that might be seen of an approaching aircraft. Since the models were symmetrical,
only up to plus 90 degrees was studied.

A  small  amount  of  real  data  was  available  to  us  on  this  study.  One  type  of  aircraft  included 
examples  with  2  different  types  of  armaments.  There  were  812  measured  points  in  the  return. 
We  had  21  returns  of  high  range  resolution  radar  from  3  aircraft  of  2  different  types.  The 
aspect  angles  were  in  a  3°  range  around  180°  meaning  the  planes  were  going  directly  away 
from  the  radar  site.  One  issue  that  this  data  forced  us  to  acknowledge  was  the  following. 
The  planes  were  not  necessarily  centered  in  the  window  of  the  radar  return  as  they  were  with 
the  generated  returns.  This  fact  about  real  data  shaped  all  of  the  methods  we  devised  to 
represent  radar  returns  to  our  learning  systems.  The  problem  of  representation  for  learning 
is  clearly  much  simpler  in  the  case  that  it  is  known  where  the  center  of  the  airplane  return 
resides.  There  is  less  chance  of  encoding  noise  as  a  feature  for  identification,  which  would 
lead  to  widely  varying  and  inaccurate  results. 

The most difficult problem in our view is determining what features to use for the
connectionist models to learn to make their identifications. A radar return consists of a set of
peaks  which  result  from  scatterers  on  the  airplane.  There  will  be  noise  in  the  return,  which 
can  provide  false  peaks.  The  first  task  is  to  exclude  as  much  noise  as  possible  without  losing 
real  information.  This  involves  establishing  a  noise  floor  as  a  basis  from  which  to  measure  the 
height  of  peaks.  The  problem  then,  is  how  to  turn  the  peaks  into  meaningful  features  from 
which  accurate  recognition  can  be  done.  It  is  our  contention  that  if  this  is  accurately  done, 
many  learning  algorithms  may  be  able  to  generalize  effectively  from  a  well-chosen  training 
set. 

The method of recognizing airplanes from radar returns, described here, is not solely
based upon training a connectionist network and using the trained network for recognition.
It  also  depends  upon  encoding  the  radar  returns  for  training  and  testing.  From  one  return  a 
number  of  patterns  are  created  which  are  then  used  for  training.  Given  a  return  for  testing, 
the  return  is  used  to  generate  multiple  patterns  in  the  same  manner  as  for  the  training  phase. 
The  patterns  are  then  presented  to  the  trained  network  and  the  plane  is  identified  as  the  one 
which  has  its  output  turned  on  for  the  most  patterns. 
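
The voting step described above can be sketched as follows. The names `identify` and `classify` are hypothetical; `classify` stands in for the trained network and is assumed to return one aircraft label per pattern.

```python
from collections import Counter


def identify(patterns, classify):
    """Return the aircraft label that the classifier selects for the most patterns."""
    labels = [classify(p) for p in patterns]
    if not labels:
        raise ValueError("need at least one pattern")
    # most_common(1) returns the label with the highest vote count.
    return Counter(labels).most_common(1)[0][0]


# Stand-in classifier for illustration only: odd patterns vote F15, even vote F4.
print(identify([1, 2, 3, 4, 5], lambda p: "F15" if p % 2 else "F4"))
```

Ties are broken here in favour of the label that was seen first. The text does not say how ties are resolved, so that choice is an assumption of this sketch.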

In the following, a description of the main learning algorithms employed is given, then the
crucial topic of how to represent the returns as features to the algorithms is discussed, the
actual recognition algorithm is discussed, then the results from experiments with different
representation schemes are presented, and finally a summary of our research findings is given.

2  The  connectionist  learning  algorithms 

The  algorithms  described  below  are  supervised  learning  algorithms  which  require  the  examples 
in  the  training  set  to  be  labeled  or  classified.  Quickprop  requires  a  fixed  architecture  be  chosen 
by  the  user  of  the  system,  while  SC-net  configures  its  connectionist  architecture  based  upon 
the training examples presented to it. Cascade Correlation is a connectionist algorithm that
uses the Quickprop training method, but grows its own structure based on the training set
presented to it.

2.1  Quickprop  and  Cascade  Correlation 

The  Quickprop  algorithm  is  a  generalization  of  the  Backpropagation  algorithm,  which  is 
significantly  faster  during  training  than  Backpropagation  [5].  It  can  be  up  to  an  order  of 
magnitude  faster.  Further,  comparable  results  have  been  observed  [14]  when  compared  against 
the  backpropagation  trained  network. 

Cascade  correlation  begins  with  just  input  and  output  layers.  Every  input  is  connected 
to every output, and all the connections have adjustable weights. Moreover, there is a bias
input, permanently set to +1. The Cascade architecture is shown in Figure 1. The output
nodes  may  just  produce  a  linear  sum  of  their  weighted  inputs,  or  they  can  use  some  non-linear 
activation  function.  As  the  network  is  trained,  new  hidden  nodes  are  added  to  the  network, 
one  by  one  as  needed,  and  as  dictated  by  the  learning  algorithm.  Cascade-correlation  starts 
with  no  hidden  nodes,  and  using  Quickprop,  it  trains  the  direct  connections  between  the 
input  and  output  nodes  over  the  entire  training  set  until  no  significant  error  reduction  is 
achieved.  Then  the  error  is  computed  by  running  the  network  over  the  whole  training  set.  If 
the  error  is  less  than  e  (set  by  the  user),  the  algorithm  terminates;  otherwise,  a  new  hidden 
node  is  added  in  an  attempt  to  eliminate  or  reduce  the  residual  error.  The  inputs  of  the  new 
node  come  from  every  input  node  and  all  other  hidden  nodes  in  the  network;  each  hidden 
node  adds  a  new  one-node  “layer”  to  the  network.  Initially,  the  outputs  of  the  new  node 
are  not  yet  connected  to  the  active  network.  A  number  of  passes  are  run  over  the  training 
patterns,  adjusting  the  weights  of  the  input  connections  to  the  new  nodes  after  each  pass. 
This adjustment is done in the direction that maximizes S, the sum over all output nodes of
the magnitude of the correlation between the candidate node’s output value z, as shown in
Figure 1, and the residual output error E_o observed at each output node o.

Finally, quickprop is used for a fast-converging ascent to maximize S. When the magnitude
of  the  correlation  stops  improving,  the  new  candidate  node  is  installed  in  the  active  network; 
its  outputs  are  now  connected  to  the  output  nodes.  Then  its  input  weights  are  frozen,  and  its 
output  weights  are  trained  to  minimize  the  network’s  output  error.  This  is  done  in  the  same 
way  the  direct  connections  were  trained,  using  quickprop.  If  the  network’s  performance  is  not 
satisfactory  and  no  significant  error  reduction  is  achieved,  a  new  hidden  node  is  introduced; 
the  cycle  is  restarted.  This  cycle  is  repeated  until  the  error  Ecc(w)  becomes  acceptably  small 
(or  until  we  give  up).  In  addition,  cascade-correlation  can  use  a  pool  of  candidate  nodes 
instead  of  just  one  node.  These  nodes  are  initialized  with  different  random  weights;  the 
candidate  with  the  best  correlation  is  installed.  Some  implementation  improvements  can  also 
be found in [6].
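
As a rough illustration of the score S described above, the sketch below sums, over the
output nodes, the magnitude of the covariance between a candidate node's output z and the
residual error at that output, taken over the training patterns. This follows the commonly
published form of the Cascade-Correlation score; the function names, argument layout and the
candidate-pool helper are hypothetical and are not taken from the implementation used in
this work.

```python
def correlation_score(candidate_outputs, residual_errors):
    """Sum over outputs of |sum over patterns of (z - mean z) * (E_o - mean E_o)|.

    candidate_outputs: one candidate output value z per training pattern.
    residual_errors:   one list per training pattern, holding the residual
                       error E_o observed at each output node o.
    """
    n_patterns = len(candidate_outputs)
    n_outputs = len(residual_errors[0])
    z_mean = sum(candidate_outputs) / n_patterns
    score = 0.0
    for o in range(n_outputs):
        e_mean = sum(row[o] for row in residual_errors) / n_patterns
        covariance = sum(
            (z - z_mean) * (row[o] - e_mean)
            for z, row in zip(candidate_outputs, residual_errors)
        )
        score += abs(covariance)
    return score


def best_candidate(pool_outputs, residual_errors):
    """Index of the candidate in the pool with the largest correlation score."""
    scores = [correlation_score(z, residual_errors) for z in pool_outputs]
    return max(range(len(scores)), key=lambda i: scores[i])
```

A candidate whose output does not vary across the training patterns scores 0, so it would
never be preferred over a candidate that tracks the residual error at any output.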

2.2  Divide  and  Conquer  Networks 

The algorithm described here differs from other self-configuring algorithms that make use
of a backpropagation-like training method in that it will add cells in multiple hidden layers,
unlike the algorithms reported by Ash and Hirose et al. [1, 11]. It can also add multiple
nodes on each hidden layer, unlike Cascade Correlation [6], and does not make use of any
correlation measure. Further, divide and conquer networks use only the examples that have
not  already  been  correctly  classified  in  the  incremental  training  process.  The  algorithm 
was  partly  inspired  by  the  difficulty  in  learning  to  recognize  planes  with  Backpropagation 
or  Quickprop.  The  algorithm  is  new  and  relatively  untried.  It  is  completely  described  in 
Appendix  A. 

2.3  SC-net 

SC-net  is  a  hybrid  symbolic,  connectionist  learning  system  [9].  Rules  may  be  directly  encoded 
into  the  network  [18].  A  limited  form  of  variable  binding  is  supported.  After  learning  is 
complete  rules  may  be  generated  from  the  network.  The  learning  algorithm  is  instance- 
based  and  causes  the  structure  of  the  connectionist  network  to  be  configured  based  upon 
the  examples  presented  to  SC-net  in  the  training  phase.  Only  the  inputs  and  the  number  of 
outputs  are  fixed  initially.  The  learning  algorithm  is  also  incremental  which  allows  examples 
to  be  presented  one  at  a  time  or  in  batches  without  requiring  retraining  on  examples  seen 
earlier. 

The  learning  algorithm  works  as  follows.  For  each  labeled  example  presented  to  the  system 
a  forward  pass  is  made.  There  are  three  possible  outcomes  for  each  output. 

1. The example is recognized correctly.

2. The example is not recognized correctly, but is within a threshold of the desired output
value.

3. The example is not correctly recognized.

[Figure 1 panels: Initial State (No Hidden Units); Add Hidden Unit 1; Add Hidden Unit 2.
Labels: Inputs, Outputs.]

Figure 1: The Cascade architecture from initial state through the addition of two nodes. The
weights to the hidden nodes are frozen when the hidden nodes are added.

In the first case, no action is taken. In the second case, the bias of a cell in the existing
network is modified to enable the system to recognize the new example while retaining
the ability to recognize previously seen examples. In the third case, cells are added to the
network structure so that the training example will be correctly recognized the next
time it is presented to the network. This is called the recruitment of cells phase.

Because the algorithm is instance-based, only the examples seen in the train set are recognized
unless a form of post-training generalization is used. With a nearest neighbor classifier, for
example, a distance metric is commonly used to classify examples in the test set which do not
exactly match the training examples. SC-net has two forms of post-training generalization [18].
The simplest to describe, and the one used in this work, is the min-drop feature. When a test
pattern that has not been seen in the training set is presented to the system, no output is
turned on. Hence, the min-drop is applied to find the nearest corresponding output for the
example, if one exists. The min-drop feature works by dropping inhibitory connections in the
network one at a time until an output is turned on or the pre-set limit on the number of
connections allowed to be dropped is reached. It provides surprisingly good generalization.
The other method is a formalized covering algorithm which permanently removes connections
that are not needed for recognition of the training examples.

SC-net  allows  the  use  of  fuzzy  variables  [18]  to  represent  inputs  in  continuous  data.  This 
feature  was  used  in  the  work  reported  here  and  is  important  in  the  following  sense.  The  return 
values  in  db  are  real  valued  and  must  be  encoded  in  some  fashion  to  SC-net  which  accepts 
inputs  between  0  and  1  only.  The  conversion  to  [0,1]  is  done  in  the  binning  process  when 
normalization  occurs.  However,  this  work  depends  upon  the  different  values  represented  by 
the bin magnitudes being meaningful (if compressed). Further, it has been found in other
research that encoding real valued features into a set of features each of which is turned on
when the original value is within its range can be an effective aid to the learning process
[3, 4].

Table 1: Fuzzy partitions used for bin values

Partition   Range
p1          0 - 0.1
p2          0.1 - 0.2
p3          0.2 - 0.3
p4          0.3 - 0.4
p5          0.5 - 0.6
p6          0.6 - 0.7
p7          0.7 - 0.8
p8          0.8 - 0.9
p9          0.9 - 1

In these experiments, it was unknown how to partition the interval into the proper
sub-intervals, so we used those shown in Table 1 and dynamic plateau modification (DPM) to
modify the fuzzy regions [18]. The use of fuzzy variables means that for each input feature,
9 fuzzy values will be generated for use in training. In Figure 2, the membership function
for p2 is shown. The membership value for inputs is 1 in p2 when the input values lie in
[0.1, 0.2]. Membership in p2, μ_p2(x), has a linearly declining value as the actual value goes
from 0.1 towards 0 or from 0.2 towards 1, respectively. DPM learns to modify the arms of
the membership function for each of the respective intervals by bringing them in or forcing
them to 0 earlier. For example, DPM might cause μ_p2(x) to go to 0 at 0.3 instead of 1, shown
by the dashed line in Figure 2.

Figure 2: The initial fuzzy membership function for p2.
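
The membership function for p2 can be sketched as a plateau with two linear arms. The code
below is a minimal illustration under the description in the text, not SC-net's actual
implementation; the function name and the `left_zero` / `right_zero` parameters (the points
where each arm reaches 0, which DPM is described as pulling inward) are hypothetical.

```python
def membership(x, lo, hi, left_zero=0.0, right_zero=1.0):
    """Plateau membership: 1 on [lo, hi], falling linearly to 0 on either side.

    left_zero and right_zero are the input values at which the left and right
    arms reach 0. Initially they are the ends of the [0, 1] input range.
    """
    if lo <= x <= hi:
        return 1.0
    if x < lo:
        if x <= left_zero:
            return 0.0
        return (x - left_zero) / (lo - left_zero)
    if x >= right_zero:
        return 0.0
    return (right_zero - x) / (right_zero - hi)


# Initial function for p2: the right arm declines from 0.2 all the way to 1.
initial = membership(0.6, 0.1, 0.2)
# After a DPM-style change that brings the right arm in to reach 0 at 0.3.
modified = membership(0.6, 0.1, 0.2, right_zero=0.3)
```

With the initial arms an input of 0.6 still has partial membership in p2, while with the
right arm brought in to 0.3 the same input has no membership in p2 at all.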

3  Representations  of  the  Radar  Returns 

The  radar  returns  in  this  study  are  a  set  of  points  representing  a  radar  cross  section  at 
separations  of  about  8cm  with  each  point  having  a  value  in  the  decibel  (db)  range.  The 
choice  of  features  for  learning  from  these  points  is  problematic  for  the  following  reasons. 
Each  point  of  the  return  will  not  correspond  to  the  same  point  on  the  same  plane  at  the 
same  view,  in  general.  The  magnitude  of  the  returns  is  not  guaranteed  to  be  the  same  for 
different  returns  on  the  same  plane  at  the  same  view.  Noise  can  have  unpredictable  effects 
on  the  returns. 

Taking  the  magnitude  of  the  return  at  each  point  will  not  provide  a  viable  train  set.  The 
second  possibility  is  to  use  the  relative  magnitude  of  each  point’s  return  value,  but  unless  the 
plane  can  be  reliably  positioned  in  the  return  window  this  approach  will  not  provide  a  viable 
train  set  either.  Further,  it  is  desirable  to  have  the  smallest  number  of  viable  features  for 
training.  Hence,  as  others  have,  we  incorporate  the  concept  of  grouping  a  number  of  points 
in  a  return  into  bins  [7].  The  size  of  the  bins,  or  number  of  return  points  each  contains,  can 
be  chosen  at  training  time. 

After  choosing  a  bin  size  for  training,  the  biggest  problem  is  to  ensure  that  the  bin  from 
the  same  portion  of  the  aircraft  is  provided  to  the  same  input  of  the  learning  algorithm  for 
each  training  pattern.  That  is,  if  a  bin  incorporates  for  example  the  first  8  points  of  a  return 
from  a  plane  and  is  mapped  for  one  pattern  to  the  first  input  or  feature  of  the  connectionist 
network, all future patterns involving that plane at the same aspect angle would ideally have
the  same  bin  mapped  to  the  same  input.  However,  if  you  could  identify  the  center  or  start  of 
the  plane  in  the  return  without  ambiguity  you  would  probably  not  need  to  use  a  connectionist 
system  for  recognition. 

Three algorithms for representing the radar returns to the connectionist networks have
been developed in an attempt to mitigate the difficulties discussed. Each groups the points
in the return into bins of a user-set size. The bins are represented by the average value of
the heights of the peaks within them. Each algorithm creates more than one training pattern
from each radar return by creating different alignments of the bins. The idea is that the
group of patterns for a return (each labeled with the same plane) will be recognizable. Hence,
during testing the same type of group will be created and the class assigned will be the one
which is turned on by the largest number of patterns from the group.

The first algorithm is called A.1 and is shown in Figure 3. It aligns a return by the k
largest bins, where the magnitude of a bin is calculated as the average height of the peak
values contained within. Hence, k patterns are created for each return, each made by centering
one of the k largest bins. To prevent “important” bins from being shifted out of the set of
features, left and right “guard bins” are created; they contain the largest bins that would
be shifted out (when centering the set of features) to the left and right, respectively. The
guard bins serve as the leftmost or rightmost bin depending upon the direction of shift. When
shifting in one direction, bins with an average value of 0 are inserted as the outer bins in
the opposite direction. Hence, shifting must be limited, as the bins shifted in contain no
information. The number of inputs or features provided to the learning algorithm for A.1 will
be the number of bins.
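
The following is a minimal sketch of the binning and centering idea behind A.1, not the
authors' code. It makes several assumptions that the text leaves open: the bin magnitude is
the plain average of the points in the bin (noise removal is omitted), the center slot is
index n // 2, and the phrase "if appropriate" is read as "the guard slot keeps the larger of
its shifted-in value and the largest bin that was shifted out". All function names are
hypothetical.

```python
def make_bins(points, bin_size):
    """Average consecutive groups of bin_size points; leftover points are dropped."""
    n_bins = len(points) // bin_size
    return [
        sum(points[b * bin_size:(b + 1) * bin_size]) / bin_size
        for b in range(n_bins)
    ]


def center_with_guards(bins, index):
    """Shift bins so bins[index] lands in the middle slot, zero-filling the far side."""
    n = len(bins)
    shift = n // 2 - index          # > 0 shifts right, < 0 shifts left
    pattern = [0.0] * n
    for src, value in enumerate(bins):
        dst = src + shift
        if 0 <= dst < n:
            pattern[dst] = value
    if shift > 0:                   # bins fell off the right edge: right guard bin
        pattern[-1] = max(pattern[-1], max(bins[n - shift:]))
    elif shift < 0:                 # bins fell off the left edge: left guard bin
        pattern[0] = max(pattern[0], max(bins[:-shift]))
    return pattern


def patterns_a1(bins, k):
    """One pattern per each of the k largest bins, largest first."""
    order = sorted(range(len(bins)), key=lambda i: bins[i], reverse=True)
    return [center_with_guards(bins, i) for i in order[:k]]


example = patterns_a1([7.0, 1.0, 2.0, 9.0, 3.0], k=2)
```

In the example the largest bin (9.0) is centered first, which pushes the 7.0 bin out on the
left, so it is kept as the left guard bin; every pattern has as many features as there are
bins.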

The second algorithm is called A.2 and is shown in Figure 4. It is also an attempt to
use shifting of bins to capture a return in several patterns. This time 2j+1 training (testing)
patterns are created from a return. The parameter j is chosen by the algorithm user. A.2
works by creating bins and centering each of the j bins to the left of the center of the actual
return and each of the j bins to the right of the center of the actual return. The center bin
of the return is also the centerpoint for a train/test pattern. Again guard bins are used on
the left and right sides to capture the largest bins that would otherwise be shifted out of
the set of features to be presented to the connectionist network. For n bins there will be n
features, one for each bin.

The Binning Algorithms

Binning Algorithm A.1
(Align by k largest Scatterers)

(0) Inputs: Bin-Size, k (k < #Bins)
(1) Determine largest and smallest scatterer
(2) Calculate average scatterer height
(3) Remove noise (Cutoff is average scatterer height)
(4) Create Bin-Vector of size Bin-Size
(5) For i=1 to k do
    (5.1) Align by ith largest bin (center bin)
    (5.2) Create Left-, and Right-Guard-Bin
    (5.3) Set Left-Guard-Bin to largest bin shifted
          out to the left (if appropriate)
    (5.4) Set Right-Guard-Bin to largest bin shifted
          out to the right (if appropriate)
    (5.5) Print Bin-Vector
(6) Halt

Figure 3: Algorithm A.1


The third algorithm is called B and is shown in Figure 5. In this algorithm no information
is shifted out of the training/testing pattern. This is accomplished by creating a temporary
“bin” vector with twice as many features as the number of bins generated from the radar
return. Now, as in A.1, the k largest bins are centered, but because they are centered in a
vector of twice the number of actual bins, all information is retained. The temporary bin
vector for each train/test pattern is then compressed by merging each pair of neighboring
bins into one new bin with a magnitude that is the average of the average peak heights in
both bins. This results in patterns whose number of features equals the number of original
bins in the binned return.
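
The centering-then-compressing step of algorithm B can be sketched as follows. This is an
illustration under assumptions rather than the authors' code: the chosen bin is placed at
slot n of the double-length vector, empty slots are 0, and the function names are
hypothetical.

```python
def pattern_b(bins, index):
    """Center bins[index] in a vector of twice the length, then average neighbor pairs."""
    n = len(bins)
    wide = [0.0] * (2 * n)
    offset = n - index              # bins[index] lands at slot n, the middle
    for src, value in enumerate(bins):
        wide[src + offset] = value  # always within 0 .. 2n-1, so nothing is shifted out
    return [(wide[2 * i] + wide[2 * i + 1]) / 2 for i in range(n)]


def patterns_b(bins, k):
    """One compressed pattern per each of the k largest bins, largest first."""
    order = sorted(range(len(bins)), key=lambda i: bins[i], reverse=True)
    return [pattern_b(bins, i) for i in order[:k]]


example_b = patterns_b([4.0, 8.0, 2.0], k=2)
```

Because every original bin stays inside the double-length vector, the compressed pattern
always sums to half the sum of the original bins, whichever bin is centered.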


Binning Algorithm A.2
(Align by j Left-Right Shifts)

(0) Inputs: Bin-Size, j (j < int(#Bins/2)+1)
(1) Determine largest and smallest scatterer
(2) Calculate average scatterer height
(3) Remove noise (Cutoff is average scatterer height)
(4) Create Bin-Vector of size Bin-Size
(5) For i=center-j to center+j do
    (5.1) Align by ith element of Bin-Vector (center by this
          element)
    (5.2) Create Left-, and Right-Guard-Bin
    (5.3) Set Left-Guard-Bin to largest bin shifted
          out to the left (if appropriate)
    (5.4) Set Right-Guard-Bin to largest bin shifted
          out to the right (if appropriate)
    (5.5) Print Bin-Vector
(6) Halt

Figure 4: Algorithm A.2


Binning Algorithm B
(Align by k Left-Right Shifts, no shifting out)

(0) Inputs: Bin-Size, k (k < #Bins)
(1) Determine largest and smallest scatterer
(2) Calculate average scatterer height
(3) Remove noise (Cutoff is average scatterer height)
(4) Create Bin-Vector of size Bin-Size
(5) Create Bin-Vector-2 of size 2*Bin-Size
(6) For i=1 to k do
    (6.1) Align by ith largest bin (center bin within
          Bin-Vector-2, no shifting out)
    (6.2) Compress Bin-Vector-2 into Bin-Vector
          by taking neighboring bins and compressing them
          into one and taking the average value for that new
          bin
    (6.3) Print Bin-Vector
(7) Halt

Figure 5: Algorithm B
4  Recognizing  airplanes  from  radar  returns


Every  radar  return  for  both  training  and  testing  will  be  pre-processed  by  the  chosen  binning 
algorithm.  During  the  testing  process  all  the  patterns  generated  by  the  algorithm  will  be 
presented  to  the  trained  connectionist  network.  The  plane  recognized  is  the  one  that  is 
recognized the most times given the patterns. That is, if there are 4 patterns and the output
of plane A is turned on for 2 of them, the output of plane B for 1 and the output of plane
C for 1, plane A will be presented as the plane recognized for the given radar return. In the
case that a tie exists, no decision will be made.
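
The voting rule can be sketched in a few lines. The function name is hypothetical, and the
sketch assumes that a pattern for which no output is turned on is represented by `None` and
simply does not vote.

```python
from collections import Counter


def vote(pattern_labels):
    """Return the plane named by the most patterns, or None on a tie or no votes."""
    counts = Counter(label for label in pattern_labels if label is not None)
    ranked = counts.most_common(2)
    if not ranked:
        return None                               # no output was turned on
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None                               # tie: no decision is made
    return ranked[0][0]


decision = vote(["A", "A", "B", "C"])
```

For the four-pattern example in the text the decision is plane A, while a 2-2 split between
two planes yields no decision.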

The  major  factor  in  getting  a  good  recognition  percentage  would  seem  to  be  the  choice 
of  data  encoding  algorithm  and  then  of  the  number  of  bins  and  the  amount  of  shifting  that 
is  done.  In  the  experimental  results  section,  we  explore  the  effect  of  the  choice  of  parameters 
on  the  accuracy  of  recognition. 

For Quickprop the other issue is how to choose the number of hidden layers and the number
of units in each. In general, convergence was not accomplished with many different
configurations. This indicates that choosing a network configuration that will work
reasonably well is a hard problem. Training for up to 2 weeks on a SUN Sparcstation was
attempted. This difficulty led us to develop the divide and conquer neural network, which
grows its own internal configuration based on the training data presented to it.

5  Experimental  Results 

The  results  reported  here  are  with  the  use  of  SC-net  for  training  and  testing,  unless  otherwise 
noted.  Despite  our  best  efforts,  we  could  only  get  Quickprop  to  converge  for  the  airplanes 
from  0-45  degrees  aspect  angle  by  creating  3  patterns  for  each  return  by  centering  the  largest 
magnitude  bin  and  shifting  left  and  right  once.  This  means  that  we  had  a  smaller  number 
of  training  patterns  than  in  the  experiments  described  below.  It  is  believed  that  the  same 
sort  of  problem  would  impede  any  algorithm  for  this  domain  that  requires  a  pre-specified
architecture  for  learning.  Hence,  we  do  not  show  results  from  any  algorithm  that  is  not 
self-configuring. 

In the following, we discuss results with the SRCRCS generated returns first, beginning
with the experiments on recognizing the planes from 0 to 90 degrees of aspect angle. Next,
a discussion of the limited results from the small amount of actual data is provided.
Finally, the recovery of the aspect angle of the recognized plane is discussed.

5.1  Recognition  in  the  0  to  90  degree  aspect  angle  range 

There are 7 planes, each with simulated radar returns at aspect angles from 0 to 90° in 1°
steps (91 angles per plane). Hence, a total of 637 returns exist. Of these, SC-net is trained
on the patterns generated from those 5° apart beginning at 0, or 0, 5, 10, 15, ..., 85, and
90 degrees (19 angles per plane). Hence, 133 returns are used for training, which leaves 504
returns unseen by the learning algorithm.
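
The counts above follow from simple arithmetic, shown here as a short check. The 1° spacing
of the simulated returns is inferred from the stated total of 637 returns for 7 planes; the
variable names are illustrative only.

```python
n_planes = 7
all_angles = range(0, 91)          # 0, 1, ..., 90 degrees (assumed 1 degree spacing)
train_angles = range(0, 91, 5)     # 0, 5, ..., 90 degrees

total_returns = n_planes * len(all_angles)
train_returns = n_planes * len(train_angles)
unseen_returns = total_returns - train_returns
```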

Table 2 shows results using encoding algorithm A.1. The values shown are the percentage
correct for the bin size and number of patterns or large bins centered. The bin size is in terms
of the number of points in a return that are in a bin. Where the division is uneven the last
points of a return will not be placed in a bin. TC stands for the total percentage correct
(1 - total misses/637) and UC stands for the percentage correct on the unseen returns
(1 - total misses/504), to provide an explicit idea of how well SC-net is able to generalize.
It is clear that performance increases with the number of patterns generated from a return.
For example, in the case that 6 patterns are generated with the use of the 6 bins with the
largest magnitudes, the average performance on the radar returns not in the training set is
89% versus 76% with 2 patterns and 69% with 1 pattern.
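
As a small worked check, TC and UC can be computed from a miss count. The sketch reads both
measures as one minus the total number of misses divided by 637 and by 504 respectively,
which assumes that every miss falls on an unseen return; this reading reproduces tabulated
pairs such as TC 0.74 with UC 0.67. The function name is hypothetical.

```python
TOTAL_RETURNS = 637
UNSEEN_RETURNS = 504


def tc_uc(total_misses):
    """Total and unseen fraction correct, assuming all misses are on unseen returns."""
    tc = 1 - total_misses / TOTAL_RETURNS
    uc = 1 - total_misses / UNSEEN_RETURNS
    return tc, uc


tc, uc = tc_uc(166)
```

With 166 misses this gives TC of about 0.74 and UC of about 0.67.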

Table 2: Results with algorithm A.1.

TC     UC     TC     UC     TC     UC
0.74   0.67   0.78   0.72   0.85   0.81
0.75   0.69   0.81   0.76   0.86   0.82
0.78   0.72   0.79   0.73   0.88   0.85
0.79   0.73   0.73   0.66   0.87   0.84
0.76   0.69   0.75   0.68   0.83   0.78
0.72   0.65   0.74   0.67   0.86   0.82
0.78   0.72   0.73   0.66   0.78   0.72
0.77   0.71   0.74   0.67   0.74   0.67
0.78   0.72   0.76   0.70   0.79   0.73
0.75   0.68   0.71   0.64   0.80   0.75
0.74   0.67   0.71   0.63   0.79   0.73

5 Patterns | 6 Patterns | 9 Patterns

Table 3 shows the results with encoding algorithm A.2. The results are presented in the
same format as before, but now only with 1, 3 and 5 patterns generated. It can be seen that
the results with 1 pattern are identical to those for A.1 with the exception of a bin size of 7,
for which the recognition difference is minor. This points out some of the regularity found in
the SRCRCS data. With 3 and 5 patterns A.2 is outperformed by A.1 by several percent. It
appears that focusing on the largest scatterers, which is what A.1 does, is a better strategy
than simply doing left-right shifts of the data.

Table  4  shows  the  results  using  encoding  algorithm  B.  It  provides  results  comparable  to 
A.l,  as  it  is  not  losing  any  data.  Again  the  best  performance  comes  with  the  use  of  more 
patterns  with  8,  9  and  10  being  the  best  number  of  patterns  for  this  algorithm.  It  is  the  case 
that  this  algorithm  works  better  with  a  smaller  bin  size.  A  smaller  bin  size  will  mean  that 
more  processing  is  necessary,  because  there  are  more  inputs  to  the  network. 

Table 3: Results with algorithm A.2.

Bin Size   1 Pattern      3 Patterns     5 Patterns
           TC     UC      TC     UC      TC     UC
3
4
5          0.74   0.67    0.82   0.77    0.82   0.77
6          0.75   0.69    0.81   0.77    0.86   0.82
7          0.75   0.69    0.83   0.79    0.87   0.84
8          0.79   0.73    0.84   0.79    0.85   0.81
9          0.76   0.69    0.80   0.74    0.83   0.79
10         0.72   0.65    0.82   0.77    0.84   0.80
11         0.78   0.72    0.80   0.74    0.84   0.79
12         0.77   0.71    0.73   0.66    0.81   0.76
13         0.78   0.72    0.82   0.77    0.86   0.82
14         0.75   0.68    0.80   0.75    0.82   0.77
15         0.74   0.67    0.76   0.70    0.82   0.77

Table 4: Results with algorithm B.

Bin Size   3 Patterns     5 Patterns     8 Patterns     9 Patterns
           TC     UC      TC     UC      TC     UC      TC     UC
3          0.82   0.77    0.92   0.90    0.93   0.92    0.94   0.93
4          0.83   0.78    0.90   0.87    0.93   0.91    0.94   0.93
5          0.76   0.70    0.89   0.86    0.90   0.87    0.91   0.89
6          0.77   0.71    0.83   0.79    0.87   0.84    0.86   0.83
7          0.75   0.68    0.83   0.78
8          0.77   0.71    0.83   0.78
9          0.79   0.74    0.81   0.76

A modification to algorithm B in which the number of bins centered to create training and
testing patterns (k in algorithm B) is based on a threshold has also been tried. In this scheme
all bins with a magnitude (which is the average value of the returns captured by it) above the
threshold T are used as centerpoints for the different patterns generated from one return. Now
each return may generate a different number of patterns. The higher T is, the fewer patterns
will be generated. Experiments were tried with 0.10 ≤ T ≤ 0.30, varying T in steps of 0.01.
Table 5 shows a set of representative experimental points with a bin size of 5. The number
of patterns increases as T decreases. The number of misses decreases as T decreases,
although T_i < T_j does not guarantee fewer misses with the use of the patterns generated by
T_i. The trend is downward, though. The table clearly indicates that generating more patterns
based on the highest peaks in the return provides better recognition performance.
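
The threshold variant only changes how the centerpoints are chosen. A minimal sketch,
assuming bin magnitudes already normalized to [0, 1] and using a hypothetical function name:

```python
def centers_above(bins, threshold):
    """Indices of all bins whose magnitude exceeds the threshold T."""
    return [i for i, magnitude in enumerate(bins) if magnitude > threshold]


bins = [0.05, 0.22, 0.12, 0.31]
counts = {t: len(centers_above(bins, t)) for t in (0.30, 0.20, 0.10)}
```

Each selected index would then be centered as in algorithm B, so lowering T can only keep or
increase the number of patterns generated from a return.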

5.1.1  Learning  time  and  other  algorithms 

The learning time was an average of about 5 hours on a SUN-3 workstation with SC-net
for the cases where multiple patterns from one return were used. There were 1197 training
patterns (133 returns with 9 patterns each) for the case of 9 patterns per return, which
appeared to be the best case for learning using encoding algorithm A.1 or B. This caused
SC-net to recruit 1197 cells for the recognition network.
Table 5: Results with algorithm B and varying threshold.

T      Pattern count   Misses
.30    2621            75
.25    3133            73
.20    3775            53
.16    4461            49
.10    6009            38

To provide a perspective on this work, we used Cascade Correlation with encoding algorithm
B and a bin size of 4. It also grows its own structure, but is somewhat more of a conventional
connectionist system. It recruited 86 hidden units, ran for 24 hours on a Sparcstation 1+,
had a residual error of 2.6168 and an accuracy of 77% on the entire set of patterns (with a
threshold of 0.6). Again, the plane with the largest vote after all patterns generated by
algorithm B for a return are presented is the plane classified. It mostly confused the Lear jet
model with other planes, primarily the T38. Several higher thresholds were tried to force the
algorithm to have less residual error in the training, but performance did not improve.

The  divide  and  conquer  network  (DCN)  approach  has  been  tried  on  the  problem  in  which 
the  radar  data  is  not  encoded  at  all.  Training  is  done  at  5°  intervals  and  testing  is  done  on 
the  7  planes  at  all  of  the  other  aspect  angles.  DCN  provided  an  accuracy  of  71.2%  vs.  74.2% 
for  SC-net  and  65.9%  for  Cascade  Correlation. 

5.2  Results  from  actual  data 

The ARTII data consists of 812 return points. The 2 planes in the study are likely significantly
shorter than the range. Hence, alignment of the return to the connectionist network is crucial.
The elevation angle of the planes is near 0. The aspect angles are within a 3° arc of 180°,
essentially a view of the planes receding or looking at the tail of the planes. There are
significantly more returns in a smaller aspect angle range with this data. In a five degree
range, we would use a maximum of two returns per plane for training. Here, we have been
even more restrictive and used only 2 returns (one of each type of plane) for training with
the remaining 19 used for testing. With such a small data set no definitive conclusions can
be drawn, but the results of generally getting 80% or greater recognition are encouraging.

Table 6: Results from actual radar data.

Bin Size   Algorithm A.2.1   Algorithm A.2.2
25         .95               .81
20         .90               .86
15         .86               .86

Variations of algorithm A.2 were used to encode the returns into patterns. Let us call
the variations A.2.1 and A.2.2, respectively. They both involve centering the highest bin or
making that bin the center bin in the feature vector to be presented for learning. A.2.1 then
implements algorithm A.2 with j=1, or 1 left and 1 right shift, to create 3 train/test patterns.
A.2.2 implements A.2 with j=2, or 2 left and 2 right shifts, to create 5 train/test patterns.
It is necessary to center the returns by the highest bin to locate the plane within the original
radar return. The plane may be in the leftmost, rightmost or center portion of the actual
return with noise elsewhere and we, of course, have no control over the original location.

Table 6 shows the results of 3 experiments using the same two returns for training. The
accuracy is calculated over all 21 returns. The number of misses can be calculated by taking
the nearest integer value of the number of returns times one minus the accuracy value. Three
different bin sizes are used. Algorithm A.2.1 does better with larger bins, while A.2.2 does
better with smaller bins.
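
The miss counts implied by the accuracies in Table 6 can be recovered with the rule just
stated. This is a small worked check with a hypothetical function name, not part of the
original experiments.

```python
def misses_from_accuracy(accuracy, n_examples=21):
    """Nearest integer to n_examples * (1 - accuracy)."""
    return round(n_examples * (1 - accuracy))


table6_misses = {acc: misses_from_accuracy(acc) for acc in (0.95, 0.90, 0.86, 0.81)}
```

An accuracy of .95 over 21 examples corresponds to a single miss, and .81 to four misses.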
Assume the following K=9 diagnoses have been made:

(1) Plane F4 at 5 degrees
(2) Plane F4 at 10 degrees
(3) Plane F4 at 5 degrees
(4) Plane F4 at 5 degrees
(5) Plane F16 at 20 degrees
(6) Plane F18 at 5 degrees
(7) Plane F4 at 15 degrees
(8) Plane F4 at 0 degrees
(9) Plane F18 at 10 degrees

Plane F4 wins the vote (6/9) and the predicted or recovered aspect
angle is:

(5+10+5+5+15+0)/6 = 6.67

Figure 6: An example of calculating an aspect angle from a test return.

5.3  The  recovery  of  aspect  angle 

In this section work on recovering the aspect angle of a recognized plane is reported. Since
the recovered aspect angle of a misrecognized plane is not important, the misrecognized
planes are ignored. For aspect angle recovery 91 output cells are used corresponding to each
of the 7 planes available from the SRCRCS simulator at the 13 aspect angles trained upon
(0, 5, 10, ..., 85, 90°). This makes the resultant network more complicated, but is not a major
problem since SC-net networks are sparsely connected. Training and testing are done with
data encoding algorithm B, a bin size of 5 and k=9 (9 generated patterns) and with a variable
number of centered patterns based on thresholds of .10 and .11 as described earlier.

The recovered or predicted aspect angle is calculated by first determining to which plane
the k generated patterns will be assigned. Then the n ≤ k patterns that voted for that plane
each have an aspect angle associated with them. The aspect angles are summed and divided
by n to provide the recovered aspect angle. An example is shown in Figure 6.
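
The calculation can be sketched as a vote followed by an average. The code below reuses the
nine diagnoses of Figure 6; the function name and the (plane, angle) tuple layout are
hypothetical, and a tie in the vote is treated as no decision, as in the recognition step.

```python
from collections import Counter


def recover_aspect_angle(diagnoses):
    """diagnoses: list of (plane, trained_angle) pairs, one per generated pattern.

    Returns (winning_plane, mean angle of the patterns that voted for it),
    or None when there are no diagnoses or the vote is tied.
    """
    counts = Counter(plane for plane, _ in diagnoses)
    ranked = counts.most_common(2)
    if not ranked:
        return None
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None
    winner = ranked[0][0]
    angles = [angle for plane, angle in diagnoses if plane == winner]
    return winner, sum(angles) / len(angles)


figure6 = [("F4", 5), ("F4", 10), ("F4", 5), ("F4", 5), ("F16", 20),
           ("F18", 5), ("F4", 15), ("F4", 0), ("F18", 10)]
plane, angle = recover_aspect_angle(figure6)
```

For the Figure 6 data the winner is F4 with six votes, and the recovered angle is 40/6, or
about 6.67 degrees.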

Table  7  shows  the  results  as  measured  by  four  quantities: 

1.  CAAED  -  The  combined  average  angle  error  difference.  It  is  calculated  by  summing  the 
differences  between  the  known  and  recovered  aspect  angles  for  all  planes  at  all  aspect 
angles and dividing by the number of total examples (637) minus the incorrectly classified
or missed examples.

2. TAAED - The test average angle error difference. It is calculated as the sum of the
difference  between  the  known  and  recovered  aspect  angles  on  the  test  set  only,  divided 
by  the  number  of  test  examples  (504)  minus  the  incorrectly  classified  or  missed  examples. 

3.  TNNTA  -  The  total  number  of  nearest  trained  angles.  This  measures  how  close  the 
recovered  angle  is  to  the  trained  angle  nearest  the  known  aspect  angle.  For  example, 
aspect  angles  at  0,  1  and  2  degrees  would  be  nearest  0°  and  aspect  angles  at  3,  4  and 
5 degrees would be nearest 5°. Hence a recovered aspect angle for a plane at 1 degree
known aspect angle would have to be 0, 1 or 2 degrees to increase this count. It is a
binary count of the number of test examples that have recovered aspect angles closest to
the  train  angle  nearest  the  known  aspect  angle.  It  can  be  a  harsh  and  imprecise  measure. 
For  example  a  recovered  angle  at  12  degrees  is  counted  as  incorrect  for  an  actual  aspect 
angle  of  13  degrees  for  a  given  plane. 

4.  NTAR  -  The  nearest  trained  angle  ratio.  TNNTA  over  the  correctly  classified  planes  in 
the  test  set. 
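
A minimal sketch of how these quantities could be computed is given below. It assumes trained angles spaced 5 degrees apart, takes the "difference" between known and recovered angles to be an absolute difference, and expects only the correctly classified returns as input. The function names and the sample pairs are hypothetical.

```python
import math

TRAINED_STEP = 5  # assumed spacing of the trained aspect angles, in degrees


def nearest_trained_angle(angle):
    """Round an angle to the nearest multiple of TRAINED_STEP (ties round up)."""
    return TRAINED_STEP * math.floor(angle / TRAINED_STEP + 0.5)


def angle_metrics(pairs):
    """pairs: (known_angle, recovered_angle) for correctly classified returns.

    Returns (average angle error, nearest-trained-angle count, ratio).
    """
    total_error = sum(abs(known - recovered) for known, recovered in pairs)
    hits = sum(nearest_trained_angle(known) == nearest_trained_angle(recovered)
               for known, recovered in pairs)
    n = len(pairs)
    return total_error / n, hits, hits / n


# Hypothetical returns: the (13, 12) pair is the "harsh" case from the text,
# since 13 is nearest 15 degrees while 12 is nearest 10 degrees.
avg_error, tnnta, ntar = angle_metrics([(13, 12), (1, 2), (20, 20)])
print(avg_error, tnnta, ntar)
```

Applied to the test set only, the first value corresponds to TAAED, the second to TNNTA and the third to NTAR.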

The  total  number  of  patterns  generated,  bin  size  and  misses  are  also  shown  in  Table  7.  The 
average  error  in  aspect  angle  over  all  correctly  classified  returns  is  less  than  2  degrees  and 
just  over  2  degrees  when  only  the  test  set  of  examples  is  considered.  Hence,  it  appears  that 
the classification process does relate the returns to the aspect angle and a prediction within
5°  seems  a  reasonable  possibility. 

6  Summary 

The  data  from  real  radar  returns  is  very  limited  in  this  study.  While  it  is  too  little  in  number 
to  draw  conclusions  from,  the  results  appear  to  support  our  findings  with  the  SRCRCS  radar 
data.  That  is,  with  proper  data  encoding  it  is  possible  to  recognize  airplanes  from  radar 
returns using a connectionist model.

Table 7: Results on aspect angle recovery.

Bin Size   Misses   Pattern count   CAAED   TAAED   TNNTA   NTAR
5          36       5733            1.55    2.16    292     .63
5          38       6009            1.52    2.12    292     .63
5          50       5687            1.64    2.37    284     .63

The connectionist model needs to be capable of determining its own structure. The
proper structure to enable a backpropagation network to converge was difficult to determine
and convergence was slow. On the other hand, SC-net is instance-based and performs well.
Cascade Correlation grows its own structure and performs reasonably well, but is correct in its
classification about 15% less often than SC-net. Cascade Correlation was not tried with the use of
fuzzy  values  for  ranges  in  the  input  as  done  in  SC-net.  A  similar  encoding  of  its  inputs  might 
improve  its  performance. 

Feature extraction for this problem is not obvious, in the sense that it is unclear what
the best features are. It is clear that the method of data encoding to generate the features
for train and test patterns is very important. Methods that take into account the highest
peaks in the return appear best suited, such as our A.1 and B algorithms. The use of binning
to  group  points  in  the  return  seems  to  be  an  effective  method  to  reduce  the  number  of  input 
features  for  a  learning  algorithm.  In  fact  grouping  the  return  points  into  bins  has  a  positive 
effect  on  recognition  provided  the  bins  are  not  made  too  large.  As  the  number  of  patterns 
generated  goes  up,  smaller  bin  sizes  appear  necessary  for  good  discrimination  among  the 
radar  returns. 

When a plane is correctly recognized, our results suggest that it is possible to estimate
the aspect angle of the plane fairly reliably. Our method requires the use of outputs for each
plane at each aspect angle trained upon, which could cause problems for some connectionist
networks that rely on a more fully connected structure than SC-net does.

It should be noted that in all of our experiments we used a ratio of train patterns to test
patterns less than 1 to 3.5. Many machine learning studies use a ratio of 2 to 1 (train vs.
test) [24]. Hence, the generalization properties of the network from the data are quite good.
The recognition system we described should increase its performance with more training
examples.  Increasing  the  training  examples  will,  for  each  new  example,  generate  several  (up 
to  9  in  this  study)  new  patterns.  This  will  complicate  the  learning  process  by  making  it 
longer  and  require  a  larger  network  in  the  case  of  SC-net.  However,  learning  time  for  a  truly 
accurate  system  is  not  a  major  issue  as  learning  is  done  just  once.  More  training  patterns  will 
not  slow  the  actual  recognition  process  which  is  primarily  bounded  in  time  by  the  number 
of patterns presented to the network to classify a return. The training set was kept small
since performance was good, and showing even better performance on generated data would not
guarantee the same performance on a large amount of data from real radar returns.

What  does  this  study  mean  for  the  recognition  of  real  planes  from  real  radar  returns?  It 
appears  possible  to  get  high  recognition  rates  from  radar  returns  provided  the  radar  is  good, 
atmospheric  conditions  do  not  disrupt  the  relative  magnitude  of  the  points  in  the  return  to  a 
large  degree,  the  encoding  scheme  is  chosen  well  and  changes  in  elevation  angle  do  not  have 
a  tremendous  effect  on  recognition.  Changes  in  elevation  angle  have  only  been  looked  at  on 
a  couple  of  planes  due  to  problems  getting  the  data  from  WPAFB.  The  experiments  we  tried 
indicated  that  training  patterns  at  different  elevations  for  each  plane  will  need  to  be  included 
in  the  train  set. 

The  recognition  method  described  here  needs  to  be  tried  on  a  significant  set  of  real  radar 
returns  from  real  planes.  Unfortunately,  we  do  not  have  access  to  such  data.  The  learning 
algorithm can be any that grows its own architecture. Its choice will have a strong effect on
accuracy measurements, of course. SC-net has worked quite well in this study and should
work well for real data also. Potential problems lie with the number of training patterns
needed. This will depend upon the number of returns used and the data encoding scheme.
As  this  number  gets  large,  most  current  learning  schemes  will  become  slow  and  may  have 
difficulty  processing  the  large  amount  of  data  effectively.  The  research  reported  here  provides 
a  promising  guideline  for  future  work  in  the  recognition  of  airplanes  from  radar  returns. 


References 

[1] Ash, T., Dynamic Node Creation in Backpropagation Networks, Tech. Report ICS 8901,
Institute for Cognitive Science, UCSD, La Jolla, Ca.

[2] R.G. Atkins, R.T. Shin, J.A. Kong, A Neural Net Method for High Range Resolution Target
Classification, Report from Dept. of EE and CS and Research Lab. of Electronics, MIT,
Cambridge, MA., 1988.

[3] Cherkassky, V., Lee, Y. and Lari-Najifi, Self-organizing network for regression: efficient
implementation and comparative evaluation, Proc. IJCNN, Seattle, Wa., V.1, pp. 79-84, 1991.

[4] Cherkassky, V. and Lari-Najifi, Informative data representation for neural network-based
diagnostic systems, Proc. 2nd Government Neural Network Applications Workshop, Huntsville,
AL, 1991.

[5] S.E. Fahlman, "Faster-Learning Variations on Back-Propagation: An Empirical Study",
Proceedings of 1988 Connectionist Summer School, pp. 38-51, 1988.

[6] Fahlman, S. and Lebiere, C., The Cascade Correlation Learning Architecture, Carnegie Mellon,
Computer Science TR, 1990.

[7] Farhout, N.H. and Babri, H., Cognitive Networks for Automated Target Recognition and
Autonomous System Applications, Second Government Neural Networks Applications Workshop,
Huntsville, Al., 1991.

[8] R.P. Gorman and T.J. Sejnowski, "Analysis of Hidden Units in a Layered Network Trained to
Classify Sonar Targets", Neural Networks, Vol. 1, pp. 75-89, 1988.

[9] L.O. Hall and S.G. Romaniuk, "A Hybrid, Connectionist, Symbolic Learning System",
AAAI-90, Boston, Ma., August, pp. 783-788, 1990.

[10] L.O. Hall, S. Romaniuk, J. Leonard, and R. Mitchell, The Use of Connectionist Networks to
Recognize Airplanes from Radar Returns, Artificial Neural Networks in Engineering '91, St.
Louis, Mo., pp. 921-926, Nov. 1991.

[11] Hirose, Y., Yamashita, K. and Hijiya, S., Back-Propagation Algorithm Which Varies the
Number of Hidden Units, Neural Networks, V. 4, pp. 61-66, 1991.

[12] R. Hecht-Nielsen, "Kolmogorov's Mapping Neural Network Existence Theorem", Proceedings
of IEEE First International Conference on Neural Networks, San Diego, CA., June, 1987.


[13] R. Hecht-Nielsen, Neurocomputing, Addison-Wesley, Reading, Ma., 1990.

[14] A Study of Machine Learning Approaches for some Classification Knowledge Bases, 4th Florida
AI Research Symposium 1991, April, Cocoa Beach, pp. 125-129.

[15] Y. Lamdan and H.J. Wolfson, Geometric Hashing: A General and Efficient Model-Based
Recognition Scheme, Robotics Research Lab, Courant Institute of Mathematical Sciences, NYU, New
York, 1989.

[16] R.S. Michalski, J.G. Carbonell, T.M. Mitchell, Machine Learning: An Artificial Intelligence
Approach, Palo Alto, Ca., Tioga Publishing, 1983.

[17] G.C. Oden, "A Symbolic Superstate for Connectionist Models", IEEE ICNN, San Diego,
California, July, 1988.

[18] S.G. Romaniuk, Extracting Knowledge from a Hybrid, Symbolic, Connectionist Network, Ph.D.
Dissertation, Department of Computer Science and Engineering, University of South Florida,
Tampa, 1991.

[19] Romaniuk, S.G. and Hall, L.O., Fuzzy Quantifiers and Quantifying Operators in a
Connectionist Expert System Development Tool, International Joint Conference on Neural Networks,
Singapore, November, pp. 134-139, 1991.

[20] S.G. Romaniuk and L.O. Hall, Divide and Conquer Neural Networks, Technical Report
ISL-1219-91, Dept. of CSE, USF, Tampa, Fl.

[21] D.E. Rumelhart and J.L. McClelland, Parallel Distributed Processing, Vol. 1, The MIT
Press, Cambridge, Ma., 1986.

[22] J.W. Shavlik, R.J. Mooney, and G.G. Towell, Symbolic and Neural Learning Algorithms:
An Experimental Comparison, Computer Sciences Technical Report #857, University of
Wisconsin-Madison, Madison, Wisc., 1989.

[23] Wasserman, P.D., Neural Computing, Theory and Practice, Van Nostrand Reinhold, N.Y.,
1990.

[24] S.M. Weiss and Kulikowski, C., Computer Systems That Learn, Morgan Kaufmann, San
Mateo, CA., 1990.


A  Divide  and  Conquer  Networks 


Error reduction on individual weights is done with the use of the Quickprop [5] weight
modification algorithm,

$$\Delta w(t) = \frac{S(t)}{S(t-1) - S(t)} \, \Delta w(t-1)$$

where $S(t)$ and $S(t-1)$ are the current and previous values of $\partial E/\partial w$. This provides faster
convergence, though standard gradient descent, $\Delta w(t) = -\epsilon \, \partial E/\partial w(t) + \alpha \, \Delta w(t-1)$, or the
delta rule $w(t) = w(t-1) + \text{LearningRate} \cdot (\text{expected output} - \text{actual output}) \cdot x(t)$ (where
$x(t)$ is the output associated with the incoming link) provide comparable results [13].
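
As a rough illustration of the Quickprop update, the sketch below applies the rule Δw(t) = S(t) / (S(t-1) - S(t)) · Δw(t-1) to a single weight on a one-dimensional quadratic error. The error function, the starting weights and the zero-denominator guard are assumptions made for the example. Fahlman's published algorithm includes additional safeguards, such as a bound on the step growth, that are not shown here.

```python
def quickprop_step(slope, prev_slope, prev_delta):
    """Weight change given the current and previous slopes dE/dw and the last step."""
    denominator = prev_slope - slope
    if denominator == 0.0:
        return 0.0  # hypothetical guard: no curvature information, take no step
    return slope / denominator * prev_delta


def error_slope(w):
    """dE/dw for the illustrative error E(w) = (w - 3)^2."""
    return 2.0 * (w - 3.0)


w_prev, w = -1.0, 0.0  # two successive weight values (assumed)
delta = quickprop_step(error_slope(w), error_slope(w_prev), w - w_prev)
w_new = w + delta
print(w_new)  # 3.0
```

Because the rule fits a parabola through the last two slope measurements, it reaches the minimum of an exactly quadratic error in a single step, which is the intuition behind its faster convergence.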

Weights  on  links  from  the  other  cells  connected  to  a  newly  introduced  hidden  cell  are  the 
only  ones  trained.  This  makes  the  training  always  of  single  layer  subsets  or  perceptrons.  All 
cells  in  the  network  (except  the  inputs)  are  viewed  as  complex  feature  detectors. 

The Divide & Conquer Network (DCN) learning algorithm consists of two major phases:
the divide phase and the conquer phase. The divide and conquer phases are done individually
for each output of the problem space on which training takes place. The object is to allow
each cell in the network to act as a feature detector and correctly classify some of the examples
in the training set. Training is done on the connections to one cell at each step. Hence, the
training  is  essentially  on  a  many  input,  one  output  cell  or  unit  at  each  stage.  This  is  in  the 
spirit  of  perceptrons,  though  the  sigmoid  activation  function  and  Quickprop  training  rule  are 
used  instead  of  the  actual  perceptron  formulae.  The  process  is  the  same  for  each  output 
and  training  can  be  done  in  parallel.  Intermediate  cells  are  not  shared  among  outputs  in  the 
version  of  the  learning  algorithm  presented  in  this  section.  There  is  a  method  of  sharing  the 
intermediate  cells  during  learning. 

Figure 7 contains a summary description of the algorithm. It begins in the conquer
phase, discussed later. The divide phase works as follows. An intermediate cell will
have input connections from all the inputs, the bias and any intermediate cells on preceding
layers. All the examples are presented and the weights on the input connections are trained
using Quickprop error reduction. After a user-set number of epochs (we have found 300 to
500 to be adequate), or when error reduction stops, training halts with some of the examples
being correctly classified. These examples are removed from the training set. The reduced
training set is augmented by adding neighboring examples. An example E' is defined as a
neighbor of example E if the mean square difference between their inputs (or attribute values)
is minimal. Of course E' may have a different output value than that of E. The training set
resulting from the addition of the neighbors is used to continue learning. The reason neighbors
are added is that they are close in structure to other examples in the training set and may
cause errors later if they are not considered during training.
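
The neighbor-augmentation step can be sketched as follows. This is an illustration only: it assumes each remaining example receives exactly one neighbor, drawn from the full set of training inputs, and all function names are hypothetical.

```python
def mean_square_difference(a, b):
    """Mean of the squared differences between two equal-length input vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def nearest_neighbor(index, inputs):
    """Index of the example whose inputs are closest to inputs[index]."""
    candidates = (j for j in range(len(inputs)) if j != index)
    return min(candidates,
               key=lambda j: mean_square_difference(inputs[index], inputs[j]))


def augment_with_neighbors(remaining, inputs):
    """Add the nearest neighbor of every remaining example to the reduced set."""
    augmented = set(remaining)
    for i in remaining:
        augmented.add(nearest_neighbor(i, inputs))
    return sorted(augmented)


# Four hypothetical two-attribute examples; example 0 is the only one left unlearned.
inputs = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0]]
print(augment_with_neighbors([0], inputs))  # [0, 1]
```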

At the point at which a new cell on the current layer has been added to the network,
but has not been able to reduce the training example set, the conquer phase is entered. A
new cell will be added at the next level of the network and have input connections from
all of the existing cells for the output currently under consideration. Training is done and
either the cell can correctly classify all of the examples from the original training set, in which
case the output has been learned, or the divide phase is entered and processing continues in
the manner outlined here. Termination will occur upon an output being correctly learned
by  the  introduction  of  a  cell  in  the  conquer  phase  or  after  a  set  limit  of  cells  has  been  used 
up  in  constructing  the  network.  The  constructed  network  differs  in  several  ways  from  the 
usual  backpropagation  network.  The  inputs  are  directly  connected  to  every  cell  not  on  the 
input  layer.  Hidden  cells  are  only  connected  to  hidden  cells  in  other  layers  or  one  output  cell. 
That  is,  the  hidden  structures  for  output  cells  are  decoupled.  This  allows  the  hidden  cells 
to  act  more  as  feature  detectors  than  in  a  classic  backpropagation  network.  In  the  spirit  of 
Cascade  Correlation  networks,  a  pool  of  hidden  cells  of  arbitrary  size  may  be  trained  when 
a  new  hidden  cell  is  being  added  to  the  network.  Each  starts  with  different  random  weights 
on  its  input  links  and  the  cell  that  provides  the  best  results  is  chosen  to  be  added  to  the 
network.  During  training  only  the  weights  on  the  incoming  links  to  a  new  cell  (or  pool  of  cells) 
are modified.

Define:

E : Set of all training examples.
C_i : Binary concept i to be learned.
N_i : Neighbor of example E_i.
E_d : Remaining space (after division).

For all output concepts C_i do
    While E has not been learned do
        Let E_d = E
        Connect all cells currently used for C_i to a new cell.
        Train the new connections and remove all correctly classified examples from the
        training set.
        Add neighbors N_i to E_d.
        While (E_d not empty AND Improvement) do
            - Create feature F_div.
            - Train feature F_div on E_d.
            - Remove examples from E_d that F_div correctly classifies. If none were correctly
              classified, remove a random example and train F_div on the reduced example set.
              Continue removing examples and training until at least one example is correctly
              recognized.
            - To the remaining examples E_di in E_d add also their neighbors N_i.
        end while.
    end while.
    The last cell added has correctly learned concept C_i as its output.

Figure 7: The DCN Algorithm

ABSTRACT

Nd-doped  fiber  laser  amplifiers  for  incorporation  into  a  laser  radar  test  system  were 
developed  around  two  Nd-doped  fibers  provided  by  Rutgers  University  and  Brown 
University.  Both  fibers  exhibited  a  fluorescent  band  peaking  at  or  near  1064  nm.  A  gain 
of  10  dB  was  measured  in  an  amplifier  incorporating  the  double-clad  fiber  provided  by 
Rutgers University. A second amplifier incorporating a wavelength division multiplexer
(WDM) and a pigtailed laser diode has been constructed.

FIBER LASER PREAMPLIFIER FOR LASER RADAR DETECTORS


I.  INTRODUCTION 


The  Electro-optics  Division  of  the  Avionics  Laboratory  at  Wright-Patterson  Air  Force 
Base  is  involved  in  the  development  of  laser  radar  systems.  One  possible  method  of 
increasing  the  detection  of  a  returning  laser  radar  signal  might  be  to  use  a  fiber  optical 
laser  preamplifier  immediately  before  the  photo  detector.  This  type  of  amplifier  shows 
promise  as  a  means  of  increasing  the  signal  to  noise  ratio  of  a  laser  radar  system 
detector. 

During  the  summer  of  1990  I  was  a  USAF  Summer  Faculty  Research  Associate  working 
with the Electro-optics Division of the Avionics Laboratory at Wright-Patterson AFB.
The  objective  of  that  research  was  to  study  the  feasibility  of  developing  a  fiber  laser 
amplifier  for  use  in  their  laser  radar  test  system.  Since  the  results  of  that  study  indicated 
that  such  an  amplifier  could  be  useful,  a  Research  Initiation  Proposal  for  the 
development  of  this  type  amplifier  was  funded  by  the  Air  Force  Office  of  Scientific 
Research.

II. OBJECTIVES OF THE RESEARCH EFFORT

A Nd-doped silica fiber is a four-level laser medium. For a four-level amplifier the
unsaturated single pass gain factor can be given as

$$\gamma = \frac{\sigma \tau_f}{h\nu_p} \, \frac{P_{abs}}{A_{eff}} \qquad (1)$$

where the gain is given by

$$G = \frac{I_{out}}{I_{in}} = e^{\gamma}. \qquad (2)$$

$I_{out}/I_{in}$ is the ratio of amplified signal to the input signal, $\sigma$ is the stimulated emission
cross section for the amplified wave, $\tau_f$ is the fluorescent lifetime of the upper lasing
level, $h\nu_p$ is the energy per photon of the pump light, and $P_{abs}/A_{eff}$ is the effective intensity
of the absorbed pump light in the fiber. [1]

For a Nd-doped silica fiber, assuming $\sigma = 3 \times 10^{-20}$ cm², $\tau_f = 4.5 \times 10^{-4}$ s, $h\nu_p$(800 nm)
$= 2.48 \times 10^{-19}$ J and $A_{eff} \approx 5 \times 10^{-7}$ cm² gives a slope efficiency of $\gamma/P_{abs} = 0.11$/mW or
$G(\mathrm{dB})/P_{abs} = 0.47$ dB/mW. Po, et al. reported a slope efficiency of 0.437 dB/mW in
a Nd-doped silica fiber when pumped at 800 nm. [2]
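
The quoted slope efficiencies can be checked numerically. The short sketch below assumes the gain exponent has the small-signal four-level form γ = σ τ_f P / (hν_p A), with P the absorbed pump power and A the effective pump area, and uses the parameter values stated above. The variable names are chosen for the example only.

```python
import math

sigma = 3e-20      # stimulated emission cross section, cm^2
tau_f = 4.5e-4     # fluorescent lifetime of the upper lasing level, s
h_nu_p = 2.48e-19  # energy per pump photon at 800 nm, J
area = 5e-7        # effective pump area, cm^2

# Gain exponent per watt of absorbed pump power, then per milliwatt.
gamma_per_watt = sigma * tau_f / (h_nu_p * area)
gamma_per_mw = gamma_per_watt * 1e-3

# G(dB) = 10 log10(e^gamma) = 10 log10(e) * gamma, about 4.34 * gamma.
db_per_mw = 10 * math.log10(math.e) * gamma_per_mw

print(f"{gamma_per_mw:.2f} per mW, {db_per_mw:.2f} dB/mW")  # 0.11 per mW, 0.47 dB/mW
```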


The  objective  of  this  research  was  to  construct  a  fiber  laser  amplifier  using  optimum 
amplification  parameters  and  minimum  noise  to  be  integrated  into  a  test  laser  radar 
system at the Electro-optics Division of the Avionics Laboratory at Wright-Patterson


Chip Parameters

Number of transistors on the chip = 100,000
Fan-out of a typical gate on the chip = 3
Total capacitance at an output pin = 50 pF
Fraction of on-chip gates that switch during a clock cycle = 0.3
Density of defects on the chip = 5/cm²

NMOS Transistor Parameters

Minimum feature size = 1 μm
Input capacitance of feature-size transistor = 2 pF
Resistance of feature-size transistor in the depletion mode = 25,000 Ω
Resistance of feature-size transistor in the enhancement mode = 15,000 Ω
Power supply voltage = 2 volts
Ratio of optimum-size to feature-size transistors = 1

Interconnection Parameters

Number of interconnection layers = 3
Widths of on-chip interconnects = 2 μm
Pitches of on-chip interconnects = 4 μm
Thicknesses of on-chip interconnects = 0.2 μm
Thickness of the dielectric material = 0.4 μm
Utilization coefficient of interconnections = 0.33


4.1  SIMULATION  RESULTS  USING  NCHIPSIM 

NCHIPSIM  has  been  used  to  predict  the  dependence  of  the  chip  performance 
on  its  minimum  feature  size  as  well  as  on  its  integration  level.  For  example,  the 
dependences  of  the  chip  size,  maximum  clock  frequency,  power  consumption, 
computational capacity, power efficiency and functional throughput rate of a 100,000-transistor
silicon NMOS single-chip microprocessor on its minimum feature size in
the range 0.1 to 2.5 μm are shown in figures 2 to 7, respectively, and the dependences
of each of these performance indicators for a 1-μm silicon NMOS microprocessor chip
on  its  integration  level  in  the  range  100  to  100,000,000  transistors  are  shown  in 
figures  8  to  13,  respectively. 

NCHIPSIM  has  also  been  used  to  compare  the  simulation  results  for  several 
actual  silicon  NMOS  single-chip  microprocessors  to  the  known  values  of  the 
performance indicators [4-7]. For example, such a comparison for the 1.5 μm NMOS
microprocessor chip called HP Focus (1982) is shown in Table 1, that for the 3 μm
NMOS microprocessor chip called Stanford MIPS (1984) is shown in Table 2, that for
the NMOS microprocessor chip called BERKELEY RISC I (1981) is shown in Table
3 and that for the NMOS microprocessor chip called MICRO-VAX 32720 (1984) is
shown in Table 4. Tables 1-4 show that the agreement between the actual and the
simulated results is indeed very good.

the typical values of the transistor related parameters for a given minimum feature
size  such  as  its  input  capacitance,  output  gate  resistance,  power  supply  voltage  and 
the  ratio  of  optimum-size  to  feature-size  transistors  but  permits  the  user  to  change 
any  of  these  values  for  the  chip  being  simulated.  Next,  for  the  silicon-based  chips, 
the  user  can  choose  a  dielectric  material  out  of  silicon  dioxide,  polyimide,  alumina 
and  epoxy  glass  or  select  one  of  his  own  and  define  its  dielectric  constant 
interactively.  Next,  the  user  can  choose  an  interconnection  material  out  of 
aluminum,  copper,  silver,  tungsten  and  molybdenum  or  select  any  other  material  and 
define  its  electrical  resistivity  interactively.  Next,  the  program  lists  the  typical 
values  of  the  other  interconnection  parameters  for  the  technology  feature  size  defined 
earlier  but  allows  the  user  to  modify  any  of  these  values  interactively.  These 
parameters  include  the  width,  pitch  and  thickness  of  the  on-chip  interconnects, 
thickness  of  the  dielectric  material,  number  of  interconnection  layers  and  the 
utilization  coefficient  of  the  interconnections.  Then,  for  the  chip  defined  above,  the 
program  calculates  and  displays  the  values  of  its  performance  indicators  such  as  its 
size,  the  maximum  clock  frequency,  power  consumption,  computational  capacity, 
power efficiency, functional throughput rate and the fabrication yield. Next, the
program  allows  the  user  to  scale  the  reference  chip  defined  above  by  a  certain  scaling 
factor  and  determine  the  performance  indicators  for  the  scaled  chip.  Finally,  the  user 
is  able  to  change  the  number  of  transistors  or  logic  gates  on  the  reference  chip  and 
study  the  dependence  of  the  various  chip  performance  indicators  on  its  integration 
level.  In  addition  to  displaying  the  simulation  results  on  the  screen,  the  program  also 
writes the various chip parameters and the corresponding results on an output file
for later reference. As an example, the flow chart of the program NCHIPSIM for the
silicon  NMOS  chips  is  shown  in  figure  1.  The  program  descriptions  and  the  results 
for  each  of  the  four  technologies  and  the  details  specific  to  a  particular  technology  are 
described  in  the  following  sections. 

4.  THE  PROGRAM  "NCHIPSIM"  FOR  SILICON  NMOS  CHIPS 

A  microcomputer  program  called  "NCHIPSIM"  has  been  developed  which  can 
be  used  to  predict  the  performance  indicators  of  a  microprocessor  or  a  gate  array  chip 
based  on  the  silicon  NMOS  technology  as  well  as  to  study  the  dependence  of  these 
indicators  on  the  feature  size  of  the  transistors  and  the  integration  level  of  the  chip. 
The  default  values  of  the  various  chip,  transistor  and  the  interconnection  parameters 
used  in  NCHIPSIM  are  as  follows: 

Chip-Type Dependent Parameters for a Microprocessor Chip


Interconnect-length  Rent’s  constant  =  0.4 

Pin-count  Rent’s  constant  =  0.45 

Pin-count  multiplication  constant  =  0.82 

Logic  depth  =  22 

Chip-Type Dependent Parameters for a Gate-Array Chip

Interconnect-length  Rent’s  constant  =  0.5 

Pin-count  Rent’s  constant  =  0.5 

Pin-count  multiplication  constant  =  1.9 

Logic  depth  =  30 
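
To illustrate how the pin-count constants might be used, the sketch below applies the usual form of Rent's rule, pins = K · gates^β, with the pin-count multiplication constant as K and the pin-count Rent's constant as β. Whether NCHIPSIM evaluates the rule in exactly this way is an assumption; the function name and the gate count of 10,000 are hypothetical.

```python
# Default pin-count constants listed above for the two chip types.
RENT_CONSTANTS = {
    "microprocessor": {"K": 0.82, "beta": 0.45},
    "gate_array": {"K": 1.9, "beta": 0.5},
}


def estimated_pin_count(num_gates, chip_type):
    """Rent's rule estimate of the number of pins: K * num_gates ** beta."""
    constants = RENT_CONSTANTS[chip_type]
    return constants["K"] * num_gates ** constants["beta"]


for chip_type in RENT_CONSTANTS:
    pins = estimated_pin_count(10_000, chip_type)
    print(chip_type, round(pins))
```

For the same gate count, the gate-array constants give a much larger pin estimate than the microprocessor constants (roughly 190 versus 52 pins for 10,000 gates).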
various component delays such as the delay due to the output resistance of the
driving  gate  and  the  interconnection  capacitance,  the  delay  due  to  the  input 
capacitance  of  the  gate  at  the  next  state,  the  distributed-RC  delay  of  the 
interconnections  and  the  delay  due  to  the  resistance  of  the  interconnections  and  the 
input  capacitance  of  the  gates. 

e)  The  total  delay  suffered  by  an  input  signal  on  the  chip  was  determined  by  adding 
the  various  component  delays  such  as  the  delay  through  the  logic  gates  including  the 
latch  delay,  the  combinational  logic  delay  and  the  setup  time,  the  distributed-RC 
delay  of  an  interconnection  that  crosses  the  chip  halfway  diagonally  representing  the 
contribution  of  the  global  interconnection  delay,  and  the  contribution  of  the  speed-of- 
light  limit  depending  on  the  propagation  speed  of  the  electromagnetic  waves  on  the 
chip.  This  last  component  was  negligible  unless  the  chip  is  very  large  or  the  clock 
frequency  is  greater  than  1  GHz  as  is  generally  the  case  with  the  GaAs  chips.  The 
maximum  clock  frequency  of  the  chip  was  then  determined. 

f) The power consumption of the chip was calculated by adding the power
consumption in the logic gates and the dynamic power consumption at the I/O
buffers. This depends on the power supply voltage, fraction of the on-chip gates that
switch during a clock period, the total capacitance at an output pin and the number
of pins per chip as determined by using Rent's rule [2] or provided by the
designer. 

g)  The  computational  capacity  of  the  chip  is  a  measure  of  the  computational  power 
of  the  entire  chip.  It  was  determined  by  using  the  number  of  gates  on  the  chip  and 
the maximum clock frequency of the chip determined above.

h) The power efficiency of the chip is a measure of the computational power per unit
power  consumption  of  the  chip.  It  was  obtained  by  using  the  values  of  the 
computational  capacity  and  the  power  consumption  determined  above. 

i)  The  functional  throughput  rate  of  a  chip  is  a  measure  of  the  computational  power 
per  unit  area  of  the  chip.  It  was  obtained  by  using  the  values  of  the  computational 
capacity  and  the  chip  area  determined  above. 

j)  For  a  known  value  of  the  density  of  defects  on  the  chip,  the  fabrication  yield  of  the 
chip  was  obtained  by  using  the  Price  law  [3]. 
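
The relationships among the indicators in steps g) to i) can be sketched as below. The text states only which quantities each indicator is derived from, so the product form used for the computational capacity (gates times clock frequency) is an assumption, and the function name and input values are hypothetical. The yield calculation of step j) is not included.

```python
def performance_indicators(num_gates, max_clock_hz, power_w, area_cm2):
    """Derive the chip-level indicators from gate count, clock, power and area."""
    # Assumed form: computational capacity = number of gates * maximum clock frequency.
    capacity = num_gates * max_clock_hz
    return {
        "computational_capacity": capacity,                 # gate-Hz
        "power_efficiency": capacity / power_w,             # capacity per watt
        "functional_throughput_rate": capacity / area_cm2,  # capacity per cm^2
    }


# Hypothetical chip: 25,000 gates, 20 MHz clock, 2 W, 0.5 cm^2.
result = performance_indicators(num_gates=25_000, max_clock_hz=20e6,
                                power_w=2.0, area_cm2=0.5)
for name, value in result.items():
    print(f"{name}: {value:.3g}")
```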

3.  SOFTWARE  DEVELOPMENT  -  GENERAL  APPROACH 

For  each  of  the  technologies  mentioned  above  and  using  the  steps  outlined  in 
the  above  section,  flexible  and  user-oriented  programs  suitable  for  the  simulation  of 
the  various  performance  indicators  for  the  integrated  circuit  chips  with  known 
technology parameters were written in FORTRAN-77 and run on IBM and
compatible personal computers. Each program was made extremely user-friendly so
that  it  allows  the  user  to  choose  the  chip  type,  i.e.,  a  microprocessor,  a  gate  array  or 
a  high-speed  computer  chip  (for  CMOS  and  GaAs  HBT  technologies).  Next, 
depending  on  the  chip  type,  the  program  lists  the  constants,  found  empirically, 
required  for  determining  the  average  interconnect  length  and  the  number  of  pins  on 
the  chip.  Next,  the  program  lists,  but  allows  the  user  to  change  interactively,  the 
values  of  several  chip  parameters  such  as  the  number  of  transistors,  fan-out  of  a 
typical  gate,  capacitance  at  the  output  pin,  density  of  fabrication  defects  and  the 
probability  of  an  on-chip  gate  to  switch  during  a  clock  period.  Next,  the  program  lists 
achieve higher packing densities, shorter propagation delays and smaller chips.
Further, because of the much higher mobility of electrons in gallium arsenide
(GaAs), it has emerged as a preferred substrate for the development of very high
speed integrated circuits (VHSIC). In fact, compared to the existing silicon NMOS
and CMOS technologies, GaAs MESFET, HEMT, HBT and HFET technologies have
proven to be much superior for the development of VHSIC chips.

The Device Technology Section of the Electronics Technology Laboratory at
Wright-Patterson Air Force Base is interested in the development of very high speed
integrated circuits based on GaAs technology. In particular, they are concerned
about the parasitic effects associated with the devices and interconnections that
adversely affect the performance of a small-geometry, high-speed, high-density
integrated circuit. They are also interested in predicting the performance of future
very high speed integrated circuits with submicrometer feature sizes. This can be
accomplished by executing the following steps: a) development of a
computer-efficient model of the integrated circuit performance indicators; b)
development of a user-oriented computer program suitable for the chip performance
simulation; and c) application of the computer simulator to determine the
performance indicators for a chip with known values of the technology parameters.

The  objective  of  this  project  was  to  develop  the  computer-efficient  algorithms 
and  the  related  user-friendly  computer  software  modules  suitable  for  the  simulation 
of  the  performance  indicators  of  the  submicrometer-geometry  high-speed  high-density 
integrated  circuit  chips  based  on  the  silicon  NMOS,  silicon  CMOS,  silicon  BJT  and 
GaAs  HBT  technologies.  In  addition  to  predicting  the  various  performance  indicators 
of an integrated circuit chip such as its maximum clock frequency, power
consumption,  computational  capacity,  power  efficiency,  fabrication  yield,  functional 
throughput  rate  and  its  size,  the  programs  are  also  suitable  for  predicting  the 
performance  of  future  circuits  with  scaled  feature  sizes  and/or  increased  integration 
levels.  In  order  to  validate  the  algorithms  and  the  related  simulators,  the  simulation 
results  have  been  compared  with  the  data  available  for  the  existing  single-chip 
microprocessors  and  gate  arrays  based  on  the  NMOS  and  CMOS  technologies. 

2. ALGORITHM DEVELOPMENT - GENERAL APPROACH

a) The average interconnection length on the chip, in units of the average logic gate
dimension, was determined from the given number of transistors or gates on the chip
and the Rent's constant by using the equations derived by Donath [1].

b)  For  an  interconnection-capacity  limited  VLSI  chip  as  is  the  case  with  almost  all 
logic  intensive  chips,  the  average  logic  gate  dimension  on  the  chip  was  obtained  by 
setting  the  interconnection  available  per  gate  equal  to  that  required  per  gate.  This 
depends  on  the  interconnection  pitch  (equal  to  the  sum  of  the  interconnection  width 
and  the  spacing  between  the  interconnects),  the  number  of  interconnection  layers  on 
the  chip  and  the  utilization  coefficient  of  the  on-chip  interconnections  to  account  for 
the  interconnect  lines  not  utilized  or  those  used  for  power  and  clock  distribution. 

c)  The  value  of  the  average  interconnection  length  on  the  chip  was  evaluated  and  the 
values  of  the  chip  size  and  the  chip  area  were  determined. 

d)  The  average  delay  time  in  a  gate  on  the  chip  (defined  as  the  time  taken  by  the 
output  signal  to  reach  50%  of  its  steady  state  value)  was  obtained  by  adding  the 
COMPUTER SIMULATION OF SMALL-GEOMETRY HIGH-SPEED HIGH-DENSITY
INTEGRATED  CIRCUIT  PERFORMANCE  INDICATORS 

ABSTRACT 

For integrated circuit chips based on the silicon NMOS, silicon CMOS, silicon
bipolar and GaAs heterojunction bipolar technologies, computer-efficient models of the
various chip performance indicators have been developed, together with user-friendly
computer programs called "NCHIPSIM," "CCHIPSIM," "BCHIPSIM" and "GCHIPSIM,"
suitable for the simulation of the chip performance indicators for a microprocessor
or a gate-array chip. In addition to predicting the various chip
performance indicators such as its maximum clock frequency, power consumption,
computational capacity, power efficiency, fabrication yield, functional throughput rate
and the size of the chip with the given technology parameters, the programs have
also been used to simulate the dependences of the various chip performance indicators
on the technology feature sizes in the range 0.1-5.0 microns and the chip integration
levels in the range 100-1,000,000 transistors or logic gates on the chip. The results
for the NMOS and CMOS chips have been compared with, and found to be in excellent
agreement with, those known for several single-chip microprocessors based on these
technologies.

1.  INTRODUCTION 

Continuous advances in integrated circuit technology have resulted in more
complex chips integrating millions of devices and interconnections. In recent
years, it has become necessary to use interconnections in two or more levels to
[FIGURE 7: Functional Throughput Rate (GHz/mm^2) versus Minimum Feature Size (micron)]

[Figures: Maximum Clock Frequency (MHz), Power Consumption (W), Power Efficiency
(GHz/mW), Computational Capacity (GHz) and Functional Throughput Rate (GHz/mm^2)
versus Number of Transistors, 100 to 1.000E+08]


TABLE 1: Actual Data and Simulation Results for
NMOS Microprocessor Chip HP FOCUS (1982)

TABLE 3: Actual Data and Simulation Results for
NMOS Microprocessor Chip BERKELEY RISC I (1981)

Performance Indicator    Actual Value    Using NCHIPSIM

TABLE 4: Actual Data and Simulation Results for
NMOS Microprocessor Chip MICRO-VAX 32720 (1984)

Performance Indicator    Actual Value    Using NCHIPSIM
5. THE PROGRAM "CCHIPSIM" FOR SILICON CMOS CHIPS

A microcomputer program called "CCHIPSIM" has been developed which can
be  used  to  predict  the  performance  indicators  of  a  microprocessor,  a  gate  array  or  a 
high-speed  computer  chip  based  on  the  silicon  CMOS  technology  as  well  as  to  study 
the  dependence  of  these  indicators  on  the  feature  size  of  the  transistors  and  the 
integration  level  of  the  chip.  The  default  values  of  the  various  chip,  transistor  and 
the  interconnection  parameters  used  in  CCHIPSIM  are  as  follows: 

Chip-Type  Dependent  Parameters  for  a  Microprocessor  Chip 


Interconnect-length  Rent’s  constant  =  0.4 

Pin-count  Rent’s  constant  =  0.45 

Pin-count  multiplication  constant  =  0.82 

Logic  depth  =  22 

Chip-Type  Dependent  Parameters  for  a  Gate-Array  Chip 

Interconnect-length  Rent’s  constant  =  0.5 

Pin-count  Rent’s  constant  =  0.5 

Pin-count  multiplication  constant  =  1.9 

Logic  depth  =  30 

Chip-Type  Dependent  Parameters  for  a  High-Speed  Computer  Chip 
Interconnect-length  Rent’s  constant  =  0.6 

Pin-count  Rent’s  constant  =  0.63 

Pin-count  multiplication  constant  =  1.4 

Logic  depth  =  10 
Chip Parameters

Number of transistors on the chip = 120,000
Fan-out of a typical gate on the chip = 3
Number of logic gates on the chip = 20,000
Total capacitance at an output pin = 50 pF
Fraction of on-chip gates that switch during a clock cycle = 0.3
Density of defects on the chip = 5/cm²

Transistor Parameters

Minimum feature size = 1 µm
Input capacitance of feature size NMOS transistor = 2 fF
Power supply voltage = 2.5 volts
Ratio of optimum-size to feature-size transistors = 1
Ratio of W/L of PMOS to W/L of NMOS = 2
Output resistance of a typical gate = 30,000 Ω
Time delay in a typical gate = 7,914 ps

Interconnection Parameters

Number of interconnection layers = 3
Widths of on-chip interconnects = 2 µm
Pitches of on-chip interconnects = 4 µm
Thicknesses of on-chip interconnects = 0.25 µm
Thickness of the dielectric material = 0.2 µm
Utilization coefficient of interconnections = 0.4
Interconnection resistance = 56 Ω/mm
Interconnection capacitance = 0.48 pF/mm
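
As a plausibility check on the default interconnection resistance, the resistance per unit length of a rectangular line can be estimated as resistivity divided by cross-sectional area. The sketch below is an illustration only: the resistivity of 2.8e-8 ohm-m (a value commonly quoted for aluminium) is an assumption, since the report does not state which value the programs use, and the function name is hypothetical. With the default width of 2 µm and thickness of 0.25 µm it gives about 56 ohm/mm.

```python
# Assumed resistivity of aluminium interconnect (not stated in the report).
ALUMINIUM_RESISTIVITY_OHM_M = 2.8e-8


def resistance_ohm_per_mm(width_um, thickness_um,
                          resistivity_ohm_m=ALUMINIUM_RESISTIVITY_OHM_M):
    """Resistance per millimetre of a line with rectangular cross-section."""
    area_m2 = (width_um * 1e-6) * (thickness_um * 1e-6)
    ohm_per_m = resistivity_ohm_m / area_m2
    return ohm_per_m / 1000.0  # 1 m = 1000 mm


if __name__ == "__main__":
    # Default CCHIPSIM geometry: 2 um wide, 0.25 um thick.
    print(f"{resistance_ohm_per_mm(2.0, 0.25):.1f} ohm/mm")
```

Doubling the thickness halves the result, so the same assumed resistivity with a 0.5 µm thick line gives about 28 ohm/mm.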


5.1  SIMULATION  RESULTS  USING  CCHIPSIM 

CCHIPSIM has been used to predict the dependence of the chip performance
on its minimum feature size as well as on its integration level. For example, the
dependences of the chip size, maximum clock frequency, power consumption,
computational capacity, power efficiency and functional throughput rate of a
20,000-logic-gate silicon CMOS single-chip microprocessor on its minimum feature
size in the range 0.1 to 5 µm are shown in figures 14 to 19, respectively; and the
dependences of each of these performance indicators for a 1 µm silicon CMOS
microprocessor chip on its integration level in the range 100 to 1,000,000 logic gates
are shown in figures 20 to 25, respectively.

CCHIPSIM has also been used to compare the simulation results for several
actual silicon CMOS single-chip microprocessors to the known values of the
performance indicators [4-7]. For example, such a comparison for the 2 µm CMOS
microprocessor chip called Fairchild Clipper (1985) is shown in Table 5 and that for
the 1.5 µm CMOS microprocessor chip called Intel 80386 (1985) is shown in Table 6.
Tables 5 and 6 show that the agreement between the actual and the simulated
results is again very good.
[Figures 14 to 19: chip performance indicators, including Chip Dimension (mm) and
Power Consumption (mW), versus Minimum Feature Size (micron)]

[Figures 20 to 25: chip performance indicators, including Chip Dimension (mm),
Maximum Clock Frequency (MHz), Power Consumption (mW) and Functional Throughput
(GHz/mm2), versus Number of Logic Gates]


TABLE 5: Actual Data and Simulation Results for
Si CMOS Microprocessor Chip Fairchild Clipper (1985)

TABLE 6: Actual Data and Simulation Results for
Si CMOS Microprocessor Chip Intel 80386 (1985)

Performance Indicator    Actual Value    Using CCHIPSIM
6. THE PROGRAM "BCHIPSIM" FOR SILICON BIPOLAR CHIPS

A  microcomputer  program  called  "BCHIPSIM"  has  been  developed  which  can 
be  used  to  predict  the  performance  indicators  of  a  microprocessor,  gate  array  or  a 
high  speed  computer  chip  based  on  the  silicon  bipolar  technology  as  well  as  to  study 
the  dependence  of  these  indicators  on  the  feature  size  of  the  transistors  and  the 
integration  level  of  the  chip.  The  default  values  of  the  various  chip,  gate  and  the 
interconnection  parameters  used  in  BCHIPSIM  are  as  follows: 

Chip-Type Dependent Parameters for a Microprocessor Chip


Interconnect-length  Rent’s  constant  =  0.4 

Pin-count  Rent’s  constant  =  0.45 

Pin-count  multiplication  constant  =  0.82 

Logic  depth  =  22 

Chip-Type Dependent Parameters for a Gate-Array Chip

Interconnect-length  Rent’s  constant  =  0.5 

Pin-count  Rent’s  constant  =  0.5 

Pin-count  multiplication  constant  =  1.9 

Logic  depth  =  30 

Chip-Type  Dependent  Parameters  for  a  High-Speed  Computer  Chip 
Interconnect-length  Rent’s  constant  =  0.6 

Pin-count  Rent’s  constant  =  0.63 
Pin-count multiplication constant = 1.4
Logic depth = 10

Chip Parameters

Number of logic gates on the chip = 10,000
Fan-out of a typical gate on the chip = 3
Total current at an output buffer = 1 mA
Fraction of on-chip gates that switch during a clock cycle = 0.3
Density of defects on the chip = 5/cm²
Number of pins on the chip = 190

Gate Parameters

Minimum feature size = 1 µm
Base-emitter capacitance of feature size transistor = 20 fF
Logic swing = 0.5 volt
Gate current source = 0.25 mA
Gate power supply voltage = 4 volts
Ratio of optimum-size to feature-size transistors = 1
Collector-base capacitance = 5 fF
Collector-substrate capacitance = 6 fF
Transistor base resistance = 500 Ω
Transistor base delay = 3 ps
Time delay in a typical gate = 264 ps

Interconnection Parameters

Number of interconnection layers = 4
Widths of on-chip interconnects = 2 µm
Pitches of on-chip interconnects = 4 µm
Thicknesses of on-chip interconnects = 0.5 µm
Thickness of the dielectric material = 0.5 µm
Utilization coefficient of interconnections = 0.4
Interconnection resistance = 28 Ω/mm
Interconnection capacitance = 0.27 pF/mm


6.1  SIMULATION  RESULTS  USING  BCHIPSIM 

BCHIPSIM has been used to predict the dependence of the chip performance
on its minimum feature size as well as on its integration level. For example, the
dependences of the chip size, maximum clock frequency, power consumption,
computational capacity, power efficiency and functional throughput rate of a
10,000-logic-gate silicon bipolar single-chip gate array on its minimum feature size
in the range 0.1 to 5 µm are shown in figures 26 to 31, respectively; and the
dependences of each of these performance indicators for a 1 µm silicon bipolar
gate-array chip on its integration level in the range 100 to 1,000,000 logic gates are
shown in figures 32 to 37, respectively.
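
The pin-count constants listed with the default parameters can be illustrated with Rent's rule. The sketch below assumes the usual power-law form, pins = K * (number of gates)^beta, where K is the pin-count multiplication constant and beta is the pin-count Rent's constant; the function and dictionary names are hypothetical. With the gate-array constants and 10,000 logic gates it gives 190 pins, which agrees with the default number of pins listed for BCHIPSIM.

```python
# (multiplication constant K, Rent's constant beta) per chip type,
# taken from the default parameter lists in this report.
PIN_COUNT_CONSTANTS = {
    "microprocessor": (0.82, 0.45),
    "gate array": (1.9, 0.5),
    "high-speed computer": (1.4, 0.63),
}


def rent_pin_count(num_gates, chip_type):
    """Number of pins from Rent's rule, assumed as K * num_gates ** beta."""
    k, beta = PIN_COUNT_CONSTANTS[chip_type]
    return k * num_gates ** beta


if __name__ == "__main__":
    # Sweep the integration level over the range used in the simulations.
    for gates in (100, 1_000, 10_000, 100_000, 1_000_000):
        pins = rent_pin_count(gates, "gate array")
        print(f"{gates:>9,d} gates -> {pins:7.0f} pins")
```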
[Figures 26 to 31: chip performance indicators, including Chip Dimension (mm), Power
Consumption (W) and Computational Capacity (GHz) (Thousands), versus Minimum
Feature Size (micron)]

[Figures 32 to 37: chip performance indicators, including Chip Dimension (mm),
Maximum Clock Frequency (MHz), Power Efficiency (GHz/W), Computational Capacity
(GHz) (Thousands) and Functional Throughput (GHz/mm2), versus Number of Logic Gates]


7.  THE  PROGRAM  "GCHIPSIM"  FOR  GaAs  HETEROJUNCTION  BIPOLAR  CHIPS 


A  microcomputer  program  called  "GCHIPSIM"  has  been  developed  which  can 
be  used  to  predict  the  performance  indicators  of  a  conventional  microprocessor,  a 
conventional  gate  array  or  a  high-speed  computer  chip  based  on  the  GaAs  HBT 
technology  as  well  as  to  study  the  dependence  of  these  indicators  on  the  feature  size 
of  the  transistors  and  the  integration  level  of  the  chip.  In  this  program,  the 
interconnection  capacitances  were  calculated  by  the  method  of  moments  in 
conjunction  with  a  Green's  function  appropriate  for  the  geometry  of  the 
interconnections  [8].  The  capacitance  results  included  the  fringing  fields  as  well  as 
the  shielding  effects  of  neighboring  interconnections  for  high-density  chips.  The 
default  values  of  the  various  chip,  gate  and  the  interconnection  parameters  used  in 
GCHIPSIM  are  given  below.  Some  of  these  were  derived  from  the  values  provided 
by  Dr.  Ross  Mactaggard  of  Honeywell,  Inc.  [9]. 

Chip-Type  Dependent  Parameters  for  a  Microprocessor  Chip 


Interconnect-length  Rent’s  constant  =  0.4 

Pin-count  Rent’s  constant  =  0.45 

Pin-count  multiplication  constant  =  0.82 

Logic  depth  =  22 

Chip-Type Dependent Parameters for a Gate-Array Chip

Interconnect-length  Rent’s  constant  =  0.5 

Pin-count  Rent’s  constant  =  0.5 

Pin-count  multiplication  constant  =  1.9 

Logic  depth 
Chip-Type Dependent Parameters for a High-Speed Computer Chip

Interconnect-length Rent's constant = 0.6
Pin-count Rent's constant = 0.63
Pin-count multiplication constant = 1.4
Logic depth = 10

Chip Parameters

Number of logic gates on the chip = 10,000
Fan-out of a typical gate on the chip = 3
Total current at an output buffer = 1 mA
Fraction of on-chip gates that switch during a clock cycle = 0.3
Density of defects on the chip = 5/cm²

Gate Parameters

Minimum feature size = 1 µm
Total base-emitter capacitance of feature size transistor = 5 fF
Base resistance of feature-size transistor = 300 Ω
Gate power supply = 4 volts
Ratio of optimum-size to feature-size transistors = 1
Gate current source = 0.25 mA
Logic swing = 0.5 volt
Transistor base delay = 0.5 ps
Collector-base capacitance = 4 fF
Collector-substrate capacitance = 0
Time delay in a typical gate = 224 ps

Interconnection Parameters

Number of interconnection layers = 4
Widths of on-chip interconnects = 2 µm
Pitches of on-chip interconnects = 4 µm
Thicknesses of on-chip interconnects = 0.5 µm
Thickness of the GaAs substrate = 350 µm
Utilization coefficient of interconnections = 0.33
Interconnection resistance = 28 Ω/mm
Interconnection capacitance = 0.32 pF/mm

7.1 SIMULATION RESULTS USING GCHIPSIM

GCHIPSIM has been used to predict the dependence of the chip performance
on its minimum feature size as well as on its integration level. For example, the
dependences of the chip size, maximum clock frequency, power consumption,
computational capacity, power efficiency and functional throughput rate of a
10,000-gate GaAs HBT high-speed computer chip on its minimum feature size in the
range 0.1 to 5.0 µm are shown in figures 38 to 43, respectively; and the dependences
of each of these performance indicators for a 1 µm GaAs HBT high-speed computer
chip on its integration level in the range 100 to 1,000,000 logic gates are shown in
figures 44 to 49, respectively.
[Figures 38 to 43: chip performance indicators, including Chip Dimension (mm),
Maximum Clock Frequency (GHz), Power Consumption (W) and Computational Capacity
(GHz) (Thousands), versus Minimum Feature Size (micron)]

[Figures 44 to 49: Chip Dimension (mm), Maximum Clock Frequency (MHz), Power
Consumption (W), Power Efficiency (GHz/W), Computational Capacity (GHz) (Thousands)
and Functional Throughput (GHz/mm2) versus Number of Logic Gates]


7.2  PROGRAM  LISTINGS 

The  listings  of  the  programs  NCHIPSIM,  CCHIPSIM,  BCHIPSIM  and 
GCHIPSIM  can  be  made  available  on  request. 
ABSTRACT

An acoustic emission (AE) test platform was designed and tested after
consideration of several alternatives. This test platform is intended for use in
developing more accurate AE source location approaches useful for smart structures
applications. The major design objectives of the test platform included: a) sensors
with a flat frequency response placed in the far field of crack-related AE sources;
b) a uniform-thickness plate section containing a crack, with sufficient lateral
dimensions to remove early edge reflections of the acoustic waves; and c) the ability
to test the specimen in an existing acoustically quiet screw-driven test machine
with hydraulic grips to minimize extraneous AE sources. In addition, a literature
search identified two potential techniques to improve AE source location accuracy.
Preliminary data for real AE were obtained with both techniques for the first time.
Typical waveforms are shown for a small array sensor and for an in-line sensor
configuration for crack-opening AE events in aluminum. Recommendations for
extension of this limited study are presented.
I. INTRODUCTION


The Fatigue, Fracture and Reliability Group of the Air Force Wright Laboratory
is interested in the concept of smart structures. This terminology refers to the ability
to sense, nearly continuously, conditions and states such as loads and structural
integrity in aircraft structures [1]. The inputs of the sensors into properly
programmed computers lead directly to corrective actions, thereby bypassing scheduled
nondestructive evaluation. Acoustic emission (AE) technology provides one approach
to the network of sensors and computers necessary for smart structures.

AE technology is concerned with the monitoring of stress waves which are
generated by rapid local energy releases in solid materials. The damage-generated (or
damage-related) AE signals are created by a variety of sources such as bond fracture,
impacts, phase changes, crack growth, friction, inclusion-particle fracture, and other
microdamage sources.

AE  technology  has  a  number  of  useful  and  potentially  unique  features 
applicable  to  smart  structure  applications.  First,  AE  is  a  passive  technique  in  that 
stress  waves  generated  at  damage  sources  throughout  a  structure  propagate  to  the 
sensor  locations.  This  feature  limits  the  density  of  sensors  required  to  monitor  a 
structure.  Second,  AE  is  a  whole  volume  damage  sensing  technique  no  less  sensitive 
to  interior  damage  compared  to  surface  damage.  Third,  AE  data  from  multiple  sensors 
can  be  used  to  locate  the  spatial  position  of  the  damage  sources  within  the  structure. 
Fourth,  the  damage  source  mechanism  may  be  potentially  identified  by  use  of 
advanced  statistical  concepts  with  waveform  feature  vectors.  Fifth,  AE  is  essentially 
a  real  time  damage  sensing  technique.  Sixth,  AE  is  a  microscopic  characterization 
technique  with  high  sensitivity.  And  seventh,  AE  is  at  its  most  fundamental  level  a 
measure  of  the  damage  response  of  a  structure  to  stress. 
Since an important goal of the smart structures concept is to bypass the usual
nondestructive  inspections,  the  application  of  AE  would  make  its  most  valuable 
contribution  if  the  extent  of  damage  in  a  structure  could  be  continuously  defined  by 
using  the  source  location  capabilities  of  AE  technology.  For  example,  with  a  metal 
structure  in  a  fatigue  environment,  crack  lengths  could  be  measured  by  continuously 
following  the  position  of  the  crack  tips  with  AE  source  location  data.  Such  an 
application  of  AE  requires  an  extension  of  the  technology  as  usually  practiced  in 
structures.  Typical  AE  approaches  are  based  on  locating  the  vicinity  of  a  crack  or 
damage  region  rather  than  actual  location  of  crack  tips  or  the  advancing  perimeter  of 
a  damaged  region.  Thus,  the  smart  structures  application  requires  a  significant 
increase  in  the  accuracy  of  AE  source  location  techniques  in  structures. 

Examination  of  the  accuracy  of  AE  source  location  with  conventional 
commercial  AE  instrumentation  demonstrated  that  such  an  approach  does  not  locate 
crack  tips  [2].  Further,  it  was  shown  in  the  same  reference  that  an  approach  using 
waveform  recorders  with  commercial  resonant  sensors  results  in  only  a  small 
improvement  in  source  location  accuracy.  Based  upon  the  above  results,  it  can  be 
concluded  that  approaches  which  ignore  source  radiation  field  characteristics  and 
specimen  wave  propagation  features  will  not  be  capable  of  accurate  source  location. 

On the other hand, accurate location has been demonstrated for waveform
recorders by use of sensors with a nearly flat frequency response located very close
to the AE sources. The keys to success with flat sensors are: 1) the velocity of each
specific wave mode used to locate the source must be known; 2) the path from source to
sensor of these wave modes must be known; and 3) the same phase point within the
wave mode must be used. Consistent with such an approach, Scruby [3] and Johnson
and Carlson [4] quote accuracy values on the order of ±500 µm and ±100 µm,
respectively. These results were obtained using multiple sensors of the point contact
type with waveform recorders. The best results, those of Johnson and Carlson [7], were
obtained with a total of 8 sensors located on both surfaces of a plate nearly as close
to the crack as was physically possible. The other quoted results were for a relatively
thick compact fracture toughness specimen with sensors on the order of 3 cm from the
crack. Thus, in both cases, with sufficient source amplitude, the first arrival of the
direct path P wave (compressional wave) could be identified and used to obtain
relative in-phase arrival times.
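
The role of the three keys listed above can be illustrated with the simplest case of arrival-time location: a source on the straight line between two sensors, with a single wave mode of known velocity travelling the direct path to each. This is a generic textbook sketch, not the method of the cited references; the function names, the wave velocity and the sample times are hypothetical.

```python
def locate_between_two_sensors(spacing_m, velocity_m_s, t1_s, t2_s):
    """Distance of the source from sensor 1, measured toward sensor 2.

    Assumes the source lies on the line between the sensors, the same
    wave mode (known velocity, direct path) reaches both sensors, and
    t1_s and t2_s are read at the same phase point of that mode.
    """
    return 0.5 * (spacing_m + velocity_m_s * (t1_s - t2_s))


def timing_tolerance_s(location_tolerance_m, velocity_m_s):
    """Arrival-time-difference error that maps to a given location error."""
    return 2.0 * location_tolerance_m / velocity_m_s


if __name__ == "__main__":
    v = 5000.0  # m/s, hypothetical velocity of the chosen wave mode
    x = locate_between_two_sensors(0.5, v, t1_s=40e-6, t2_s=60e-6)
    print(f"source is {x:.3f} m from sensor 1")
    dt = timing_tolerance_s(100e-6, v)
    print(f"a 100 um tolerance needs timing good to {dt * 1e9:.0f} ns")
```

Under these assumptions a location tolerance of ±100 µm corresponds to an arrival-time-difference tolerance of tens of nanoseconds, which is why picking the same phase point on each waveform matters.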

Clearly,  a  smart  structures  AE  application  will  generally  have  sensors  at  greater 
distances  from  the  source  to  maintain  a  reasonable  sensor  density.  In  such  a  case  the 
wave  propagation  aspects  become  considerably  more  complicated  since  the  direct  path 
compressional  wave  has  insufficient  amplitude.  Assuming  a  range  of  plate  thicknesses 
of  0.125  in.  to  0.625  in.  for  sensor  spacings  of  8  to  20  in.  or  more  in  structural 
applications,  sensor  to  source  distances  are  some  13  to  160  times  the  plate  thickness. 
Thus,  the  AE  signals  will  be  dominated  by  appropriate  plate  modes  in  the  so-called  far 
field.  Since  no  literature  results  on  location  accuracy  of  real  AE  sources  in  plates  with 
sensors  in  the  far  field  could  be  found,  a  beginning  was  made  to  study  this  area.  The 
results  of  a  limited  Research  Initiation  Program  Mini-Grant  study  are  given  in  this 
report. 
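
The range of 13 to 160 plate thicknesses quoted above follows from dividing the sensor spacings by the plate thicknesses. The short sketch below reproduces that arithmetic; it takes the sensor spacing as the sensor to source distance, as the quoted figures imply, and the function name is hypothetical.

```python
def distance_to_thickness_range(thicknesses_in, spacings_in):
    """Smallest and largest ratio of sensor spacing to plate thickness."""
    ratios = [s / t for s in spacings_in for t in thicknesses_in]
    return min(ratios), max(ratios)


if __name__ == "__main__":
    low, high = distance_to_thickness_range(
        thicknesses_in=(0.125, 0.625),  # plate thickness range, inches
        spacings_in=(8.0, 20.0),        # sensor spacing range, inches
    )
    print(f"{low:.1f} to {high:.0f} plate thicknesses")
```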

II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 

This research had two objectives. The first objective was the design, development,
and qualification testing of an AE test platform which would be appropriate to study
source location accuracy of real AE in a plate at source to sensor distances appropriate
to smart structures AE applications. To meet this objective the transverse size of the
plate specimen was required to be such that source to sensor distances would put the
sensors in the far field of the source. Further, at the same time the plate size must
place the sensors sufficiently far from the plate edges or thickness change
regions so that a sufficient portion of the AE signal could be observed without
distortion due to reflections from these boundaries.

The  second  objective  was  to  obtain  initial  multi-channel  AE  waveforms  with 
relatively  flat  sensors  applied  to  the  test  platform  containing  a  sharpened  crack  under 
fatigue  and  overload  testing.  This  objective  involved  exploring  the  theoretical  and 
experimental  wave  propagation  literature  as  related  to  AE  signals  in  plates.  The 
purpose  of  the  literature  search  was  to  determine  the  best  potential  approaches  for 
sensor  types,  positions,  frequency  ranges  and  analysis  techniques  to  accurately 
determine  the  source  location  of  real  AE  crack-based  sources  monitored  in  the  far 
field.  Then  using  the  best  approach(es)  the  waveforms  obtained  would  be  examined 
to  determine  the  potential  usefulness  of  the  approach(es)  for  accurate  source  location. 

III.  AE  TEST  PLATFORM:  DESIGN  AND  PRELIMINARY  TESTING 

Since  one  of  the  primary  objectives  of  this  research  was  to  develop  an  AE  test 
platform  for  use  in  a  future  extensive  evaluation  of  various  approaches  to  accurate  AE 
source  location  using  AE  sensors  in  the  far  field,  a  paper  design  study  was  first  made 
to  identify  the  best  design  that  would  meet  AE  test  requirements  and  minimize  the 
cost  of  the  specimens.  The  key  constraints  in  this  design  study  were:  1)  a  center- 
cracked  plate  of  sufficient  lateral  dimensions  to  eliminate  early  acoustic  wave 
reflections  at  sensors  in  the  far  field;  2)  a  specimen  and  gripping  system  which  would 
generate  little  or  no  extraneous  AE;  3)  total  specimen  size  such  that  it  could  be  tested 
with existing hydraulic grips in the acoustically quiet screw-type Instron machine
available at Wright Laboratory; and 4) minimum cost to manufacture additional future
samples. 

Designs  considered  in  the  study  included:  1)  specimens  machined  out  of  solid 
plate  stock;  2)  specimens  with  adhesively  bonded  transition  tabs;  3)  specimens  with 
bolted  transition  tabs;  and  4)  specimens  with  welded  transition  tabs.  The  bolted-type 
specimens  which  have  been  successfully  used  for  non-AE  type  testing  were  eliminated 
due to the presence of extraneous noise sources at the bolt to hole interfaces as well
as  at  the  tab  to  plate  interfaces.  Welded  specimens  were  eliminated  due  to  the 
expected  difficulties  with  warping  and  the  expected  brittle  weld  material  which  could 
lead  to  extraneous  AE  as  well  as  tab  failures  during  fatigue  cycling  for  crack 
sharpening.  The  specimens  chosen  for  testing  were  the  specimen  machined  from  solid 
stock  and  the  specimen  with  adhesively  bonded  tab  transition  regions. 

Since  frictional  and  interface  extraneous  noise  sources  are  not  present  in  a  solid 
sample,  full-size  samples  with  two  different  central  plate  section  thicknesses  were 
designed  and  fabricated  from  2024-T351  aluminum.  Figure  1  shows  the  dimensions 
and  other  pertinent  information  on  this  specimen  with  unusual  geometry.  Because 
the  overall  axial  length  of  the  specimen  was  limited  by  the  test  machine  and  the 
essential  design  constraint  was  to  eliminate  early  acoustic  reflections,  a  specially 
designed  transition  region  between  the  3.75  in.  grip  width  and  the  34  in.  wide  plate 
region  was  not  used.  Such  a  transition  region  would  result  in  more  uniform  stresses 
in  the  plate  section  of  the  specimen,  but  at  a  penalty  of  a  significant  reduction  in  the 
length  of  the  plate  section  in  the  axial  direction.  This  condition  would  result  in  early 
acoustic  reflections.  Instead  the  design  used  the  minimum  length  transition  region 
sufficient  to  control  stress  concentrations  as  determined  from  reference  5.  The 
resultant  design  with  the  selected  location  for  the  small  array  transducer  (to  be 
discussed later) at a nominal 6 in. distance from one crack tip and a 30° angle from the
crack plane gives an acoustic wave time window of about 50 μs from the arrival of the
extensional mode wave at the plate wave extensional velocity [approximately 0.214
in/μs] until the arrival of the first reflection of this wave. The calculated window until
the arrival of the first reflection of the dominant flexural mode was about 120 μs
[based on a velocity of approximately 0.114 in/μs].
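
The quoted time windows can be checked with simple arithmetic. The following is a minimal sketch that assumes straight-ray propagation at the single velocities quoted above. It computes the direct arrival times at the nominal 6 in. distance and back-calculates the reflected path lengths implied by the 50 μs and 120 μs windows. The function names are illustrative, and the implied path lengths are not taken from figure 1.

```python
# Values quoted in the text (velocities in in/us, distance in inches).
V_EXT = 0.214    # extensional plate wave velocity
V_FLEX = 0.114   # velocity used for the dominant flexural mode
D_DIRECT = 6.0   # nominal crack-tip-to-array distance


def arrival_time(distance_in, velocity):
    """Travel time in microseconds along a straight-ray path."""
    return distance_in / velocity


def implied_reflection_path(window_us, velocity, direct_in=D_DIRECT):
    """Reflected path length (in.) whose arrival would close the window."""
    return velocity * (arrival_time(direct_in, velocity) + window_us)


t_ext = arrival_time(D_DIRECT, V_EXT)     # roughly 28 us
t_flex = arrival_time(D_DIRECT, V_FLEX)   # roughly 53 us
path_ext = implied_reflection_path(50.0, V_EXT)
path_flex = implied_reflection_path(120.0, V_FLEX)
print(f"direct arrivals: {t_ext:.1f} us, {t_flex:.1f} us")
print(f"implied reflected paths: {path_ext:.1f} in., {path_flex:.1f} in.")
```

Under these assumptions the extensional wave reaches the array after about 28 μs, so any feature later than roughly 78 μs after the source event may already contain reflected extensional energy.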

Since the adhesive tab design was unproven with respect to strength,
fatigue  life,  and  extraneous  AE  noise  generation,  a  scaled-down  sample  was  designed 
and  tested  to  develop  data  with  respect  to  these  unknown  aspects.  Figure  2  shows  the 
relevant  dimensions  of  this  0.125  in.  thick  sample.  This  subscale  design  maintained 
the  stress  concentrations  as  well  as  the  shear  stresses  in  the  adhesive  bonds  at  the 
same  levels  present  in  a  full-size  adhesively-bonded  sample.  A  special  room 
temperature  curable  epoxy  adhesive  which  does  not  contain  a  filler  that  would 
generate  extraneous  AE  was  obtained  from  Dexter/Hysol  (EA  9330.4).  This  adhesive 
has  excellent  strength  (4000  psi  tensile  lap  shear)  and  was  expected  to  have  good 
fatigue  properties  although  no  direct  fatigue  data  were  available. 

An  initial  evaluation  of  the  extraneous  noise  characteristics  of  this  sub-scale 
sample  was  carried  out  by  proof  testing  the  sample  to  25,000  lbs.  with  AE  monitoring. 
This  proof  test  was  carried  out  on  an  MTS  machine  with  hydraulic  grips  at  the 
University  of  Denver.  This  test  machine  is  modified  in  that  the  servo  valve  is  off- 
mounted  from  the  actuator  to  control  machine  noise.  An  additional  modification  was 
necessary  to  control  noise  from  the  cycling  of  the  pressure  pump  that  raises  the 
gripping  pressure  (above  the  3,000  psi  supply  pressure)  of  the  hydraulic  grips  with 
smooth  inserts.  Installation  of  needle  valves  in  the  lines  to  the  grips  allowed  the  grips 
to  be  pressurized  and  then  valved  off  during  the  tests.  This  approach  permitted 
shutting off the grip pressure pump for the duration of the test.

The AE system used in this test (Physical Acoustics 3000/3104, μ-30 sensors
bonded  to  the  sample,  100-300  kHz,  54  dB  threshold  with  60  dB  preamplifier  gain) 
indicated  hundreds  of  AE  events  starting  at  loads  of  only  a  few  hundred  pounds.  Since 
these  loads  were  well  below  those  required,  it  was  decided  not  to  make  a  full  size 
adhesive  tab  type  specimen.  This  specimen  was  later  notched  (with  a  water  jet)  and 
tested  in  fatigue  at  Wright  Laboratory.  The  adhesive  bond  failed  in  the  tab  region 
prior  to  the  central  starter  notch  growing  to  the  intended  size  of  1.00  in.  This  failure 
occurred after about 9,000 cycles under a nominal ΔK of 20 Ksi√in after reaching a
crack  length  of  0.720  in.  (initial  notch  size  was  0.500  in.).  It  was  thus  concluded  that 
the  currently  designed  adhesive  sample  also  did  not  have  sufficient  fatigue  life  for  the 
necessary  pre-cracking  procedure.  A  different  design  which  results  in  a  more  uniform 
distribution  of  shear  stresses  may  possess  sufficient  fatigue  life,  but  this  design  would 
not  likely  pass  the  extraneous  noise  requirement. 

Since  cost  was  also  a  consideration,  it  is  of  interest  to  note  that  the  solid  full- 
sized  samples  cost  about  $1200  each  for  materials  and  machining,  while  the  adhesively- 
bonded  subscale  specimen  cost  about  $500.  With  the  additional  cost  expected  for  a 
full-size  adhesive  specimen  it  does  not  seem  worth  the  risk  of  extra  extraneous  noise 
sources. 

After  machining  the  solid  samples,  each  was  strain  gaged  along  the  line  shown 
in  figure  1  and  then  loaded  in  tension.  The  purpose  of  this  preliminary  test  was  to 
allow  an  estimate  of  the  stress  intensity  factor  at  the  notch/crack  tips  during  the 
fatigue  pre-cracking  process.  By  comparing  the  measured  strain  levels  with  those  for 
a  plate  with  uniform  strain,  an  effective  width  of  the  samples  was  obtained.  This 
effective  width  was  used  to  calculate  the  approximate  stress  intensity  factor.  Table  1 
shows the pre-cracking history (for both samples) used to extend the initial notch to
the 2.00 in. crack size with an approximate maximum stress intensity factor of 10
Ksi√in, which is well below the KIC value for this material of about 35 Ksi√in.
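
The effective-width estimate described above can be sketched as follows. This is an illustration under stated assumptions, not the calculation actually used for table 1: the effective width is taken as the width of a uniformly strained plate that would show the measured strain at the same load, and the stress intensity factor uses the standard secant finite-width correction for a center-cracked plate. The load, strain, and modulus values are hypothetical placeholders.

```python
import math


def effective_width(load_lb, strain, thickness_in, modulus_psi):
    """Width of a uniformly strained plate giving the measured strain."""
    return load_lb / (modulus_psi * strain * thickness_in)


def k_center_crack(load_lb, width_in, thickness_in, crack_length_in):
    """K (psi*sqrt(in)) for a center crack of total length 2a, with the
    secant finite-width correction."""
    a = crack_length_in / 2.0
    sigma = load_lb / (width_in * thickness_in)
    secant = 1.0 / math.cos(math.pi * a / width_in)
    return sigma * math.sqrt(math.pi * a) * math.sqrt(secant)


# Hypothetical inputs, chosen only to exercise the functions.
LOAD_LB = 10_000.0
STRAIN = 400e-6          # assumed strain gage reading near the crack line
THICKNESS_IN = 0.125
MODULUS_PSI = 10.6e6     # assumed handbook-type value for 2024 aluminum

w_eff = effective_width(LOAD_LB, STRAIN, THICKNESS_IN, MODULUS_PSI)
k_max = k_center_crack(LOAD_LB, w_eff, THICKNESS_IN, crack_length_in=2.00)
print(f"effective width = {w_eff:.1f} in., K = {k_max / 1000.0:.1f} Ksi*sqrt(in)")
```

Because the effective width is much larger than the 2.00 in. crack, the secant correction stays close to unity and the result is dominated by the remote stress and the half crack length.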

Prior  to  the  AE  tests  of  the  load  platform  an  additional  test  was  necessary  to 
check  for  extraneous  grip  noise.  The  previous  work  had  shown  with  all  the 
precautions  taken  that  extraneous  noise  was  not  a  difficulty  with  smooth  hydraulic 
grip inserts [2]. Since the two test platforms had different thicknesses in the grip
region  compared  to  the  previous  test  specimens,  it  was  necessary  to  use  different  grip 
inserts.  These  different  inserts  were  serrated  rather  than  smooth,  and  smooth  inserts 
were  not  available  in  a  reasonable  time  or  at  reasonable  prices  since  they  would  have 
to  be  made  as  a  special  order  in  England  where  the  hydraulic  grips  are  manufactured. 
Hence,  using  the  AE  system  and  sensors  previously  described  in  reference  2,  a  dummy 
aluminum  sample  was  tested  for  extraneous  noise  in  load  cycling  (0  to  22,000  lbs) 
above the loads planned for the solid test platform. These tests, which were carried
out at the same maximum sensitivity (limited by electronic noise) of 42 dB threshold
out of a 60 dB gain preamplifier, showed only a few events, as documented in table 2.
It  is  speculated  that  the  serrated  grip  inserts  do  not  contribute  additional  noise  as  a 
result  of  the  procedure  of  applying  3,000  psi  hydraulic  pressure  to  the  grips  and  then 
reducing  the  pressure  to  2,700  psi  prior  to  applying  axial  load  to  the  specimen. 

IV.  SELECTION  OF  AE  APPROACH  AND  PRELIMINARY  TESTING 

Since  the  limited  nature  of  this  research  project  dictated  that  totally  new 
approaches  could  not  be  developed  for  accurate  AE  source  location,  the  best  that  could 
be  done  was  to  extract  from  the  published  literature  previously  developed  approaches 
which  incorporated  results  from  wave  propagation  theory  for  accurate  location  using 
AE sensors in the far-field of the source. An extensive search of the AE literature as
well  as  discussions  with  others  in  the  AE  research  community  during  the  time  frame 
of  January  to  March  of  1991  identified  two  potential  approaches. 

Sachse  and  Sancar  [6]  were  issued  a  patent  in  1986  based  on  an  approach  that 
used  a  closely  spaced  array  of  four  small  diameter  approximately  flat  with  frequency 
(velocity  sensitive)  sensors.  This  approach  is  only  documented  in  the  open  literature 
by  the  patent  which  lacks  the  type  of  discussion  usually  found  in  a  typical  published 
paper  or  report.  Only  after  extensive  efforts  was  a  copy  of  a  hand-written  report,  upon 
which the patent was based, obtained in late October 1991 [7]. The patent describes
an  accurate  AE  location  approach  based  upon  visual  selection  of  the  same  waveform 
features  in  the  waveform  from  each  of  the  four  sensors  in  the  array.  For  each  of  two 
separate  features,  the  group  velocity  is  calculated,  and  then  using  these  two  group 
velocities  and  the  difference  in  arrival  times  of  the  selected  features,  the  range  and 
direction  to  the  source  are  calculated.  It  is  important  to  note  that  the  experimental 
work  upon  which  the  patent  was  based  included  only  simulated  AE  sources  (lead 
breaks)  applied  perpendicular  to  the  plane  of  a  plate.  No  testing  had  been  done  with 
real  AE  where  typically  inplane  AE  sources  are  present.  In  fact  the  theoretical  far- 
field  solution  for  an  inplane  AE  source  is  not  in  the  published  literature  to  correspond 
to  the  solution  available  for  the  out-of-plane  case  [8]. 

Ziola  and  Gorman  [9]  have  recently  been  developing  a  different  approach  to 
accurate AE far-field location in thin plates. Their approach is based on use of a
cross-correlation technique to determine the proper phase points within AE waveforms from
conventionally-spaced approximately flat with frequency sensors. The approach and
analysis are based upon applying classical plate theory (thin plates) which includes only
the extensional mode and the flexural mode of the more complex Lamb theory. As
with the above small array technique this cross-correlation approach has to date only
been  demonstrated  for  out-of-plane  simulated  AE  (pencil  lead  breaks)  rather  than  for 
real  inplane  AE  sources. 
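
As a generic illustration of how cross-correlation yields an arrival-time difference between two channels, consider the sketch below. It is not the specific procedure of reference 9: it uses two identical synthetic tone bursts, ignores dispersion, and assumes a 0.1 μs sampling interval. All names and signal parameters are hypothetical.

```python
import math


def tone_burst(n_samples, start, cycles_per_sample=0.05, length=60):
    """Hann-windowed sine burst beginning at sample index `start`."""
    signal = [0.0] * n_samples
    for k in range(length):
        if start + k < n_samples:
            window = 0.5 * (1.0 - math.cos(2.0 * math.pi * k / (length - 1)))
            signal[start + k] = window * math.sin(
                2.0 * math.pi * cycles_per_sample * k)
    return signal


def best_lag(x, y, max_lag):
    """Lag in samples (positive when y arrives later than x) that
    maximizes the cross-correlation of x and y."""
    def corr(lag):
        return sum(x[i] * y[i + lag]
                   for i in range(len(x)) if 0 <= i + lag < len(y))
    return max(range(-max_lag, max_lag + 1), key=corr)


DT_US = 0.1                       # assumed sampling interval
near = tone_burst(400, start=100)
far = tone_burst(400, start=137)  # same burst, 37 samples later
lag = best_lag(near, far, max_lag=100)
delay_us = lag * DT_US
print(f"estimated delay: {lag} samples = {delay_us:.1f} us")
```

In a real plate the two waveforms differ in shape because of dispersion, so the correlation peak is less sharply defined than in this idealized case.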

It  is  important  to  note  that  both  of  the  above  approaches  allow  for  dispersion 
of  the  acoustic  waves  and  thus  they  overcome  a  key  reason  for  the  inaccuracy  of 
conventional  AE  source  location  approaches.  At  present  both  approaches  also  do  not 
account  for  edge  reflections.  Thus  they  are  consistent  with  the  current  design 
objectives  for  the  AE  test  platform.  The  fact  that  neither  approach  has  been  tested 
with  real  AE  made  both  approaches  good  candidates  for  the  testing  aspect  of  this 
research. 

Since  only  one  approach  fit  the  budget  constraints  of  the  project,  the  small 
array  transducer  (SAT)  technique  was  chosen  since  this  approach  may  be  especially 
suitable  for  smart  structures  applications.  The  reason  that  this  technique  is 
particularly  suitable  is  that,  since  all  four  sensors  are  in  close  proximity  to  each  other, 
they  lend  themselves  to  local  processing  of  data.  Thus  after  local  processing  only  the 
results  need  to  be  sent  on  to  more  central  computing  stations.  With  the  alternate 
approach  the  raw  data  from  each  sensor  must  be  sent  on,  thereby  greatly  increasing 
the  amount  of  data  requiring  transport  over  significant  distances.  Further,  the  SAT 
technique  is  expected  to  be  less  sensitive  to  the  effects  of  the  radiation  pattern 
differences  from  one  source  to  the  next  since  all  four  sensors  are  at  about  the  same 
angle  to  a  given  source. 

Later  in  the  project  it  was  decided  to  invite  Dr.  Michael  Gorman  to  participate 
in  the  testing  at  Wright  Laboratories,  since  his  participation  would  not  add  to  the  cost 
because  he  had  available  support  from  the  Astronautics  Laboratory  at  Edwards  Air 
Force  Base,  CA.  Since  the  AE  test  platform  is  large  and  has  two  sides,  it  can  support 
up to four simultaneous AE approaches without "congestion" of sensors. Preliminary
results  of  Dr.  Gorman’s  participation  are  included  in  Appendix  I  to  this  report. 

Since  the  SAT  technique  depends  on  a  particular  sensor  called  a  pinducer 
(model  VP-1093  available  from  Valpey-Fisher),  it  was  necessary  to  determine  the 
relative  output  of  this  sensor  compared  to  those  used  previously  in  order  to  make  sure 
that  the  expected  loss  of  sensitivity  with  a  flat  sensor  would  not  result  in  a  total  loss 
of  capability  to  detect  the  AE  from  the  2024-T351  aluminum.  Table  3  lists  relative 
decibel  outputs  of  the  pinducer  (velocity  sensitive)  as  well  as  two  other  relatively  flat 
displacement  sensors  compared  to  the  output  of  the  resonant  sensor  for  which  the 
typical  amplitudes  for  AE  crack  sources  in  2024-T351  are  known  from  previous  work 
[2]. These output levels were obtained in the far-field using top and midplane edge
lead  breaks  as  the  source.  The  relative  outputs  were  determined  for  the  dominant 
wave  groups,  namely  the  first  extensional  mode  arrival  and  the  first  flexural  mode 
peak.  Since  the  previous  work  indicated  a  range  of  amplitudes  from  42  -  80  dB  with 
the μ-30 sensor, the pinducer being about 25 dB less "sensitive" was still expected to
detect  AE  signals.  But,  the  number  detected  would  be  considerably  less  and  hence 
effort  is  needed  to  develop  higher  output  flat  sensors  and/or  less  noisy  preamplifiers. 
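
The decibel argument above can be made concrete with a short calculation. The sketch assumes that amplitudes follow the usual 20 log10 convention and that the pinducer channels are limited by the same 42 dB noise-limited threshold quoted earlier for the resonant sensor. Both are assumptions for illustration.

```python
def db_to_amplitude_ratio(db):
    """Convert a decibel difference to a linear amplitude ratio."""
    return 10.0 ** (db / 20.0)


RESONANT_RANGE_DB = (42.0, 80.0)  # range reported with the resonant sensor
PINDUCER_LOSS_DB = 25.0           # approximate loss of the pinducer
THRESHOLD_DB = 42.0               # assumed noise-limited threshold

# Expected pinducer amplitudes for the same population of sources.
pinducer_range_db = tuple(a - PINDUCER_LOSS_DB for a in RESONANT_RANGE_DB)

# Smallest resonant-sensor amplitude still detectable with the pinducer.
min_detectable_db = THRESHOLD_DB + PINDUCER_LOSS_DB

ratio = db_to_amplitude_ratio(-PINDUCER_LOSS_DB)
print(pinducer_range_db, min_detectable_db, round(ratio, 3))
```

Under these assumptions only events that would register above about 67 dB on the resonant sensor remain above threshold on the pinducer, which is consistent with the expectation of considerably fewer detected events.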

Since  no  published  paper  (other  than  the  patent)  was  available  on  the  SAT 
approach,  the  method  described  in  the  patent  was  first  tried  using  data  constructed 
by  assuming  an  SAT  at  a  nominal  distance  of  6.00  in.  from  an  assumed  source  position. 
Using  the  velocities  of  two  wave  groups  expected  to  be  dominant  for  a  real  source  with 
sensors  in  the  far-field,  the  arrival  times  were  calculated  for  a  source  on  the  same  side 
of  a  plate  as  the  SAT  position.  Figure  3  shows  the  geometrical  experimental 
configuration.  The  propagation  distances  were  calculated  to  eight  decimals  based  on 
the location of the center of each pinducer. Then using 0.214 in/μs and 0.114 in/μs
as the velocities of the dominant groups in the lowest symmetric and antisymmetric
branches  the  propagation  times  were  calculated  to  eight  decimals.  From  these 
propagation  times  the  appropriate  differences  in  arrival  times  were  calculated  as  well. 
To  simulate  in  part  the  effects  of  different  finite  digitization  rates  the  arrival  time 
differences  were  rounded  to  the  appropriate  number  of  digits.  Appendix  II  gives  the 
equations  used  from  the  patent  to  calculate  the  group  velocities,  range,  and  direction 
to  the  source.  Table  4  shows  the  values  and  the  percentage  errors  based  on  the  known 
values  from  which  the  propagation  values  were  calculated.  Since  the  ratio  of  the  range 
to  the  source  relative  to  the  maximum  array  dimension  was  on  the  order  of  6,  it  was 
necessary  to  apply  a  correction  term  (see  Appendix  II)  to  the  approach  indicated  in  the 
patent  when  the  range  is  much  greater  than  the  maximum  array  dimension.  Clearly, 
from table 4 a digitization interval of 0.1 μs or shorter is required for accurate results in this
ideal  numerical  case. 
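
The effect of rounding the arrival-time differences can be illustrated with a small simulation. The sketch below does not reproduce the patent equations of Appendix II. It assumes a hypothetical square array of four sensors (0.70 in. on a side) centered 6.00 in. from the source, and as a simple proxy for accuracy it recomputes a velocity from the known path difference between two sensors and the rounded time difference.

```python
import math

V_SYM, V_ANTI = 0.214, 0.114   # in/us, velocities quoted in the text
SOURCE = (0.0, 0.0)
RANGE_IN = 6.0
HALF_SIDE = 0.35               # hypothetical array half-side, in.
SENSORS = [(RANGE_IN + dx, dy)
           for dx in (-HALF_SIDE, HALF_SIDE)
           for dy in (-HALF_SIDE, HALF_SIDE)]


def quantize(t_us, interval_us):
    """Round a time to the nearest multiple of the digitization interval."""
    return round(t_us / interval_us) * interval_us


def time_differences(velocity, interval_us):
    """Rounded arrival-time differences of sensors 1..3 relative to sensor 0."""
    times = [math.dist(SOURCE, s) / velocity for s in SENSORS]
    return [quantize(t - times[0], interval_us) for t in times[1:]]


def apparent_velocity_error(velocity, interval_us):
    """Fractional velocity error caused by rounding one time difference."""
    d = [math.dist(SOURCE, s) for s in SENSORS]
    path_diff = d[2] - d[0]    # sensors 0 and 2 lie along the range direction
    dt = quantize(path_diff / velocity, interval_us)
    return abs(path_diff / dt - velocity) / velocity


for interval in (1.0, 0.1, 0.01):
    err_sym = apparent_velocity_error(V_SYM, interval)
    err_anti = apparent_velocity_error(V_ANTI, interval)
    print(f"{interval:5.2f} us: {100 * err_sym:.2f}%  {100 * err_anti:.2f}%")
```

For this assumed geometry the faster symmetric group is the more sensitive to rounding, because its time differences across the array are only a few microseconds.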

V.  AE  TEST  APPROACH:  LARGE  PLATES

After  the  center  cracks  were  sharpened  out  to  a  2.00  in.  length  by  cyclic  fatigue 
at  about  0.5  Hz,  each  specimen  was  mounted  in  the  100  kip  Instron  for  testing  with 
AE.  Two  sensor  arrays  were  used  for  the  four  pinducers.  First,  tests  were  made  with 
the  SAT  configuration  (at  30°  to  the  crack  plane,  see  figure  3)  suitable  for  source 
location  following  the  patent.  Second,  additional  tests  were  made  with  an  in-line 
configuration  with  all  four  pinducers  spaced  2.00  in.  apart  along  an  imaginary  line 
emanating  from  one  crack  tip  at  30°  to  the  crack  plane  (see  figure  4).  The  30° 
position  was  chosen  to  provide  for  observation  of  significant  amplitudes  of  bulk 
longitudinal  and  shear  radiation  for  a  crack  plane  AE  source.  The  sensors  were  held 
in position with an aluminum cantilever bar that was clamped to the thick shoulder
of  the  specimen.  The  pinducers  were  bonded  with  adhesive  (Super  Glue)  to  the  inside 
diameter  of  a  hollow  rod  of  micarta  as  shown  schematically  in  figure  5.  Then  the 
micarta  rod  was  inserted  into  close  tolerance  slip  fit  holes  in  the  cantilevered  bar  until 
the  sensor  came  into  contact  with  the  plate.  A  tapered  wedge  was  then  inserted  into 
the  slots  at  the  top  of  the  micarta  to  expand  it  to  hold  it  firmly  in  place. 

The four sensors used were pinducers with coaxial pinducer cables (Valpey-Fisher
models VP-1093 and VPC-4, respectively). Modified preamplifiers (Physical
Acoustics  model  1220A)  at  60  dB  gain  with  a  nominal  bandpass  of  0.035-1.0  MHz  (3  dB 
down  points)  were  used  powered  by  the  nominal  28  volts  from  a  24  channel  Spartan 
(Physical  Acoustics  Corp.)  expansion  box.  The  expansion  box  was  used  only  to  supply 
the  power  to  the  preamplifiers  and  to  properly  separate  the  AE  from  the  dc  power. 
For  most  of  the  tests  the  internal  power  supply  of  the  Spartan  was  not  used.  Instead 
several  6-volt  "lantern"  batteries  were  connected  in  series  and  then  to  the  distribution 
board  of  the  Spartan.  This  approach  was  used  to  try  to  decrease  background  electronic 
noise.  For  data  recording  a  four  channel  waveform  recorder  (Nicolet  model  440,  12 
bits, 0.1 μs digitization interval) was connected by electrical tees at the Spartan
distribution  board.  All  four  channels  were  triggered  simultaneously  to  save  typically 
4000  data  points  (including  pre-trigger  information).  These  data  were  stored  onto  3- 
1/4  inch  standard  disks. 

Some  electrical  noise  originating  from  an  unknown  unshielded  component  in 
the  Instron  was  discovered  at  an  approximate  burst  frequency  of  10  kHz.  A  braided 
cable  was  connected  between  the  plate  specimen  and  the  Spartan  box  to  eliminate  the 
noise.  In  order  for  this  approach  to  be  successful  the  original  couplant  selected  for  the 
pinducers  had  to  be  changed.  The  pinducers  do  not  have  an  insulating  wear  plate 
where  they  come  into  contact  with  the  test  specimen  (as  do  all  commercial-type  AE 
sensors). The original couplant Apiezon M did not maintain sufficient electrical
insulation  even  though  it  is  sufficiently  viscous  that  it  does  not  flow  perceptibly  down 
a  vertical  surface  under  the  influence  of  gravity.  Substitution  of  a  very  stiff  couplant 
wax  called  Tacki-wax  (amorphous  wax  with  a  high  melting  point,  Cenco)  provided 
sufficient  insulation  combined  with  the  ground  cable  to  eliminate  the  problem. 
Testing  with  0.3  mm  pencil  lead  indicated  no  loss  of  sensitivity  with  the  new  couplant. 

The  original  plan  was  to  trigger  the  transient  recorder  with  one  pinducer 
channel.  This  approach  did  not  work  well  because  the  level  of  the  AE  waveform  was 
not  significantly  above  the  preamplifier  electronic  noise  peak  levels.  Hence  to 
eliminate  false  triggers  and  at  the  same  time  to  increase  the  number  of  AE  events 
captured,  a  fifth  AE  channel  was  added  with  a  significantly  more  sensitive  resonant 
type  sensor  (Physical  Acoustics  model  R-15).  The  signal  from  this  sensor  located  on 
the  opposite  half  of  the  plate  (relative  to  the  crack  plane)  compared  to  the  SAT  was 
used  as  an  input  to  trigger  all  four  pinducer  channels  simultaneously.  This  trigger 
sensor  was  coupled  with  the  same  couplant  and  used  a  standard  preamplifier  (Physical 
Acoustics 1220A, 200-400 kHz, 60 dB).

To  gather  the  sample  waveforms  from  real  AE  sources  with  approximately  flat 
AE  sensors  located  in  the  far-field  of  a  uniform  thickness  plate  of  sufficient  size  to 
eliminate  early  edge  reflections  the  specially  designed  AE  platform  was  loaded  in 
displacement  control  between  zero  and  approximately  the  maximum  load  seen  at  the 
end  of  the  precracking.  Table  5  gives  the  loading  conditions  for  both  specimen 
thicknesses.  Prior  to  and  after  each  load  cycle  lead  breaks  [Pentel,  0.3  mm,  2H]  were 
made  at  one  or  both  crack  tips  for  several  purposes:  1)  to  verify  proper  acoustic 
coupling of each sensor; 2) to check for proper operation of the electronics; and 3) to
provide  data  for  a  source  at  the  crack  tips  with  a  high  signal  to  noise  ratio.  In  some 
cases the lead was broken at the crack tips on the side opposite to the location of the
pinducers and trigger sensor. It should be noted that this out-of-plane source would
not be expected to partition energy in the plate modes in the same fashion as an
in-plane source, but we were unable to devise an appropriate artificial in-plane source at
the  crack  tips.  During  this  cyclic  testing  the  expected  AE  source  was  that  due  to  crack 
opening  causing  fracture  of  the  cold  welds  as  observed  previously  in  reference  2.  To 
increase  the  number  of  events,  significant  rest  periods  at  zero  load  were  incorporated 
in  the  test  schedule,  since  the  previous  results  had  shown  the  usefulness  of  such  an 
approach  [2].  Since  it  was  desirable  to  limit  sets  of  recorded  waveforms  to  those  with 
reasonable  signal  to  noise  ratios,  interrupted  cycling  was  used.  Hence,  when 
appropriate  waveforms  appeared  on  the  Nicolet  display  the  crosshead  was  stopped. 
The  pause  gave  time  to  record  the  waveforms  to  disk  and  to  note  the  load  at  which  the 
AE  source  event  occurred.  The  test  was  then  resumed  until  the  next  waveforms 
appropriate  for  storage  appeared.  After  a  number  of  such  fatigue  cycles,  the  specimen 
was  given  some  overload  cycles  to  attempt  to  obtain  AE  waveforms  from  inclusion 
particle  fracture  in  the  crack  tip  plastic  zone.  Table  5  also  gives  the  maximum 
overload  levels  for  each  specimen.  The  overload  cycles  were  applied  with  manual 
control  of  the  Instron  in  20  to  50  lb  load  increments.  This  procedure  was  used  since 
the  previous  work  [2]  had  indicated  relatively  high  rates  of  AE  during  the  overload. 
Hence,  if  this  overloading  was  done  at  a  fixed  crosshead  rate  at  most  one  set  of 
waveforms would be recorded. Since no overload events were recorded, a final set of
overload cyclic tests was carried out on the 1/8 in. thick sample with no interruptions
or  manual  control.  These  cycles  were  made  at  an  increased  crosshead  rate  as  well  so 
that  the  cyclic  rate  was  on  the  order  of  0.08  Hz.  Again,  no  overload  events  were 
recorded. 
Finally, after the cyclic loading and overload tests were completed, a series of
pencil lead breaks (Pentel, 0.3 mm, 2H) was carried out on one edge of each sample.
The  lead  was  broken  near  the  top  edge  (test  platforms  were  horizontally  supported  for 
these  tests)  of  the  plates  at  a  point  in  the  crack  plane.  Two  pinducers  were  coupled 
to  the  top  surface  with  Apiezon  M.  One  pinducer  was  placed  at  a  fixed  position  near 
the  plate  edge  to  act  as  a  fixed  trigger.  The  second  pinducer  was  moved  after  each 
lead  break  in  increments  of  1/8  or  1/4  inch  through  a  range  of  distances  from  about 
3  inches  to  9  inches  from  the  source  position.  These  tests  were  done  to  characterize 
the  features  of  the  typical  wave  transient  propagation  in  the  test  samples.  The  edge 
break  was  used  to  simulate  the  in-plane  nature  of  the  real  AE  sources. 

VI.  RESULTS  AND  DISCUSSION 

The  success  of  the  AE  test  platform  in  providing  a  test  specimen  where 
advanced  techniques  for  more  accurate  source  location  of  real  AE  sources  can  be 
studied  is  demonstrated  in  figure  6.  This  figure  shows  large  amplitude  high  fidelity 
waveforms  from  the  crack  opening  source  for  the  SAT  configuration  on  both  plate 
thicknesses.  These  waveforms  allow  identification  of  both  the  extensional  and  flexural 
modes  (see  labels  in  figure  6)  which  are  expected  from  either  the  simple  plate  wave 
theory  or  the  more  complex  Lamb  wave  analysis.  Further  the  required  time  interval 
of  observation  is  completely  free  of  the  complications  of  either  free  edge  reflections 
or  reflections  from  the  change  in  plate  thickness  at  the  shoulders  of  the  specimen. 
Hence,  available  theoretical  results  can  be  applied  to  analyze  these  results.  And  at 
some  time  in  the  future  when  the  solution  for  normal  surface  displacements  in  the  far- 
field  due  to  an  inplane  AE  type  source  is  presented,  even  more  detailed  analysis  can 
be  done.  As  previously  noted,  to  date  only  the  out-of-plane  AE  type  source  far-field 
solution has been published [8].

It  should  be  pointed  out  that  this  successful  test  fits  the  other  constraints  for 
useful  AE  testing  such  as  no  significant  extraneous  AE  noise  sources.  The  key  factors 
which  lead  to  lack  of  noise  are  use  of  a  solid  specimen,  center  crack  with  hydraulic 
grips  which  can  be  loaded  prior  to  axial  load  application,  and  a  relatively  quiet  screw- 
driven  test  machine.  It  was  also  important  to  demonstrate  with  the  test  platform  that 
real AE sources are detectable with the less sensitive flat with frequency sensors.
Data  from  this  type  of  sensor  are  essential  for  use  with  theoretical  results.  Certainly 
as  can  be  concluded  from  figure  6,  more  work  needs  to  be  done  to  improve  the  signal 
to  electronic  noise  ratio.  Such  improvements  will  not  only  allow  more  fundamental 
analysis  techniques  to  be  applied  but  will  also  increase  the  number  of  AE  signals 
detected, thereby increasing the detectability of smaller and smaller AE sources. To
increase signal to noise ratio, higher output flat sensors and/or less noisy
preamplifiers need to be developed.

The  test  platform  also  meets  cost  constraints  in  the  sense  that  a  single 
specimen  can  be  used  for  different  source  location  approaches  since  the  crack  opening 
source  is  repeatable.  Further,  since  the  crack  tips  can  be  resharpened  by  additional 
fatigue  cycling,  even  crack-tip  overloads  can  be  used  repeatedly  to  generate  AE  when 
such  a  source  is  operative. 

A  further  advantage  of  the  test  platform  is  that  more  than  one  experimenter 
can  use  it  at  the  same  time  since  both  plate  surfaces  are  accessible.  And  when  the 
SAT  approach  is  used,  four  separate  quadrants  could  be  used  at  one  time  on  one  side 
of  the  sample.  Also,  since  both  sides  of  the  sample  are  accessible,  studies  on  the 
differences  in  the  partitioning  of  energy  in  the  various  wave  modes  can  be  made  on 
two  separate  sides  for  the  same  source.  Further,  essentially  360°  around  the  crack  tip 
can also be used so that radiation pattern effects in the far-field can be studied
potentially  for  two  separate  source  mechanisms  (asperity  cold  weld  fracture  and 
inclusion  particle  fracture). 

A  potential  weakness  of  the  sample  as  currently  designed  is  that  the  stress  field 
in  the  uniform  thickness  plate  portion  of  the  sample  is  not  uniform  across  the  width 
of  the  sample.  Figure  7  shows  the  axial  strain  versus  position  for  one  quadrant  of  the 
sample  based  on  a  finite  element  analysis  for  loading  without  a  center  crack.  Clearly, 
the  strains  are  higher  in  the  center  of  the  specimen  in  line  with  the  loading  tabs  of 
the  sample.  For  this  initial  design  the  potential  for  this  effect  was  ignored  as  has 
already  been  discussed.  As  will  be  discussed  later  in  this  report,  there  are  some 
reasons  why  a  subsequent  design  of  the  test  platform  might  include  some  transition 
zone  provisions  to  meet  other  experimental  goals. 

As a first step in a more detailed examination of some of the waveforms
obtained in this experiment, we consider figure 8. This figure, for a crack opening
event recorded with the in-line configuration, shows clear evidence of two AE events
that occurred in close proximity in time. At about 10.8 µs after the arrival of the first
event, the characteristic waveform of a second event appears. Clearly, without the
present experimental test platform such an observation would not be possible, since in
a more normal size test specimen edge reflections would compromise the ability to
distinguish the second event in close proximity in time. Further, with a
traditional resonant-type sensor, the waveform would be distorted, and the distortion
would make recognition of the second event difficult. It is of interest to consider the
potential source location of the first and second events. Since from an AE wave
propagation viewpoint the two crack tips are approximately in the near field, it is of
interest to consider the times for bulk waves to travel from one tip to the other. This
time will give an indication as to whether the stress redistribution from the first event
contributed to the generation of the second event. At a bulk p-wave velocity in
aluminum of about 0.250 in/µs, the 2.0 inch distance represents a time of about 8.0 µs,
while the bulk s-wave velocity of about 0.120 in/µs represents a time of about 16.6 µs.
Since the observed 10.8 µs separation lies between these two travel times, it is
possible that the local stress redistribution from one AE source event can
trigger another event. Certainly this idea is open for future investigation, particularly
if a sensor/preamplifier combination with increased sensitivity can be obtained.
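
The travel-time comparison above can be checked with a few lines of arithmetic. The sketch below is only an illustration using the values quoted in the text; the variable names are hypothetical, and the "bracketing" test is one simple way to express the argument, assuming the two events originate at opposite crack tips.

```python
# Travel time of bulk waves between the two crack tips, using values quoted in the text.
TIP_SEPARATION_IN = 2.0      # distance between crack tips, inches
V_P_IN_PER_US = 0.250        # approximate bulk p-wave velocity in aluminum, in/us
V_S_IN_PER_US = 0.120        # approximate bulk s-wave velocity in aluminum, in/us
EVENT_SEPARATION_US = 10.8   # observed delay between the two AE events, us


def travel_time_us(distance_in, velocity_in_per_us):
    """Time (us) for a wave of the given velocity to cover the given distance."""
    return distance_in / velocity_in_per_us


t_p = travel_time_us(TIP_SEPARATION_IN, V_P_IN_PER_US)
t_s = travel_time_us(TIP_SEPARATION_IN, V_S_IN_PER_US)

# Assumption for illustration: a triggered second event is plausible if the
# observed delay falls between the p-wave and s-wave travel times.
delay_is_bracketed = t_p <= EVENT_SEPARATION_US <= t_s

print(f"p-wave: {t_p:.1f} us, s-wave: {t_s:.1f} us, bracketed: {delay_is_bracketed}")
```

The check says nothing about the actual mechanism; it only confirms that the observed delay is compatible with a disturbance travelling from one tip to the other at a bulk wave speed.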

Next  we  turn  to  an  examination  of  the  AE  waveforms  from  the  point  of  view 
of  use  of  such  waveforms  for  the  purposes  of  accurate  source  location  by  the  SAT 
configuration  approach.  The  first  aspect  to  consider  is  shown  in  figure  9  which  is  more 
representative  of  the  signal-to-noise  ratio  of  most  of  the  crack  opening  AE  waveforms 
at  the  SAT.  Clearly  preamplifier  noise  has  a  significant  effect  on  the  signal  and  will 
make  more  difficult  the  identification  of  the  same  waveform  feature  in  all  four  sensors. 
Further,  it  is  clear  in  this  figure  and  also  in  figure  6  that  it  is  almost  impossible  to 
clearly  distinguish  any  features  in  the  waveform  prior  to  the  arrival  of  the  flexural 
mode.  This  observation  is  made  more  forcefully  if  we  compare  figures  6  and  9  with 
figure  10  which  represents  a  lead  break  at  the  crack  tip  nearest  to  the  SAT  position 
and  the  in-line  configuration.  In  this  latter  figure,  both  the  extensional  and  flexural 
modes are easy to identify. There seem to be two potential reasons for the almost
total  lack  of  extensional  signal  in  the  real  AE  waveforms.  First,  the  low  signal  to  noise 
ratio  makes  it  difficult  to  see  the  extensional  mode.  Second,  it  seems  that  the  real  AE 
signals  from  the  present  crack  opening  source  deposit  a  dominant  amount  of  energy 
in  the  flexural  mode.  For  example,  some  of  the  largest  amplitude  waveforms  where 
some  of  the  extensional  mode  can  be  identified  in  the  1/4  inch  plate  have  a  typical 
ratio  of  flexural  peak  amplitude  to  extensional  peak  amplitude  on  the  order  of  15-20:1. 




During  the  lead  break  experiments  on  the  edges  of  the  1/4  inch  plate,  it  was 
determined that the typical ratio was about 10-12:1 for top or bottom edge breaks and
about 4:1 for center of edge breaks (both at about the same source to sensor
distance as for the real AE). Since, as Gorman and Prosser [10] have indicated, a
center  break  deposits  a  greater  portion  of  the  energy  in  the  extensional  mode,  the 
above  results  indicate  that  the  crack  opening  source  observed  in  these  experiments 
was  dominated  by  a  source  mechanism  that  deposited  more  energy  into  the  flexural 
mode  (as  observed  with  vertical  sensitivity  sensors).  In  order  to  explain  the  reason  for 
the  dominance  of  the  flexural  source,  it  is  worthwhile  to  consider  an  observation  which 
was  made  during  the  fatigue  precracking  of  the  specimen.  While  fatiguing  the  samples 
there  was  a  noticeable  flexing  motion  of  the  plate  which  is  consistent  with  the  fact 
that  the  central  portion  of  the  plate  experienced  higher  axial  strains  than  the  outer 
portions  of  the  plate.  Thus  it  is  speculated  that  this  flexing  motion  led  to  the  crack 
opening  source  being  dominated  by  fracturing  of  cold  welds  near  the  outer  edges  of  the 
plate.  To  resolve  this  question  further  would  require  more  detailed  studies  (beyond 
the  scope  of  this  initial  study)  of  the  elastic  deformation  of  the  sample  with  a  center 
crack. 

The  key  significance  of  the  loss  of  extensional  signal  is  that  the  demonstrated 
studies  in  the  handwritten  report  by  Sancar  [7]  specifically  used  extensional  waveform 
features  to  determine  one  group  velocity.  In  particular  the  very  first  extensional  peak 
in  the  waveform  is  very  easy  to  identify  and  use  for  a  group  velocity  determination, 
since  this  plate  wave  (nondispersive)  represents  both  low  frequencies  (i.e.  long 
wavelengths  compared  to  the  pinducer  element  diameter  and  the  maximum  diameter 
of  the  SAT)  and  the  displacement  is  always  in  the  same  direction  as  the  wave 
propagates. The other feature which Sancar [7] typically used was the initial part of
the flexural mode. For this part of the waveform the phase point (feature) is more
difficult  to  identify,  especially  in  the  presence  of  noise  since  the  frequencies  are  higher 
(i.e.  short  wavelengths  compared  to  maximum  diameter  of  the  SAT)  and  the  sign  of 
the  displacement  changes  from  tension  to  compression  and  back  again  with  increasing 
propagation  distance.  Further,  the  shape  changes  with  distance  since  the  flexural 
mode is in general dispersive. Thus, without the extensional mode and with the difficulty in
determining constant phase points for the flexural mode, it was not possible to try the
SAT approach to calculate the source range and direction. With additional work
beyond  the  scope  of  this  limited  study,  it  is  expected  that  constant  phase  points  could 
be  extracted  from  the  flexural  mode. 

To  begin  to  develop  a  rational  way  to  select  common  features  in  the  flexural 
mode  a  more  detailed  examination  of  the  determination  of  group  velocities  was  made 
using  the  top  edge  leadbreak  data.  The  leadbreak  data  were  chosen  for  this 
examination  since  they  are  free  of  the  electronic  noise.  Figure  11  shows  relative 
arrival  time  of  the  plate  wave  as  a  function  of  distance  for  the  1/8  inch  thick  plate. 
The  slope  gives  the  group  velocity  which  compares  well  with  expected  values.  The  fact 
that  most  data  points  are  right  on  or  close  to  the  line  indicates  the  ease  with  which 
this  is  done  and  the  reason  why  it  is  desirable  to  have  sufficient  sensitivity  to  detect 
the  extensional  mode  for  real  AE.  Figure  12  shows  similar  results  for  the  flexural 
mode  using  the  zero  crossing  with  the  greatest  slope  for  the  feature  in  the  peak 
region.  As  can  clearly  be  seen  the  slope  gives  a  reasonably  accurate  velocity  for  the 
dominant  flexural  group  that  travels  at  approximately  the  shear  velocity,  but  the  fit 
is  poorer,  even  over  short  distances  on  the  order  of  the  SAT  maximum  dimension. 
This  result  indicates  that  significant  errors  would  occur  in  determining  the  correct 
time differences for use with the SAT approach, where the close spacing of the sensors
requires very high accuracy of the measured time differences for accurate calculations
of source location.
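
The group velocity determination described above (slope of source distance against feature arrival time) can be sketched as an ordinary least-squares line fit. This is a minimal illustration, not the procedure used in the study: the function name and the synthetic data are hypothetical, and the 0.214 in/µs velocity is simply the assumed higher group velocity quoted in the footnote of Table 4.

```python
def fit_group_velocity(arrival_times_us, distances_in):
    """Least-squares fit of distance = velocity * time + intercept.

    Returns (velocity in in/us, intercept in inches, RMS misfit in inches).
    """
    n = len(arrival_times_us)
    mean_t = sum(arrival_times_us) / n
    mean_d = sum(distances_in) / n
    s_tt = sum((t - mean_t) ** 2 for t in arrival_times_us)
    s_td = sum((t - mean_t) * (d - mean_d)
               for t, d in zip(arrival_times_us, distances_in))
    velocity = s_td / s_tt
    intercept = mean_d - velocity * mean_t
    misfit = [d - (velocity * t + intercept)
              for t, d in zip(arrival_times_us, distances_in)]
    rms_misfit = (sum(r * r for r in misfit) / n) ** 0.5
    return velocity, intercept, rms_misfit


# Synthetic, noise-free example (hypothetical data): sources at 2..8 inches,
# an arbitrary 5 us trigger offset, and an assumed velocity of 0.214 in/us.
ASSUMED_VELOCITY = 0.214
distances = [2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
times = [5.0 + d / ASSUMED_VELOCITY for d in distances]

velocity, intercept, rms_misfit = fit_group_velocity(times, distances)
print(f"group velocity = {velocity:.3f} in/us, RMS misfit = {rms_misfit:.2e} in")
```

With real data the RMS misfit gives a simple measure of how well a chosen feature tracks a single group velocity, which is the distinction drawn between the extensional and flexural fits.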

Significant  numbers  of  AE  events  were  observed  for  the  crack  opening  source. 
These  events  occurred  over  a  limited  range  of  loads  consistent  with  a  fully  closed  crack 
at  low  loads  and  a  fully  open  crack  at  higher  loads  with  the  events  occurring  at  loads 
in between. Not as many events were observed as in the previous study [2], but
with the increased specimen size and, more importantly, the less sensitive sensors,
fewer events were expected.

Virtually no events were observed in the overload tests, which was contrary to
the previous observations [2]. To investigate why the inclusion fracture AE source was
not observed with the large samples, dogbone tensile samples were fabricated from
both the previous 2024-T351 material and the current material. These samples were
tested in the usual way for a uniform tensile sample at a constant crosshead rate of
0.050 in/min using smooth hydraulic grips, a resonant µ-30 sensor (Physical Acoustics),
a bandpass of 100-300 kHz, and root-mean-square (RMS) voltmeters (Hewlett-Packard
3400A). Figure 13 shows that the results were essentially the same from both material
samples.  Hence,  both  samples  do  have  the  same  inclusion  fracture  source  typical  of 
the  2024  material.  Thus  we  must  seek  another  explanation  for  the  lack  of  overload 
emission. 

Another  potential  explanation  is  the  change  in  the  stress  field  in  the  large 
sample.  Since,  as  figure  7  shows,  the  axial  stress  is  not  uniform,  significant  transverse 
stresses  will  also  be  present.  Hence,  the  stress  field  at  the  crack  tip  will  not  be  the 
same  as  for  the  uniform  axial  stress  case.  And  the  plastic  zone  size  and  shape  could 
be  expected  to  change  as  well.  Without  detailed  studies  (beyond  the  scope  of  this 
work), it is not possible to clearly determine if this change in stress field is the reason.

VII. RECOMMENDATIONS FOR FUTURE WORK

Based  on  the  results  of  this  study,  four  primary  recommendations  for  future 
work  are  presented.  First,  effort  should  be  invested  in  the  development  of  flat  with 
frequency  sensor/pre-amplifier  designs  which  have  up  to  an  order  of  magnitude  better 
sensitivity  than  the  currently  available  pinducers.  Second,  an  additional  AE  test 
platform  should  be  designed  which  will  provide  for  a  more  uniform  axial  stress 
distribution  in  the  plate  section.  Then  specimens  would  be  available  which  emphasize 
both  extensional  and  flexural  modes.  It  is  likely  that  both  specimens  would  be  needed 
since  real  AE  applications  can  result  in  both  of  these  cases.  Third,  for  the  SAT 
approach  additional  effort  should  be  made  to  develop  a  rational  and  automatic 
approach  to  identify  the  same  feature  point  for  the  flexural  mode.  This  work  could 
start  with  Sancar’s  report  [7],  since  it  was  obtained  too  late  to  be  of  much  use  in  this 
initial study. Fourth, a theoretical far-field solution needs to be completed for an
in-plane transient AE source.

VII.  CONCLUSIONS 

1)  An  AE  test  platform  which  meets  requirements  for  generation  and 
observation  of  far-field  real  AE  waveforms  with  high  fidelity  without 
edge  reflections  or  extraneous  noise  in  a  plate  was  successfully  designed 
and  tested  for  use  in  improvement  of  source  location  accuracy  for  smart 
structures  applications. 

2)  This  test  platform  concept  provides  for  potentially  two  different  AE 
sources, is reusable so that the cost can be spread over several different
test approaches, and can support more than one experimental approach
at the same time.

3)  A  full  test  of  the  SAT  approach  could  not  be  carried  out  due  to  the 
current  difficulties  of  lack  of  extensional  mode,  poor  signal-to-noise 
ratio,  and  difficulty  in  selection  of  the  same  phase  point  in  the  flexural 
mode. 

4)  Evidence was demonstrated that real AE source events can occur in
close time proximity to each other.

Table 1. Pre-cracking History for Both Specimens with
Initial Crack Length 1.00 Inch.


a.  1/4 Inch Thickness

Maximum Load,    Number of    Finishing Crack
lbs              Cycles       Length (2a), in.

34,000           14,000       1.013
31,000              500       1.038
25,000            1,000       1.060
23,000            2,000       1.104
22,000 +         15,000       1.203
21,500           15,000       1.324
20,700           16,000       1.418
19,900           15,000       1.515
19,200           15,000       1.612
18,600           15,000       1.719
18,000           15,000       1.829
17,500 *         12,000       1.910
17,000           13,000       2.001

+  Minimum load 10% of maximum
*  Minimum load 1% of maximum

b.  1/8 Inch Thickness

Maximum Load *,  Number of    Finishing Crack
lbs              Cycles       Length (2a), in.

19,400              500       1.01
17,000              284       1.02
15,500              417       1.03
14,000              440       1.04
13,000              500       1.05
12,470            2,500       1.10
12,050            5,000       1.20
11,500            5,000       1.30
11,100            5,000       1.40
10,700            5,000       1.50
10,400            5,000       1.60
10,100            5,000       1.70
 9,800            5,000       1.80
 9,140            5,000       2.00

Table 2. Extraneous Noise Events from Use of Serrated Hydraulic
Grip Inserts with 60 dB Preamplifier, 42 dB Threshold,
200-400 kHz, µ-30 Sensor, Loads 0 to 22,000 lbs.


Table 3. Relative Outputs from Different AE Sensors 6.0 inch
from Leadbreaks on Edge of 0.313 inch Aluminum Plate
for 35 kHz - 1.0 MHz Bandpass of Preamplifier (40 dB)
with Vacuum Grease Couplant

Sensor / Wave Form Feature          Average for First     Average for First
                                    Half Cycle of         Half Cycle of
                                    Extensional Mode*,    Flexural Mode*,
                                    dB                    dB

Harisonics, Model G-0504,           -12                   -16
1/4 inch diameter crystal

Valpey Fisher, Pinducer             -25                   -23
Model 1093, 0.054 inch diameter
crystal

EBL, Model NBS                      -15                   -19
Point Contact, ~0.025 inch

Physical Acoustics, Model µ-30,     0 (reference)         0 (reference)
Crystal Diameter ~0.31 inch

*  0.3 mm lead break at center of plate edge
   0.3 mm lead break at top of plate edge

Table 4. Approximate Effect of Digitization Rate on Accuracy
of Determined Group Velocities, Range and Direction *


Digitization                     Percentage Error
Interval, µs    Higher Group    Lower Group    Range to    Direction to
                Velocity        Velocity       Source      Source

0.5             0.5             2.2            3.4         7.4
0.3             3.4             2.6            11.3        6.7
0.1             0.6             1.2            1.7         0.2
0.05            0.5             0.3            1.1         0.2
0.03            0.5             0.6            0.7         0.01
0.010           0.1             0.2            0.2         0.06

*  Assumed higher group velocity 0.214 in/µs, lower group velocity 0.114 in/µs,
range to center of SAT 6.0 in, and angle with respect to SAT axis 30°.


Table  5.  Loading  Conditions  for  AE  Test  Platforms 


Plate Section    Interrupted Cycling          Overload Test    Higher Rate Cycling
Thickness, in    Maximum *    Crosshead       Maximum          Maximum *    Crosshead
                 Load, lbs    Rate, in/min    Load, lbs        Load, lbs    Rate, in/min

1/8              9,100        0.002           10,000           10,500       0.2
1/4              16,900       0.002           30,000           --           --

*  Minimum < 1% of maximum

Figure 1. Drawing of nominal dimensions of solid AE test
platform (all dimensions in inches).

Figure 2. Drawing of nominal dimensions of sub-scale adhesively
bonded sample showing main plate and reinforcement tabs
to be bonded to the ends of the sample (all dimensions in
inches).

Figure 4. Relative positions of SAT and in-line pinducer
configurations (all dimensions in inches, p_i = pinducers).

Figure 5. Drawing of pinducer without connector adhesively bonded into
micarta holder (all dimensions in inches).

Figure 6 (a). Large signal to noise waveforms from SAT configuration
for crack opening source in center-crack plate (1/4 inch
thick plate). [Vertical axis: amplitude (volts).]

Figure 6 (b). Large signal to noise waveforms from SAT configuration
for crack opening source in center-crack plate (1/8 inch
thick plate).

Figure 7. Finite element axial strain distribution in one quadrant
at a load of 10,000 lbs for 1/4 inch thick test platform
without a crack.

Figure 8. Two events separated by 10.8 µs for crack opening source
in 1/8 inch plate with in-line sensor configuration at
2430 lbs. [Axes: amplitude (volts) versus time (microseconds).]

Figure 9. Waveforms from SAT with typical signal to noise ratio
from crack opening event at 2430 lbs load on the 1/4 inch
thick plate.

Figure 10 (a). Waveforms from SAT for lead break at near crack
tip on 1/8 inch thick plate (Gain 40 dB).

Figure 10 (b). Waveforms from in-line configuration for lead
break at near crack tip on 1/4 inch thick plate
(Gain 40 dB).

Figure 11. Source distances versus first half cycle peak extensional
feature arrival time for top edge lead breaks on 1/8 inch
thickness sample. [Horizontal axis: feature arrival time (µs).]

Figure 12. Source distance versus arrival time of zero crossing of
high amplitude flexural feature with the steepest slope
on 1/8 inch thickness sample (sign indicates slope of
zero crossing).

Figure 13 (b). Root-mean-square of AE (88 dB & 100-300 kHz) and
load versus time for tensile test of 2024-T351 from
1/4 inch thick plate (Full scale values are RMS = 0.1 V;
Load = 5,000 lbs; and chart speed = 4 cm/min).

APPENDIX I. Short Report from Dr. Gorman

Introduction


This  report  is  an  added  appendix  to  the  main  report  by  Dr.  Marvin  A. 
Hamstad.  The  purpose  of  this  section  is  to  give  the  results  of  a  different  method  of 
locating  and  analyzing  acoustic  emission  caused  by  fracturing  of  tiny  cold  welds 
behind  crack  tips  in  aluminum  plates.  The  details  of  the  plates,  the  creation  of 
fatigue  cracks,  and  the  loading  schemes  are  given  in  the  main  report. 

Experimental 

Three  broadband  transducers  (Harisonic,  Model  G0405)  were  affixed  with 
Tacki-wax  (Cenco)  to  the  surface  of  the  plate  in  a  triangular  arrangement  about  the 
crack.  It  was  desired  to  measure  the  vertical  displacement  of  the  surface  as  the 
transient  pulse  passed  underneath  the  transducer.  The  transducers  were  surface 
contact ultrasonic types having a broad resonance at 5 MHz. They were selected
because  they  have  been  shown  to  have  a  smooth  and  nearly  flat  response  below  one 
megahertz  where  acoustic  emission  is  found.  The  signals  from  the  transducers  were 
fed  into  wideband  preamplifiers  which,  in  turn,  were  connected  to  transient 
recorders.  The  digitized  waveforms  were  stored  on  a  hard  disk  in  a  personal 
computer. 

Typically,  narrowband  transducers  have  been  used  for  acoustic  emission  work 
because  of  their  high  sensitivity.  Source  location  is  carried  out  by  starting  and 
stopping  AE  analyzer  clocks  when  the  signals  at  the  transducers  first  cross  a  preset 
threshold. It has been shown in earlier work by this author that such a method can
lead to serious errors in source location [1]. A different source location technique,
called Gaussian cross-correlation, has been developed which takes the wave
propagation in a plate into account and, in principle, is capable of high accuracy [2].
It is not based on threshold crossing. It does, however, require high fidelity waveform
capture and thus the use of broadband transducers. At the kind invitation of Dr.
Marvin Hamstad and Dr. George Sendeckyj, I was allowed to piggyback on
their experiment to see if this new technique would work with real AE sources (as
opposed to artificial sources like lead breaks).
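
As a rough illustration of the idea, the sketch below cross-correlates each waveform with a Gaussian-windowed tone at a single frequency and takes the peak of the correlation magnitude as the arrival of that frequency component. This is only one plausible reading of "Gaussian cross-correlation" and is not taken from the cited work; the function names, the 5 MHz sampling rate, the window width and the synthetic tone bursts are all assumptions made for the example. Only the 80 kHz analysis frequency comes from this report.

```python
import numpy as np


def gaussian_kernel(freq_hz, fs_hz, cycles=2.0):
    """Complex Gaussian-windowed tone; window sigma is `cycles` periods (assumed)."""
    sigma = cycles / freq_hz
    half = int(round(4 * sigma * fs_hz))
    t = np.arange(-half, half + 1) / fs_hz
    return np.exp(-0.5 * (t / sigma) ** 2) * np.exp(2j * np.pi * freq_hz * t)


def arrival_index(waveform, kernel):
    """Sample index where the correlation magnitude with the kernel peaks."""
    corr = np.correlate(waveform, kernel, mode="same")
    return int(np.argmax(np.abs(corr)))


def tone_burst(n, centre_idx, freq_hz, fs_hz, sigma_s):
    """Synthetic stand-in for a flexural-mode packet at one frequency."""
    t = (np.arange(n) - centre_idx) / fs_hz
    return np.exp(-0.5 * (t / sigma_s) ** 2) * np.cos(2 * np.pi * freq_hz * t)


FS_HZ = 5.0e6      # assumed digitization rate
FREQ_HZ = 80.0e3   # analysis frequency mentioned in the report
N = 4000

# Two hypothetical sensors: the same packet arriving 150 samples (30 us) apart.
sensor_a = tone_burst(N, 1500, FREQ_HZ, FS_HZ, 25e-6)
sensor_b = tone_burst(N, 1650, FREQ_HZ, FS_HZ, 25e-6)

kernel = gaussian_kernel(FREQ_HZ, FS_HZ)
delay_samples = arrival_index(sensor_b, kernel) - arrival_index(sensor_a, kernel)
delay_us = delay_samples / FS_HZ * 1e6
print(f"estimated arrival time difference: {delay_us:.1f} us")
```

Converting such time differences into a source position would additionally require the flexural group velocity at the chosen frequency, which is not attempted here.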

Results 

The  AE  pulses  created  by  the  cold  weld  fracture  were  small.  The  flexural 
mode  was  usually  all  that  was  detected.  This  is  a  highly  dispersive  mode  in  a  thin 
plate  which  means  that  it  changes  shape  rapidly  as  it  propagates.  Generally  only  a 
few  events  were  detected  in  each  loading.  Representative  results  using  the  flexural 
mode  are  given  in  the  figures  for  the  two  different  plates  (1/4"  and  1/8"). 

Figure  1  shows  the  measured  locations  of  two  0.3  mm  lead  breaks  done  at  the 
far  crack  tip  and  one  done  at  the  near  tip  on  the  1/4"  plate.  The  actual  break 
locations were as close to the tips as could be determined by eye. A frequency of 80 kHz was
selected  for  the  location  calculations  since  this  appeared  in  all  of  the  waves  which 
could  be  analyzed.  Other  frequencies  can  also  be  used.  The  box  lines  represent  the 
plate  and  the  crack  is  shown  in  scale. 

Figure  2  shows  the  measured  locations  of  actual  AE  events.  It  can  be  seen 
that the transducers were placed in a different arrangement than in Test 1. Different
arrangements of the transducers were tried for various reasons having mainly to do
with desiring to see the shape of the pulse at different angles to the crack plane. No
effect on source location was expected and none was found, but it probably wouldn't
be noticed with the actual crack events since the source position was known only to
be somewhere behind the crack tips.

Figure  3  shows  the  waveforms  from  one  event  as  measured  by  the  three 
sensors  shown  in  Figure  2.  This  was  the  best  looking  set  of  waveforms  of  the  five 
measured.  Figure  4  gives  an  idea  of  a  more  typical  set.  It  was  difficult  to  extract  the 
information  if  they  were  smaller  than  this. 

A result for the 1/8" plate is shown in Figure 5. There were only two events
measured  during  this  particular  loading. 

Conclusions 

The  results  thus  far  are  quite  encouraging.  The  signals  were  very  small  and 
contained  a  lot  of  noise  but  locations  could  be  obtained  for  some  of  the  larger 
events.  The  accuracy  cannot  be  determined  because  the  exact  location  of  any 
particular  source  itself  was  not  known  (it  was  only  known  that  the  source  was  in  the 
crack  plane  somewhere  behind  the  crack  tips).  The  events  were  all  placed  near  the 
crack  to  within  about  one-half  of  an  inch.  This  is  within  the  error  created  by  the 
method used to position the sensors and the quarter-inch size of the sensors.

Figure 1. Test 1, 0.25" 2024 Al Plate

Figure 2. Test 5, 0.25" 2024 Al Plate

Test 10, 0.1


APPENDIX  II.  Equations  and  their  Partial  Modification 
from  the  SAT  Patent 


From equation (31) in the patent for each wave feature

$$a = \frac{T_2 - T_0}{x_2}, \qquad b = \frac{T_1 - T_3}{y_1 + |y_3|} \qquad \text{II-1}$$

where from figure 4 in the array coordinate system $x_1 = x_3 = 0.500$ in., $x_0 = 0$, $x_2 =
1.000$ in., $y_0 = y_2 = 0$, $y_1 = 0.354$ in., $y_3 = -0.256$ in. and $T_0, T_1, T_2, T_3$ are the relative
arrival times of the selected feature at sensors $p_0, p_1, p_2, p_3$, respectively.

From equation (36) in the patent, corrected for a missing minus sign in the
denominator,

$$R_0 = \frac{\Delta T}{(a_1^2 + b_1^2)^{1/2} - (a_2^2 + b_2^2)^{1/2}} \qquad \text{II-2}$$

where $R_0$ is the range from the source to a sensor (e.g. to sensor $p_0$) and $\Delta T$ is the
arrival time difference between two selected features with subscripts 1 and 2. The $a$
and $b$ values are calculated from equation II-1.

From equation (32) in the patent the group velocities of the two selected features are
given by

$$c = \frac{1}{(a^2 + b^2)^{1/2}} \qquad \text{II-3}$$

where the appropriate subscript is attached.
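
The relations II-1 to II-3 can be exercised numerically. The sketch below reads them as a plane-wave fit across the four-sensor array, i.e. a = (T2 - T0)/x2, b = (T1 - T3)/(y1 + |y3|), c = 1/(a^2 + b^2)^(1/2) and R0 = ΔT/[(a1^2 + b1^2)^(1/2) - (a2^2 + b2^2)^(1/2)]; that reading, the function names, the sign convention for ΔT (feature 1 arrival minus feature 2 arrival at sensor p0) and the synthetic plane-wave data are assumptions for illustration. The sensor coordinates are those given above, and the velocities, range and angle are the assumed values from the footnote of Table 4.

```python
import math

# Sensor coordinates (inches) in the array coordinate system: p0, p1, p2, p3.
SENSORS_IN = ((0.0, 0.0), (0.500, 0.354), (1.000, 0.0), (0.500, -0.256))


def slowness(T, sensors=SENSORS_IN):
    """Return (a, b) as in II-1 from relative arrival times T[0..3] in us."""
    (x0, _), (_, y1), (x2, _), (_, y3) = sensors
    a = (T[2] - T[0]) / (x2 - x0)
    b = (T[1] - T[3]) / (y1 - y3)   # y1 - y3 equals y1 + |y3| because y3 < 0
    return a, b


def group_velocity(a, b):
    """Group velocity as in II-3 (in/us)."""
    return 1.0 / math.hypot(a, b)


def source_range(delta_t, ab_1, ab_2):
    """Range as in II-2; delta_t = (feature 1 arrival) - (feature 2 arrival)."""
    return delta_t / (math.hypot(*ab_1) - math.hypot(*ab_2))


def plane_wave_times(velocity, angle_deg, t_at_p0):
    """Hypothetical arrival times of a plane wave crossing the array."""
    ux = math.cos(math.radians(angle_deg))
    uy = math.sin(math.radians(angle_deg))
    return [t_at_p0 + (ux * x + uy * y) / velocity for x, y in SENSORS_IN]


# Synthetic check: two features travelling 6.0 in at 0.214 and 0.114 in/us.
R_TRUE, C1, C2, ANGLE = 6.0, 0.214, 0.114, 30.0
T_fast = plane_wave_times(C1, ANGLE, R_TRUE / C1)
T_slow = plane_wave_times(C2, ANGLE, R_TRUE / C2)

ab_fast, ab_slow = slowness(T_fast), slowness(T_slow)
c_fast = group_velocity(*ab_fast)
range_est = source_range(T_fast[0] - T_slow[0], ab_fast, ab_slow)
angle_est = math.degrees(math.atan2(ab_fast[1], ab_fast[0]))
print(f"c = {c_fast:.3f} in/us, range = {range_est:.2f} in, angle = {angle_est:.1f} deg")
```

Because the synthetic wavefronts are exactly planar, the inputs are recovered to rounding error; with a real source at a range comparable to the sensor spacing the wavefront curvature introduces the errors that motivate the iterative correction described next.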

Since in the current work the typical range from the source to the sensors is
not much greater than the spacing of the sensors, the simplification used in the patent
to go from equation (24) to equation (30) leads to errors in the determination of the
components of the unit vector $\hat{u} = [u_x, u_y, 0]$ from the source to the sensor $p_0$. (Note:
Following the patent, we ignore plate thickness and determine the two-dimensional
source position in this work.) To obtain accurate results, equation (30) from the
patent was used to find trial values for $u_x$ and $u_y$ (using both of the features and
averaging the results):

$$u_x = \frac{a}{(a^2 + b^2)^{1/2}} \qquad \text{II-4(a)}$$

$$u_y = \frac{b}{(a^2 + b^2)^{1/2}} \qquad \text{II-4(b)}$$

Then from equation (29) in the patent (corrected)


5,  R,  r. 


Where  2,  is  the  unit  vector  in  the  sensor  coordinate  system  (with  origin  at  p0)  to  each 
sensor  p,,  p2,  p3  for  i  =  1,  2,  3,  respectively,  R,=  |fl,|  is  the  range  from  the  source  to 
each  sensor,  rt-  |P,  |  is  the  range  from  sensor  p0  to  p,,  p2,  p3,  for  i  =  1, 2, 3,  respectively, 
and  ft,  is  given  by  equation  (9)  from  the  patent 

a  r(rr7>  n-6 

where  c  is  given  by  equation  II-3  for  a  selected  feature.  Also  from  the  patent  equation 


RiuRS*i 


Now after some algebraic manipulations, equations II-5 can be solved for


n  ^2 


II-8(a) 


W(1 4, 1 n-8(b> 


Now since $P_i : [x_i, y_i]$, then using the trial values of $u_x$ and $u_y$ from
equation II-4 and $R_0$ from equation II-2, the values of $R_i$ can be found from equation
(2) in the patent, i.e.

$$R_i^2 = (\vec{R}_0 + \vec{P}_i) \cdot (\vec{R}_0 + \vec{P}_i) \qquad \text{II-9}$$

Then equations II-8 yield two new values for $u_x$ and $u_y$ (one for each feature) with
group velocities $c_1$ and $c_2$. These values were averaged and taken as new trial values,
and the process was repeated one more time, resulting in convergence to the correct
value for $u_x$ and $u_y$.

H-infinity Control Design: A New Approach

by 

Chin  S.  Hsu 

Washington  State  University 

Abstract 

This report addresses the issue of H∞ compensator design based on
full order and reduced order observers. A new approach for robust control
design is proposed. New results pertaining to approximate and precise H∞ loop
transfer recovery (H∞/LTR) have been derived. Numerical examples illustrating
the proposed design procedures are also presented.

H-infinity Control Design: A New Approach

I.  Introduction 

Recent progress on H∞ control theory has provided control engineers with
practical design procedures for developing robust control laws. Though
different approaches of H∞ suboptimal design have been devised, the resultant
compensators are, in general, observer-based [1]. These H∞ design approaches
include the celebrated two-Riccati formulation, the conjugation method, the
bounded-real-lemma method, and the J-spectral factorization method, among others [2].
Along with the development of H∞ output feedback design, H∞ state feedback and
its dual, H∞ filtering, have attracted considerable attention during the past few
years [3].

Motivated by the recent results of disturbance rejection, we propose a
new method of deriving an H∞ compensator design [4]. The new method is based on
the notion that all H∞ suboptimal controllers are observer-based. One
significant benefit of the proposed approach is that it provides reduced-order
H∞ controllers based on reduced-order observers.

Considerable research has been carried out on control
law synthesis using the LQG/LTR and the H2/H∞ methodologies [5]. The
challenging problem of H∞/LTR has also been investigated in recent
years [6],[7]. Influenced by the foregoing robust control design methods, we
developed control design procedures and numerical algorithms which provide H∞
compensators based on full-order or reduced-order observers. Several results
pertaining to approximate and precise H∞/LTR are presented in this report.

To facilitate the readability of this report, we defer the mathematical
derivation of the main results to the Appendix at the end of this report.
Notations and block diagrams of various control systems configurations are given
in Section III for ease of relating the theoretical results to the
physical setting. The key results of this research are collectively presented
in Section II, while numerical examples illustrating the key results can be
found in Section IV.

Finally, Section V offers some concluding remarks and recommendations for
future research.

II.  Summary  of  Main  Results 
2.1 Full State Feedback

We begin with the control system configuration (Figure 1) which is used for
designing $H_\infty$ suboptimal controllers, where $S$ and $S_o$ represent the plant
and its associated observer, respectively. The $H_\infty$ design is concerned with
developing a regulator $K$ and an observer $S_o$ such that the $H_\infty$ norm of the
transfer function matrix from $w$ to $z$ is smaller than a prespecified bound,
while the closed-loop system is internally stable.

The plant $S$ is represented by the following linear system:

$$
\begin{aligned}
\dot{x}(t) &= A\,x(t) + B_1 w(t) + B_2 u(t), \qquad x(0) = x_o \\
z(t) &= C_1 x(t) + D_{11} w(t) + D_{12} u(t) \\
y(t) &= C_2 x(t) + D_{21} w(t)
\end{aligned}
\tag{1}
$$

where $z(t)$ is a vector of performance variables and $w(t)$ represents a
deterministic disturbance. The dimensions of $x(t)$, $u(t)$, $z(t)$, and $y(t)$ are
$n$, $m$, $l$, and $p$, respectively. The following assumptions are standard in
$H_\infty$ design.

$$
\begin{gathered}
\{A, B_1\},\ \{A, B_2\}\ \text{stabilizable} \\
\{A, C_2\},\ \{A, C_1\}\ \text{detectable} \\
C_1^T D_{12} = 0, \qquad D_{12}^T D_{12} > 0, \qquad B_1 D_{21}^T = 0
\end{gathered}
$$

Lemma 1

The closed-loop transfer function matrix (TFM) from $w$ to $z$ is

$$
T_{SF}(s) = D_{11} + (C_1 - D_{12}K)(sI - A + B_2 K)^{-1} B_1
\tag{2}
$$

if state feedback is used, i.e.,

$$
y(t) = x(t) \quad \text{and} \quad u(t) = -K\,x(t).
$$

It is known that a gain matrix $K$ can be obtained by solving a modified Riccati
equation [8] to nearly minimize the $H_\infty$ norm of $T_{SF}(s)$. Suppose that
$X_\infty > 0$ is a solution to

$$
0 = X_\infty A + A^T X_\infty
  + X_\infty\left(\gamma^{-2} B_1 B_1^T - B_2 (D_{12}^T D_{12})^{-1} B_2^T\right) X_\infty
  + C_1^T C_1
\tag{3}
$$

such that $A - B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty$ is stable. Then, if the regulator
gain is computed as

$$
K = (D_{12}^T D_{12})^{-1} B_2^T X_\infty
\tag{4}
$$

the norm of the closed-loop transfer function matrix is bounded by $\gamma$:

$$
\| T_{SF}(s) \|_\infty < \gamma.
\tag{5}
$$


2.2 Full-Order-Observer Based Controller

A full-order observer $S_o$ is of the form

$$
\begin{aligned}
\dot{x}_c(t) &= (A - L C_2)\,x_c(t) + L\,y(t) + B_2 u(t) \\
u(t) &= -K\,x_c(t), \qquad x_c(t) \in R^n
\end{aligned}
\tag{6}
$$

where $L$ denotes the observer gain.

Theorem 1 [9]

The observer-based closed-loop TFM from $w$ to $z$ is

$$
T(s) = T_{SF}(s) + F(s)\,(sI - A + L C_2)^{-1} (B_1 - L D_{21})
\tag{7}
$$

where $F(s)$ is a stable filter described by

$$
F(s) = D_{12} K + (C_1 - D_{12} K)(sI - A + B_2 K)^{-1} B_2 K.
\tag{8}
$$

By examining the result of Theorem 1, we see that once $K$ is obtained by
solving an $H_\infty$ state feedback problem, $F(s)$ is a known TFM of a linear system.
In fact, equation (7) has the form of the well-known one-sided model matching
problem (MMP) if (7) is rewritten as

$$
T(s) = T_{SF}(s) + F(s)\,M(s)
\tag{9}
$$

where

$$
M(s) \triangleq (sI - A + L C_2)^{-1} (B_1 - L D_{21}).
\tag{10}
$$

Note that in (9) both transfer function matrices $T_{SF}(s)$ and $F(s)$ are
stable. The link between $H_\infty$ compensation design and the stable model
matching problem is an important one. As presented in the sequel, this
connection plays a vital role in developing precise $H_\infty$/LTR design procedures.
Moreover, the stable filter $F(s)$ can be considered as a frequency-shaped
filter on which the determination of the observer gain matrix $L$ is predicated
(Figure 2).

The unknown parameter $L$ of (7) appears in a form dual to its counterpart $K$
of (2). Hence, an $H_\infty$ output feedback compensator can be arrived at by
determining the observer gain $L$ such that

$$
\| T(s) - T_{SF}(s) \|_\infty
 = \| F(s)\,(sI - A + L C_2)^{-1} (B_1 - L D_{21}) \|_\infty
 = \| F(s)\,M(s) \|_\infty < \delta
\tag{11}
$$

where $\delta$ is a positive number.
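The decomposition (7)-(10) is a purely algebraic identity, so it can be
checked numerically before any gain is designed. The sketch below is only an
illustration: the plant matrices, the gains K and L, and the test frequency
are arbitrary random values (assumptions, not data from this report), and the
helper name `tf` is hypothetical. It builds the closed loop of (1) and (6)
directly and compares it with T_SF(s) + F(s)M(s) at one point on the
imaginary axis.

```python
import numpy as np

rng = np.random.default_rng(0)
n, q, m, l, p = 3, 2, 1, 2, 2          # hypothetical dimensions

# Arbitrary plant data and gains (illustrative only; no stability is needed
# for the identity itself).
A = rng.standard_normal((n, n))
B1 = rng.standard_normal((n, q))
B2 = rng.standard_normal((n, m))
C1 = rng.standard_normal((l, n))
C2 = rng.standard_normal((p, n))
D11 = rng.standard_normal((l, q))
D12 = rng.standard_normal((l, m))
D21 = rng.standard_normal((p, q))
K = rng.standard_normal((m, n))
L = rng.standard_normal((n, p))


def tf(a, b, c, d, s):
    """Evaluate d + c (sI - a)^{-1} b at the complex frequency s."""
    return d + c @ np.linalg.solve(s * np.eye(a.shape[0]) - a, b)


# Closed loop of plant (1) with observer (6), state (x, x_c), u = -K x_c.
Acl = np.block([[A, -B2 @ K], [L @ C2, A - L @ C2 - B2 @ K]])
Bcl = np.vstack([B1, L @ D21])
Ccl = np.hstack([C1, -D12 @ K])

s = 0.7j
T = tf(Acl, Bcl, Ccl, D11, s)
T_sf = tf(A - B2 @ K, B1, C1 - D12 @ K, D11, s)                   # equation (2)
Fs = tf(A - B2 @ K, B2 @ K, C1 - D12 @ K, D12 @ K, s)             # equation (8)
Ms = tf(A - L @ C2, B1 - L @ D21, np.eye(n), np.zeros((n, q)), s)  # equation (10)

gap = np.abs(T - (T_sf + Fs @ Ms)).max()
print("largest entry of T - (T_SF + F M):", gap)
```

The gap should sit at rounding level for any choice of K and L, which is why
the design of L can be treated separately once K is fixed.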

It is found that the selection of $L$ is essentially a frequency-shaped
state estimator problem. This is solved according to the following theorem.

Theorem 2: Let $Z_\infty > 0$ be a solution to

$$
A Z_\infty + Z_\infty A^T
 + Z_\infty\left(\delta^{-2} X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty
   - C_2^T (D_{21} D_{21}^T)^{-1} C_2\right) Z_\infty
 + B_1 B_1^T = 0
\tag{12}
$$

such that $A - Z_\infty C_2^T (D_{21} D_{21}^T)^{-1} C_2$ is stable and $\delta > 0$. Then,

$$
L = Z_\infty C_2^T (D_{21} D_{21}^T)^{-1}
\tag{13}
$$

gives $\| T(s) - T_{SF}(s) \|_\infty < \delta$.

It is seen from equation (13) that the resulting closed-loop transfer
function matrix $T(s)$ is approximately equal to the closed-loop transfer
function matrix with $H_\infty$ state feedback. To summarize the main results
presented so far, we give a two-step design procedure:

1) Determine a state feedback gain $K$ such that $\| T_{SF}(s) \|_\infty < \gamma$.

2) Determine an observer gain $L$ such that $\| T(s) - T_{SF}(s) \|_\infty < \delta$.

Each step of the design procedure requires the solution of only one Riccati
equation.
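The two-step procedure can be sketched numerically as follows. This is a
minimal illustration under stated assumptions, not the report's algorithm:
the Riccati equations (3) and (12) are solved here through the stable
invariant subspace of the associated Hamiltonian matrix (a standard technique
swapped in for illustration), the function names `care_stabilizing`,
`two_step_design` and `mismatch_peak` are hypothetical, and the double
integrator plant with gamma = 2 and delta = 10 is a made-up example chosen so
that the standing assumptions of Section 2.1 hold.

```python
import numpy as np


def care_stabilizing(A, S, Q):
    """Stabilizing solution X of A'X + XA - X S X + Q = 0 (S may be indefinite)."""
    n = A.shape[0]
    ham = np.block([[A, -S], [-Q, -A.T]])
    eigvals, eigvecs = np.linalg.eig(ham)
    basis = eigvecs[:, eigvals.real < 0]          # stable invariant subspace
    if basis.shape[1] != n:
        raise ValueError("no stabilizing solution for this bound")
    X = np.real(basis[n:, :] @ np.linalg.inv(basis[:n, :]))
    return 0.5 * (X + X.T)


def two_step_design(A, B1, B2, C1, C2, D12, D21, gamma, delta):
    R = D12.T @ D12
    N = D21 @ D21.T
    S2 = B2 @ np.linalg.solve(R, B2.T)
    # Step 1: Riccati equation (3) and regulator gain (4).
    X = care_stabilizing(A, S2 - B1 @ B1.T / gamma**2, C1.T @ C1)
    K = np.linalg.solve(R, B2.T @ X)
    # Step 2: Riccati equation (12), solved in its dual form, and gain (13).
    Sf = C2.T @ np.linalg.solve(N, C2) - X @ S2 @ X / delta**2
    Z = care_stabilizing(A.T, Sf, B1 @ B1.T)
    L = Z @ C2.T @ np.linalg.inv(N)
    return K, L, X, Z


def mismatch_peak(A, B1, B2, C1, C2, D12, D21, K, L, freqs):
    """Largest singular value of F(jw) M(jw), equations (8) and (10), on a grid."""
    eye = np.eye(A.shape[0])
    peak = 0.0
    for w in freqs:
        s = 1j * w
        Fs = D12 @ K + (C1 - D12 @ K) @ np.linalg.solve(s * eye - A + B2 @ K, B2 @ K)
        Ms = np.linalg.solve(s * eye - A + L @ C2, B1 - L @ D21)
        peak = max(peak, np.linalg.svd(Fs @ Ms, compute_uv=False)[0])
    return peak


# Hypothetical plant: double integrator, w = (process disturbance, sensor noise).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B1 = np.array([[0.0, 0.0], [1.0, 0.0]])
B2 = np.array([[0.0], [1.0]])
C1 = np.array([[1.0, 0.0], [0.0, 0.0]])
C2 = np.array([[1.0, 0.0]])
D12 = np.array([[0.0], [1.0]])
D21 = np.array([[0.0, 1.0]])
gamma, delta = 2.0, 10.0

K, L, X, Z = two_step_design(A, B1, B2, C1, C2, D12, D21, gamma, delta)
peak = mismatch_peak(A, B1, B2, C1, C2, D12, D21, K, L, np.logspace(-3, 3, 400))
print("K =", K)
print("L =", L.ravel())
print("peak of ||F M|| on the grid:", peak, " (bound delta =", delta, ")")
```

A frequency sweep only gives a lower estimate of the norm in (11), so it can
reveal a violated bound but does not by itself certify the bound.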


2.3 Reduced-Order-Observer Based Controller

Let us consider again the linear system (1), with a Luenberger observer
(Figure 4),

$$
\begin{aligned}
\dot{\xi} &= F\xi + G_1 y + G_2 u \\
\widehat{Kx} &= H\xi + J y
\end{aligned}
\tag{14}
$$

The observer design equations are known to be [10]

$$
\begin{aligned}
VA - FV &= G_1 C_2 \\
G_2 &= V B_2 \\
J C_2 + H V &= K
\end{aligned}
\tag{15}
$$

Theorem 3

The closed-loop transfer function matrix can be expressed as

$$
T(s) = T_{SF}(s) + F(s)\,M(s)
\tag{16}
$$

where

$$
\begin{aligned}
T_{SF}(s) &= (C_1 - D_{12}K)(sI - A + B_2 K)^{-1} B_1 + D_{11} \\
F(s) &= (C_1 - D_{12}K)(sI - A + B_2 K)^{-1} B_2 + D_{12} \\
M(s) &= H (sI - F)^{-1} (V B_1 - G_1 D_{21}) - J D_{21}
\end{aligned}
\tag{17}
$$

The above theorem is an extension of Theorem 1, as can be seen by letting
$V = I$, $J = 0$, $H = K$, $G_2 = B_2$, and $G_1 = L$; the filter (8) of
Theorem 1 is then $\bar{F}(s) = F(s)K$.

2.4 Exact LTR

An immediate application of Theorem 3 is that exact loop transfer
recovery can be achieved when $M(s) = 0$ (Figure 5 and Figure 3). More
specifically, via (17), we have

Theorem 4

The necessary and sufficient conditions for exact LTR are those
conditions given in (15) and

$$
\begin{aligned}
V B_1 &= G_1 D_{21} \\
J D_{21} &= 0
\end{aligned}
\tag{18}
$$

The existence of the observer gain matrix requires that the condition $p > q$
be satisfied. The observer design can be accomplished using the following
procedures, which are derived from a recent result on precise LTR [11].


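Theorem 5 ($p > q$, $D_{21} \neq 0$)

(i) Perform a QR decomposition of $D_{21}$, $[Q, R] = \mathrm{qr}(D_{21})$:

$$
Q = [\,Q_1 \;\; Q_2\,], \qquad R = \begin{bmatrix} R_1 \\ 0 \end{bmatrix}
\tag{19}
$$

(ii) Let

$$
E = Q^T C_2 = \begin{bmatrix} E_1 \\ E_2 \end{bmatrix}
\tag{20}
$$

(iii) Solve the Lyapunov equation

$$
V (A - B_1 R_1^{-1} E_1) - F V = L_2 E_2
\tag{21}
$$

where $L_2$ is any matrix of proper dimension.

(iv) Let

$$
G_1 = [\,V B_1 R_1^{-1} \;\; L_2\,]\, Q^T
\tag{22}
$$

Given $K$ and $V$, $J$ and $H$ can be computed, if $r = n + q - p$, from

$$
[\,J \;\; H\,] = [\,K \;\; 0\,]
\begin{bmatrix} C_2 & D_{21} \\ V & 0 \end{bmatrix}^{-1}
$$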
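The steps of Theorem 5 can be traced with a short numerical sketch. The
matrices below are the Example 1 data of Section IV as read from this report;
solving the linear equation of step (iii) through a Kronecker-product system
is an implementation choice made here for self-containment (an assumption,
not the report's algorithm). The sign of the Q_2 column returned by a QR
routine is not unique, so V, G_1 and H are determined only up to that sign;
the design equations (15) and (18) hold either way.

```python
import numpy as np

# Example 1 data (Section IV).
A = np.array([[0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [6.0, -11.0, 6.0]])
B1 = np.array([[1.0], [2.0], [1.0]])
B2 = np.array([[0.0], [0.0], [1.0]])
C2 = np.array([[1.0, 2.0, 0.0], [0.0, 1.0, 1.0]])
D21 = np.array([[1.0], [2.0]])
K = np.array([[9.0, -4.5, 10.5]])            # full state feedback gain
F = np.array([[-5.0, 2.0], [0.0, -3.0]])     # observer poles -5, -3
L2 = np.array([[10.0], [-10.0]])             # free matrix

n, q = B1.shape
p = C2.shape[0]
r = F.shape[0]                               # r = n + q - p

# (i) QR decomposition of D21, equation (19).
Q, R = np.linalg.qr(D21, mode="complete")
R1 = R[:q, :]

# (ii) E = Q' C2, equation (20).
E = Q.T @ C2
E1, E2 = E[:q, :], E[q:, :]

# (iii) Solve V (A - B1 R1^{-1} E1) - F V = L2 E2, equation (21),
#       written as a linear system in vec(V) (column-major stacking).
Abar = A - B1 @ np.linalg.solve(R1, E1)
lhs = np.kron(Abar.T, np.eye(r)) - np.kron(np.eye(n), F)
rhs = (L2 @ E2).flatten(order="F")
V = np.linalg.solve(lhs, rhs).reshape((r, n), order="F")

# (iv) Observer gains, equation (22), and G2 = V B2 from (15).
G1 = np.hstack([V @ B1 @ np.linalg.inv(R1), L2]) @ Q.T
G2 = V @ B2

# [J H] = [K 0] [[C2, D21], [V, 0]]^{-1}
big = np.block([[C2, D21], [V, np.zeros((r, q))]])
JH = np.hstack([K, np.zeros((K.shape[0], q))]) @ np.linalg.inv(big)
J, H = JH[:, :p], JH[:, p:]

# Residuals of the design equations (15) and the exact-LTR conditions (18).
residuals = {
    "VA - FV - G1 C2": np.abs(V @ A - F @ V - G1 @ C2).max(),
    "J C2 + H V - K": np.abs(J @ C2 + H @ V - K).max(),
    "V B1 - G1 D21": np.abs(V @ B1 - G1 @ D21).max(),
    "J D21": np.abs(J @ D21).max(),
}
for name, value in residuals.items():
    print(f"{name}: {value:.2e}")
```

All four residuals vanishing means M(s) = 0 in (17), i.e. the reduced-order
observer recovers the state-feedback transfer function exactly.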


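Theorem 6 ($p > q$, $D_{21} = 0$)

(i) Perform a QR decomposition of $B_1$:

$$
[W, S] = \mathrm{qr}(B_1), \qquad
W = [\,W_1 \;\; W_2\,], \qquad S = \begin{bmatrix} S_1 \\ 0 \end{bmatrix}
$$

(ii) Let

$$
\tilde{C} = C_2 W_1, \qquad A_1 = W_2^T A W_1, \qquad \text{and} \qquad A_2 = W_2^T A W_2
$$

(iii) Perform a QR decomposition of $\tilde{C}$:

$$
[Q, R] = \mathrm{qr}(\tilde{C}), \qquad
Q = [\,Q_1 \;\; Q_2\,], \qquad R = \begin{bmatrix} R_1 \\ 0 \end{bmatrix}
$$

and let

$$
E = Q^T C_2 W_2 = \begin{bmatrix} E_1 \\ E_2 \end{bmatrix}
$$

(iv) Solve the Lyapunov equation

$$
Z (A_2 - A_1 R_1^{-1} E_1) - F Z = L_2 E_2
\tag{28}
$$

where $L_2$ is any matrix of proper dimension.

(v) Set

$$
\begin{aligned}
G_1 &= [\,Z A_1 R_1^{-1} \;\; L_2\,]\, Q^T \\
V &= Z W_2^T, \qquad G_2 = V B_2 \\
[\,J \;\; H\,] &= K \begin{bmatrix} C_2 \\ V \end{bmatrix}^{-1}
\end{aligned}
\tag{29}
$$

Numerical examples to illustrate the above design procedures are presented in
Section IV.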

2.5 Controller Design via Model Matching

As mentioned previously in this report, our $H_\infty$ design formulation can be
viewed as a stable one-sided model matching problem (MMP). Recent research on
standard $H_\infty$ control design has resulted in various algorithms for solving
model matching problems. It is felt that the method proposed by Hung [12] is
more useful to this research.

Since $T_{SF}(s)$ and $F(s)$ are known after the full-state feedback gain $K$ is
designed, we can determine $M(s)$ via an appropriate MMP algorithm.

Theorem 7

Let $\{\hat{A}, \hat{B}, \hat{C}, \hat{D}\}$ be a minimal realization of $M(s)$; then the
observer design equations are

$$
\begin{aligned}
V A - \hat{A} V &= G_1 C_2 \\
J C_2 + \hat{C} V &= K \\
V B_1 - G_1 D_{21} &= \hat{B} \\
J D_{21} &= -\hat{D}
\end{aligned}
\tag{30}
$$

From the definition of $M(s)$, the observer parameters $F$ and $H$ are $\hat{A}$ and
$\hat{C}$, respectively. The other observer parameters $J$, $G_1$ and $G_2$ can be
computed in a manner similar to that proposed in Theorem 6. The detailed
algorithm has yet to be worked out. It should be mentioned here that for the
existence of solutions to the design equations (30), the following conditions
must be satisfied:

$$
p > q \qquad \text{and} \qquad r(m + p - q) > m(n + q - p)
\tag{31}
$$

III.  Notations  and  Block  Diagrams 
3.1 Matrices

  A        : n x n               G1       : r x p
  B1       : n x q               G2       : r x m
  B2       : n x m               H        : m x r
  C1       : l x n               J        : m x p
  C2       : p x n               V        : r x n
  D11      : l x q               T(s)     : l x q
  D12      : l x m               T_SF(s)  : l x q
  D21      : p x q               F(s)     : l x n   (Theorem 1)
  K        : m x n               F(s)     : l x m   (Theorem 3)
  L        : n x p               M(s)     : n x q   (Theorem 1)
  X_inf    : n x n               M(s)     : m x q   (Theorem 3)
  Z_inf    : n x n               F        : r x r

Matrices used in Theorem 5:

  Q        : p x p               R        : p x q
  Q1       : p x q               R1       : q x q
  Q2       : p x (p-q)           E        : p x n
  L2       : r x (p-q)           E1       : q x n

Matrices used in Theorem 6:

  W        : n x n               Q        : p x p
  W1       : n x q               Q1       : p x q
  W2       : n x (n-q)           Q2       : p x (p-q)
  S        : n x q               R        : p x q
  S1       : q x q               R1       : q x q
  C~       : p x q               E        : p x (n-q)
  A1       : (n-q) x q           E1       : q x (n-q)
  A2       : (n-q) x (n-q)       E2       : (p-q) x (n-q)
  Z        : r x (n-q)

Matrices used in Theorem 7 (minimal realization of M(s)):

  A^       : r x r               C^       : m x r
  B^       : r x q               D^       : m x q

Figure 1. Observer-Based Compensator


Figure 2. Frequency-Shaped State Estimator

Figure 5 is reduced to Figure


IV.  Simulation  Results 


In this section, we present three numerical examples. The first two
examples are for exact LTR with $D_{21} \neq 0$ and $D_{21} = 0$, respectively. The
second example uses the nominal linearized longitudinal dynamics of the A4D
aircraft at a flight condition of 0.9 Mach and 15,000 ft altitude [13]. The
third example is for approximate LTR, covered in subsections 2.1 and 2.2.

4.1 Example 1 (exact LTR, $D_{21} \neq 0$)

<model data>:

$$
A = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 6 & -11 & 6 \end{bmatrix}, \quad
B_1 = \begin{bmatrix} 1 \\ 2 \\ 1 \end{bmatrix}, \quad
B_2 = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}
$$

$$
C_1 = [\,1 \;\; 3 \;\; 1\,], \quad
C_2 = \begin{bmatrix} 1 & 2 & 0 \\ 0 & 1 & 1 \end{bmatrix}, \quad
D_{21} = \begin{bmatrix} 1 \\ 2 \end{bmatrix}, \quad
D_{12} = 2, \quad D_{11} = 1
$$

<dimension>:

n=3, q=1, m=1, l=1, p=2, r=2

<selection>:

open loop poles : 1, 2, 3

observer poles : -5, -3

free matrix $L_2 = \begin{bmatrix} 10 \\ -10 \end{bmatrix}$, $F = \begin{bmatrix} -5 & 2 \\ 0 & -3 \end{bmatrix}$

<result>:

full state feedback: K = [9  -4.5  10.5]

QR factorization:

$$
Q_1 = \begin{bmatrix} -0.4472 \\ -0.8944 \end{bmatrix}, \quad
Q_2 = \begin{bmatrix} -0.8944 \\ 0.4472 \end{bmatrix}, \quad
R_1 = -2.2361
$$

$E = Q^T C_2$:

E1 = [-0.4472  -1.7889  -0.8944]

E2 = [-0.8944  -1.3416   0.4472]

Solution to Lyapunov equation:

$$
V = \begin{bmatrix} -0.2179 & 0.3538 & 0.3278 \\ 4.8291 & 5.3540 & -0.4199 \end{bmatrix}
$$

observer parameters:

$$
G_1 = \begin{bmatrix} -8.7808 & 4.7991 \\ 11.9677 & 1.5747 \end{bmatrix}, \quad
G_2 = \begin{bmatrix} 0.3278 \\ -0.4199 \end{bmatrix}
$$

J = [-30.5069  15.2535]
H = [-4.2680  7.9885]

<closed loop transfer function>:

with full state feedback,

$$
T(s) = \frac{s^3 - 8.5 s^2 + 336 s - 193.5}{(s+2)(s+1.5)(s+1)}
$$

with reduced-order observer,

$$
T(s) = \frac{(s+5)(s+3)(s^3 - 8.5 s^2 + 336 s - 193.5)}{(s+5)(s+3)(s+2)(s+1.5)(s+1)}
$$

<remark>:

The exact LTR for $D_{21} \neq 0$ can be achieved using a reduced-order
observer.

4.2 Example 2 (exact LTR, $D_{21} = 0$)

<model data>:

$$
A = \begin{bmatrix}
-0.0605 & -32.37 & 0 & 32.2 \\
-0.00014 & -1.475 & 1 & 0 \\
-0.0111 & -34.72 & -2.793 & 0 \\
0 & 0 & 1 & 0
\end{bmatrix}, \quad
B_1 = \begin{bmatrix} 0 \\ 0 \\ 1 \\ 0 \end{bmatrix}, \quad
B_2 = \begin{bmatrix} 0 \\ -0.1064 \\ -33.8 \\ 0 \end{bmatrix}
$$

$$
C_1 = [\,0 \;\; 1 \;\; 0 \;\; 0\,], \quad
C_2 = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad
D_{11} = 0, \quad D_{12} = 0, \quad D_{21} = 0
$$

<dimension>:

n=4, q=1, m=1, l=1, p=2, r=2

<selection>:

observer poles: -4, -5

free matrix $L_2 = \begin{bmatrix} 1 \\ 1 \end{bmatrix}$, $F = \begin{bmatrix} -4 & 0 \\ 0 & -5 \end{bmatrix}$

<result>:

full state feedback:

K = [-1.06312, 16.83668, -1.38415, -32.47965]

QR factorization of $B_1$:

$$
W_1 = \begin{bmatrix} 0 \\ 0 \\ -1 \\ 0 \end{bmatrix}, \quad
W_2 = \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad
S_1 = -1
$$

$$
A_1 = \begin{bmatrix} -1 \\ 0 \\ -1 \end{bmatrix}, \quad
A_2 = \begin{bmatrix} -1.475 & 0.00014 & 0 \\ 32.37 & -0.0605 & -32.2 \\ 0 & 0 & 0 \end{bmatrix}
$$

QR factorization of $\tilde{C}$:

$E = Q^T C_2 W_2$:  E1 = [0  0  0]

E2 = [0  -1  0]

Solution of Lyapunov equation:

$$
Z = \begin{bmatrix} 3.2557 & -0.2540 & -2.0443 \\ 1.8596 & -0.2025 & -1.3041 \end{bmatrix}, \quad
V = \begin{bmatrix} 0.2540 & 3.2557 & 0 & -2.0443 \\ 0.2025 & 1.8596 & 0 & -1.3041 \end{bmatrix}
$$

Observer parameters:

$$
G_1 = \begin{bmatrix} 1 & 1.2113 \\ 1 & 0.5555 \end{bmatrix}, \quad
G_2 = \begin{bmatrix} -0.3464 \\ -0.1979 \end{bmatrix}
$$

J = [-11.6016  -1.3842]
H = [-86.5510  160.5833]

<closed loop transfer function>:

with full state feedback:

$$
T(s) = \frac{0.8527\,(s-4.9126)(s+0.9205)}{(s+30.2167)(s+16.3442)\,[(s+1.3802)^2 + 1.2147^2]}
$$

with reduced-order observer:

$$
T(s) = \frac{0.8527\,(s+4)(s+5)(s-4.9126)(s+0.9205)}{(s+4)(s+5)(s+30.2167)(s+16.3442)\,[(s+1.3802)^2 + 1.2147^2]}
$$

<remark>:

The exact LTR for $D_{21} = 0$ can be achieved using a reduced-order
observer.


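4.3 Example 3 (Approximate LTR) [14]

<model data>:

$$
A = \begin{bmatrix} -14 & 100 & 0 \\ 1 & 0 & 0 \\ 0 & -10 & 0 \end{bmatrix}, \quad
B = [\,B_1 \;\; B_2\,] = \left[\begin{array}{cccc|c}
0 & 0.1 & 0 & 0 & 100 \\
0 & 0 & 0 & 0 & 0 \\
1 & 0 & 0 & 0 & 0
\end{array}\right]
$$

$$
C = \begin{bmatrix} C_1 \\ C_2 \end{bmatrix} = \left[\begin{array}{ccc}
0 & 0 & 0.5 \\
0 & 0 & 0 \\ \hline
0 & 0 & 1 \\
1 & 5 & 0
\end{array}\right]
$$

D = [D11 D12; D21 0]  (the entries of D are not legible in the source scan)

<dimension>:

n=3, q=4, m=1, l=2, p=2

<design>:

1. Check the 6 initial assumptions.

2. Choose a $\gamma$ and $\delta$ pair.

3. Compute $X_\infty > 0$, then $K$.

4. Compute $Z_\infty > 0$, then $L$.

5. Check for system stability ($\mathrm{Re}\,\lambda(A - L C_2) < 0$).

6. Compute the $H_\infty$ norm $\| T(s) - T_{SF}(s) \|_\infty$.

7. If any condition fails, repeat steps 2-6.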




<system  configuration  l  c^i,  R^ur«  2  H\4.  p6^* . 

<remark>: 

For a more complete analysis, ($\gamma$=1, $\delta$=1) and ($\gamma$=1, $\delta$=0.548) were chosen.
Figures 12-15 are the corresponding sigma-plots for $T_F(s) = T(s) - T_{SF}(s)$ and
$T_{zw}(s)$. The trade-off between attenuation and bandwidth is apparent. The
bandwidth for ($\gamma$=1, $\delta$=1) is approximately 8 rad/s and for ($\gamma$=1, $\delta$=0.548),
approximately 2000 rad/s. The corresponding step responses are given in
Figures 17-20. The step was applied to the $y_{1c}$ command reference input while
the other exogenous inputs were kept at zero. The design intent is for $y_1$ to
track the command reference input. According to Figures 19-20, the design
objective is achieved with $t_{63\%}$ = 0.996 sec for ($\gamma$=1, $\delta$=1), and
$t_{63\%}$ = 0.711 sec for ($\gamma$=1, $\delta$=0.548). The latter parameter pair provides a
performance close to the design found in the paper, $t_{63\%}$ = 0.6 sec. The
output $z_1$ is the accumulated error $e_{ac}$. For Figures 17 and 18, $e_{ac}$ = 0.52
and $e_{ac}$ = 0.36, respectively. These plots are in direct causal agreement with
the output step responses $y_1$ (Figures 19-20). That is, a quicker response
will produce less accumulated error (over-damped case). The output $z_2$ is the
control vector $u$. The negative value indicates the controller is trying to
drive the output $z_1$ to zero; however, this will never happen since there will
always be accumulated error.

Eigenvalues:

  ($\gamma$=1, $\delta$=1):       $\lambda(A - L C_2)$ = -19.21, -8.02, -1.06
  ($\gamma$=1, $\delta$=0.548):   $\lambda(A - L C_2)$ = -2407.7, -19.21, -1.63

  (The remaining columns of this table are not legible in the source scan.)

Notice that one of the poles is weakly observable, $\lambda_3(A) \approx -19.21$. The pole is

Figure 16


Figure 19

gamma = 1.0000
delta = 1.0000

X =
     1.411e-03    2.969e-02   -5.097e-03
     2.969e-02    6.416e-01   -1.428e-01
    -5.097e-03   -1.428e-01    9.893e-02

lambda(X) =
     3.281e-06 +0.000e+00j
     6.782e-01 +0.000e+00j
     6.374e-02 +0.000e+00j

Z =
     3.288e+00    6.314e-01   -1.269e+00
     6.314e-01    1.213e-01   -2.437e-01
    -1.269e+00   -2.437e-01    1.652e+00

lambda(Z) =
     4.093e+00 +0.000e+00j
     9.681e-01 +0.000e+00j
     9.260e-06 +0.000e+00j

K =
     1.411e-01    2.969e+00   -5.097e-01

L =
    -1.269e+00    6.445e+00
    -2.437e-01    1.238e+00
     1.652e+00   -2.488e+00

lambda(A-LC2) =
    -1.921e+01 +0.000e+00i
    -8.016e+00 +0.000e+00i
    -1.063e+00 +0.000e+00i

||T-Tsf|| = 8.344335e-01 @ w = 0.000

Design Status:

    X positive definite:             passed.
    Z positive definite:             passed.
    Feedback system stable:          passed.
    Feedback system norm < delta:    passed.

gamma = 1.0000
delta = 0.5480

X =
     1.411e-03    2.969e-02   -5.097e-03
     2.969e-02    6.416e-01   -1.428e-01
    -5.097e-03   -1.428e-01    9.893e-02

lambda(X) =
     3.281e-06 +0.000e+00j
     6.782e-01 +0.000e+00j
     6.374e-02 +0.000e+00j

Z =
     5.114e+02    9.822e+01   -4.778e+02
     9.822e+01    1.887e+01   -9.178e+01
    -4.778e+02   -9.178e+01    4.492e+02

lambda(Z) =
     9.780e+02 +0.000e+00j
     1.477e+00 +0.000e+00j
     9.261e-06 +0.000e+00j

K =
     1.411e-01    2.969e+00   -5.097e-01

L =
    -4.778e+02    1.003e+03
    -9.178e+01    1.926e+02
     4.492e+02   -9.367e+02

lambda(A-LC2) =
    -2.408e+03 +0.000e+00i
    -1.921e+01 +0.000e+00i
    -1.633e+00 +0.000e+00i

||T-Tsf|| = 5.474570e-01 @ w = 106.525

Design Status:

    X positive definite:             passed.
    Z positive definite:             passed.
    Feedback system stable:          passed.
    Feedback system norm < delta:    passed.
V. Conclusions and Recommendations for Future Research

The main task of the research under the present contract has been to develop
a new approach for $H_\infty$ control design. The ultimate objective is to devise
$H_\infty$ theory-oriented numerical algorithms which would offer reduced-order
$H_\infty$ compensators.

In this report, we have presented two $H_\infty$ design results. The first result
is for approximate $H_\infty$ loop transfer recovery based on full-order observers,
while the second result is for exact (precise) LTR based on reduced-order
observers. Three numerical examples are also included to demonstrate the
use of the presented design procedures. Note that the two results are not
meant to be used for the same control design problem, since the technical
assumptions leading to the two results are different.

Several research topics are recommended for future research:

1. Use the notion of 'robust observer design' to extend the
presented results to control systems with parametric
uncertainties.

2. Revise the existing results in $H_\infty$ state feedback design so that
the obtained gain matrix K can be used directly for exact LTR via
reduced-order observers.

3. Perform more computer simulations using practical control
examples to assess the numerical aspects of the proposed design
algorithms.

4. Compare the proposed exact $H_\infty$/LTR method with that of Niemann et al.,
who formulated $H_\infty$/LTR as a singular $H_\infty$ control problem.
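Appendix: Derivation of Main Results

We shall present the derivation of Theorem 2, Theorem 3 and Theorem 5 in
the sequel. Theorem 1 is a special case of Theorem 3, and hence omitted.
Theorem 4 follows from Theorem 3, while Theorem 6 is very similar to Theorem 5.

Appendix A (Proof of Theorem 2)

Proof of Theorem 2: Figure 2 gives a realization of the transfer
function matrix $T(s) - T_{SF}(s) = F(s)\,\Phi_o(s)\,(B_1 - L D_{21})$, where
$\Phi_o(s) = (sI - A + L C_2)^{-1}$. A state-space description is:

$$
\begin{aligned}
\begin{bmatrix} \dot{x}_o(t) \\ \dot{x}_f(t) \end{bmatrix}
 &= F \begin{bmatrix} x_o(t) \\ x_f(t) \end{bmatrix} + G\,u_o(t) \\
y_f(t) &= H \begin{bmatrix} x_o(t) \\ x_f(t) \end{bmatrix}
\end{aligned}
$$

where

$$
F = \begin{bmatrix} A - L C_2 & 0 \\ B_2 K & A - B_2 K \end{bmatrix}, \qquad
G = \begin{bmatrix} B_1 - L D_{21} \\ 0 \end{bmatrix}, \qquad
H = [\,D_{12}K \;\;\; C_1 - D_{12}K\,].
$$

The problem is to find $L$ such that the $H_\infty$ norm of the transfer
function between $u_o$ and $y_f$ is less than $\delta$. This norm bound will be
satisfied if $F$ is stable and there exists an $X \ge 0$ such that

$$
U = F^T X + X F + \delta^{-2} X G G^T X + H^T H \le 0.
$$

Let $X = \mathrm{block\ diag}(X_1, X_\infty)$ and partition $U$ into four $n \times n$ blocks:

$$
U = \begin{bmatrix} U_{11} & U_{12} \\ U_{21} & U_{22} \end{bmatrix}
$$

where

$$
\begin{aligned}
U_{22} &= X_\infty (A - B_2 K) + (A - B_2 K)^T X_\infty + C_1^T C_1
          + X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty \\
       &= X_\infty A - X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty
          + A^T X_\infty - X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty \\
       &\quad + C_1^T C_1 + X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty \\
       &= -\gamma^{-2} X_\infty B_1 B_1^T X_\infty
\end{aligned}
$$

by using the state feedback Riccati equation (3). The off-diagonal
blocks vanish:

$$
U_{12} = U_{21}^T
 = X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty
 - X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty = 0.
$$

Finally, the upper left-hand block is also made to vanish.

$$
\begin{aligned}
U_{11} &= X_1 (A - L C_2) + (A - L C_2)^T X_1
          + \delta^{-2} X_1 (B_1 B_1^T + L D_{21} D_{21}^T L^T) X_1
          + X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty \\
       &= X_1 A + A^T X_1 + \delta^{-2} X_1 B_1 B_1^T X_1
          + X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty \\
       &\quad + (\delta^{-1} X_1 L D_{21} D_{21}^T - \delta C_2^T)
          (D_{21} D_{21}^T)^{-1}
          (\delta^{-1} X_1 L D_{21} D_{21}^T - \delta C_2^T)^T \\
       &\quad - \delta^{2} C_2^T (D_{21} D_{21}^T)^{-1} C_2
\end{aligned}
$$

If

$$
\delta^{-1} X_1 L D_{21} D_{21}^T = \delta C_2^T
$$

and

$$
0 = X_1 A + A^T X_1 + \delta^{-2} X_1 B_1 B_1^T X_1
  + X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty
  - \delta^{2} C_2^T (D_{21} D_{21}^T)^{-1} C_2
\tag{*}
$$

then $U_{11} = 0$. If the solution to (*) is $X_1 > 0$, then $X \ge 0$ and we can
solve for $L$ as

$$
L = \delta^{2} X_1^{-1} C_2^T (D_{21} D_{21}^T)^{-1}.
$$

Equation (*) can then be rewritten as

$$
0 = A Z_\infty + Z_\infty A^T
  + Z_\infty\left(\delta^{-2} X_\infty B_2 (D_{12}^T D_{12})^{-1} B_2^T X_\infty
    - C_2^T (D_{21} D_{21}^T)^{-1} C_2\right) Z_\infty + B_1 B_1^T
$$

where

$$
Z_\infty = \delta^{2} X_1^{-1}.
$$

In this case, $L$ is defined as in equation (13).

Since $U_{22} \le 0$ and the remaining blocks of $U$ are identically 0,
$U \le 0$. All that remains to show is that $F$ is stable. This follows
easily from the fact that $A - B_2 K$ and $A - L C_2$ were required to be stable
at the state feedback and observer stages of the design procedure, respectively.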


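Appendix B (Proof of Theorem 3)

From equations (1), (14) and (15), we have

$$
\begin{bmatrix} \dot{x} \\ \dot{\xi} \end{bmatrix}
= \begin{bmatrix} A - B_2 J C_2 & -B_2 H \\ G_1 C_2 - G_2 J C_2 & F - G_2 H \end{bmatrix}
  \begin{bmatrix} x \\ \xi \end{bmatrix}
+ \begin{bmatrix} B_1 - B_2 J D_{21} \\ (G_1 - G_2 J) D_{21} \end{bmatrix} w
\tag{B.1}
$$

$$
z = [\,C_1 - D_{12} J C_2 \;\;\; -D_{12} H\,]
    \begin{bmatrix} x \\ \xi \end{bmatrix}
  + (D_{11} - D_{12} J D_{21})\, w
\tag{B.2}
$$

Applying the coordinate transformation $\eta = \xi - V x$ to (B.1) gives

$$
\begin{bmatrix} \dot{x} \\ \dot{\eta} \end{bmatrix}
= \begin{bmatrix} A - B_2 K & -B_2 H \\ 0 & F \end{bmatrix}
  \begin{bmatrix} x \\ \eta \end{bmatrix}
+ \begin{bmatrix} B_1 - B_2 J D_{21} \\ -V B_1 + G_1 D_{21} \end{bmatrix} w
\tag{B.3}
$$

$$
z = [\,C_1 - D_{12} K \;\;\; -D_{12} H\,]
    \begin{bmatrix} x \\ \eta \end{bmatrix}
  + (D_{11} - D_{12} J D_{21})\, w
\tag{B.4}
$$

The closed-loop transfer function matrix $T(s)$ can be obtained from (B.3)
and (B.4),

$$
T(s) = [\,C_1 - D_{12} K \;\;\; -D_{12} H\,]
\begin{bmatrix} sI - A + B_2 K & B_2 H \\ 0 & sI - F \end{bmatrix}^{-1}
\begin{bmatrix} B_1 - B_2 J D_{21} \\ -V B_1 + G_1 D_{21} \end{bmatrix}
+ D_{11} - D_{12} J D_{21}
\tag{B.5}
$$

Expanding the right-hand side of (B.5) yields, after some algebraic
manipulation, the expected result

$$
T(s) = T_{SF}(s) + F(s)\,M(s)
\tag{B.6}
$$

where $F(s)$ and $M(s)$ are defined as in (17).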

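Appendix C (Proof of Theorem 5)

The proof of this theorem can be established by verifying $G_1 D_{21} = V B_1$ and
$V A - F V = G_1 C_2$. From (22),

$$
G_1 = [\,V B_1 R_1^{-1} \;\; L_2\,]\, Q^T, \quad \text{where } Q \text{ is an orthogonal matrix,}
$$

which implies

$$
\begin{aligned}
G_1 D_{21} &= [\,V B_1 R_1^{-1} \;\; L_2\,]\, Q^T Q \begin{bmatrix} R_1 \\ 0 \end{bmatrix} \\
           &= [\,V B_1 R_1^{-1} \;\; L_2\,] \begin{bmatrix} R_1 \\ 0 \end{bmatrix} \\
           &= V B_1, \quad \text{since } Q^T Q = I.
\end{aligned}
$$

Next, from (20) and (21) we have

$$
\begin{aligned}
V A - F V &= V B_1 R_1^{-1} E_1 + L_2 E_2 \\
          &= [\,V B_1 R_1^{-1} \;\; L_2\,] \begin{bmatrix} E_1 \\ E_2 \end{bmatrix} \\
          &= [\,V B_1 R_1^{-1} \;\; L_2\,]\, Q^T C_2 = G_1 C_2, \quad \text{since } Q Q^T = I.
\end{aligned}
$$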
ABSTRACT


The objective of this study is to determine the stress wave
(or pulse) propagation through the thickness of a graphite-epoxy
laminated plate at room temperature and at low temperature.

This is part of an overall study to understand the damage of
these plates under normal projectile impact. Upon a sharp impact,
the stress wave propagates from the impact point into the rest
of the material. It is found that the imbedded polyvinylidene
fluoride sensors are able to predict the wave velocities and
wave attenuation.


1. INTRODUCTION


The impact behavior of laminated plates is an important topic
since composite plates are known to respond to impact loading and
energy dissipation in a very different way than metallic plates.
In fact, impact resistance is one of the most serious weaknesses of
composite material plates. Excellent recent survey articles on this
topic were reported by Abrate (1991) and Cantwell and Morton (1991).
The impact of metallic plates by spherical balls was reported by
Goldsmith (1960), Goldsmith and Lyman (1960) and more recently by
Sondergaard et al. (1990).


The concept of imbedded strain gages was employed by Daniel
and Wooh (1985, 1990) to study the deformation and damage of composite
laminates under impact loading. The characteristic features of the
strain records are associated with specific failure modes of the
laminates. The load history, imparted energy and transient strains
at various locations through the thickness were obtained. The wave
propagation behavior of composites using imbedded gages was not discussed.
Wave propagation results for transversely impacted composite laminates were
obtained by Kim and Moon (1979) and Daniel et al. (1979), but no imbedded
sensors were used for the wave propagation studies.


The objective of this work is to examine the wave velocities and
wave attenuation in the thickness direction using the PVDF sensors which
are imbedded in the interior of the laminate. Upon a sharp impact,
the stress wave propagates from the impact point into the rest of
the material. Immediately below the impact point, over a small area,
the stress wave can be assumed to propagate with a plane front in
the thickness direction. A plane pressure sensor of relatively small
dimension will respond to this propagating stress wave front. A series
of such sensors embedded at various depths in the interfaces of the laminate
will respond to this incoming stress pulse in the sequence in which the
sensors meet the pulse. The distance between two sensors, divided by the
time difference between the start of their successive pulses, gives
the velocity of the stress wave between the sensors.
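The time-of-flight idea in the last sentence can be written down in a few
lines. The sketch below is illustrative only: the function name
`wave_speeds_ft_per_s`, the sensor depths and the arrival times are
hypothetical values, not measurements from this study.

```python
def wave_speeds_ft_per_s(depths_in, arrival_times_s):
    """Speed between successive sensors: spacing divided by pulse delay.

    depths_in        sensor depths below the impact surface, in inches
    arrival_times_s  time at which each sensor's pulse starts, in seconds
    """
    if len(depths_in) != len(arrival_times_s):
        raise ValueError("one arrival time is needed per sensor")
    speeds = []
    for k in range(1, len(depths_in)):
        spacing_in = depths_in[k] - depths_in[k - 1]
        delay_s = arrival_times_s[k] - arrival_times_s[k - 1]
        if delay_s <= 0:
            raise ValueError("pulses must arrive in order of increasing depth")
        speeds.append(spacing_in / delay_s / 12.0)   # in/s converted to ft/s
    return speeds


# Hypothetical example: three sensors, each 0.041 in deeper than the last.
depths = [0.041, 0.082, 0.123]
arrivals = [1.0e-6, 1.5e-6, 2.1e-6]
speeds = wave_speeds_ft_per_s(depths, arrivals)
print([round(v) for v in speeds])
```

A drop in speed from one sensor pair to the next, as in this made-up data,
would point to a slower layer or to an error in picking the pulse start.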
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The  present  impact  prcpiem  is  concernea  wit.n  extremely  short  durati 
pulse  so  that  the  waveiengip  is  snort  relative  to  me  thickness  of 
tne  i nai vidual  lamina.  Suer  snort-wavelength  puise  is  especially  neeaec 
in  laminates  wnicn  contain  relatively  few  layers  since  one  neeas  to 
examine  the  reflection,  transmission  ana  superposition  of  wave 
tnrougn  a  laminatea  plate. 


There  are  three  pnases  involving  the  impact  of  laminated  plates 
cue  to  these  tiny  spherical  projectiles.  They  are  (i)  drop  test 
of  these  projectiles  wnicn  does  not  cause  aamage  (ii)  using  a  gas  gun  to 
shoot  these  projectiles  wnicn  will  cause  partial  aamage  but  still 
no  perforation  (iii)  with  perforation.  The  present  work  is  concerned 
with  the  first  phase  wnicn  is  necessary  to  demonstrate  the  validity  of 
the  techniques.  Further,  the  velocity  of  the  stress  wave  will  enable 
one  to  measure  the  Young’s  modulus  in  the  thickness  direction,  which 
is  known  to  be  hard  to  measure.  The  drop  test  enaDles  one  to 
aetermine  the  compressive  stresses  of  a  particular  iamina  as  the 
wave  propagates  towaras  the  free  back  surface  ana  the  superposition 
of  the  tensile  stresses  as  the  wavea  reflected  from  free  back  surface. 


The  use  of  piezoelectric  polymer  as  a  material  which  transforms  an 
electric  field  to  a  small  mechanical  mechanical  aeformation  directly 
tnrougn  a  readjustment  of  internal  polarization  is  well  known  (Cundari 
and  Abedian  1991).  The  polyviny lidene  fluoride  (PVDF)  piezoelectric 
sensors  are  imbedded  in  the  interior  of  the  laminated  plate. 


The  purpose  of  these  sensors  is  to  enable  one  to  "look"  inside 
the  composite  specimens  and  to  determine  the  sequence  and  propagation 
of  the  stress  waves.  Of  particular  concern  is  the  duration  of  stress 
wave  as  it  crosses  a  lamina  due  to  impact  and  the  reflection  of 
waves  from  the  back  free  surface.  It  appears  that  the  data  collection 
Nicolet  instruments  was  sensitive  enough  to  measure  tne  stress 
pulse  through  the  charge  in  the  PVDF  and  hence  the  force  applied  to 
tne  sensor  within  the  laminate.  The  ultimate  goal  is  to  understand 
the  "process"  ,  rather  than  the  final  stage,  of  damage  due  to  the 
reflection  and  refraction  of  these  waves  causing  oelami nation , 
matrix  cracking  and  fiber  breakage. 


The wavelength of the pulse is assumed to be short relative
to the lamina thickness but long relative to the diameter of the
individual fiber. Thus, the material is governed by the effective
properties of the equivalent homogeneous material. The "interface"
effects are neglected and the analysis is thus identical to that
of the homogeneous material.


The laminated plate consists of four sets of layers (each set
consists of seven layers). The plate dimension is 4" by 4". The
plate is clamped in a fixture so as to become a circular plate
with a diameter of three inches. The fixture was used in previous
experiments involving Hopkinson bar tests (see Altamirano 1991).


2.  WAVE SPEED AND ATTENUATION


As a first approximation, the "interface" effects are neglected
and the composite plates can be modelled as a homogeneous isotropic
material. The longitudinal wave speed is (Kolsky 1963)

\[ V_L = \sqrt{\frac{E(1-\nu)}{\rho(1+\nu)(1-2\nu)}} \]

and the two transverse wave speeds are identical and they are

\[ V_T = \sqrt{\frac{E}{2\rho(1+\nu)}} \]

In the above, E is Young's modulus, ν is Poisson's ratio and
ρ is the density of the material. Note that the above velocities are
independent of the frequency, so that for plane waves a pulse shape
composed of a spectrum of frequencies can propagate without
distortion of its shape. The laminae are manufactured by Fiberite
(see Table 1).


Using the following material properties,

E_t = transverse compressive modulus = 1.2 x 10**6 psi
ν = Poisson's ratio = 0.25
ρ = density = 0.055 lb-mass/in**3
g = gravity = 386 in/sec**2


one obtains the longitudinal wave speed V_L = 8382 ft/sec.
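
The arithmetic behind this value can be checked with a short script. The following is a minimal sketch, assuming the standard bulk longitudinal wave speed for an isotropic elastic medium and the property values listed above; the function and variable names are illustrative and do not come from the original work.

```python
import math

def longitudinal_wave_speed(E_psi, nu, density_lbm_in3, g_in_s2=386.0):
    """Bulk longitudinal wave speed (in/sec) in an isotropic elastic medium."""
    # Convert lb-mass/in**3 to a consistent mass density (lbf*sec**2/in**4).
    rho = density_lbm_in3 / g_in_s2
    # Constrained modulus E(1 - nu) / ((1 + nu)(1 - 2 nu)).
    constrained_modulus = E_psi * (1.0 - nu) / ((1.0 + nu) * (1.0 - 2.0 * nu))
    return math.sqrt(constrained_modulus / rho)

v_in_per_s = longitudinal_wave_speed(1.2e6, 0.25, 0.055)
v_ft_per_s = v_in_per_s / 12.0
print(f"V_L = {v_ft_per_s:.0f} ft/sec")
```

With g = 386 in/sec**2 the script gives a speed within about 0.1% of the 8382 ft/sec quoted above; the small difference is consistent with rounding of g.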


The experimental wave speed can be measured with any two of the three
imbedded sensors. The wave speed, measured by two sensors which
are seven plies apart with a transit time of 0.5 microsecond (that is,
500 nanoseconds), is found from

V_L = lamina thickness/propagation time = (0.041/12)/(500x10**-9)
= 6833 ft/sec


It appears that the presence of the additional epoxy in the PVDF
sensors causes the wave to travel at a slower speed. The measured
velocity corresponds to a transverse modulus of 0.8x10**6 psi.
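
The two numbers in this comparison can be reproduced as follows. This is a sketch only: it assumes that density and Poisson's ratio are unchanged, so that the wave speed scales with the square root of the modulus, and it takes 8382 ft/sec and 1.2x10**6 psi as the reference pair. The helper names are hypothetical.

```python
def measured_wave_speed_ft_s(thickness_in, transit_time_s):
    """Wave speed from sensor spacing (inches) and transit time (seconds)."""
    return (thickness_in / 12.0) / transit_time_s

def effective_modulus_psi(E_ref_psi, v_ref, v_measured):
    """Modulus implied by a measured speed, for fixed density and Poisson's ratio."""
    # Speed is proportional to sqrt(E), so E scales with the speed ratio squared.
    return E_ref_psi * (v_measured / v_ref) ** 2

v_meas = measured_wave_speed_ft_s(0.041, 500e-9)
E_eff = effective_modulus_psi(1.2e6, 8382.0, v_meas)
print(f"measured speed = {v_meas:.0f} ft/sec, implied modulus = {E_eff:.2e} psi")
```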


The wave propagation is accompanied by attenuation of its
amplitude due to three major effects: (i) geometric attenuation,
(ii) interfacial friction between fibers and matrix and (iii) the
interlaminar friction between adjacent plies. Geometric attenuation
is due to the spherical spreading of the wave
from the point of impact. Normally, the geometric
attenuation is predominant over the remaining effects. However, since
the distance of wave propagation in the thickness direction is so
small compared to the planar direction, it is not necessarily the
dominant factor.


The dispersion effects are expected to be small since all
layers (with different stacking orientations) are identical in the
thickness direction. Careful calibration of the PVDF sensor is needed:
the force on the PVDF sensor is proportional to the charge, and
the proportionality constants, once determined, are found to be
a function of the applied force.


The analysis is confined to the transient stage since one is
interested in determining the process of damage in the first crucial
nanoseconds. The subsequent vibration problem in the transition from transient
to steady state stage is an important problem. Since one is interested
in the initial process of damage, particular emphasis is placed on the
wave velocities and attenuation in the transient stage. The subsequent
steady state vibration problem is also of interest as it gives the complete
damping process involving wave attenuation.


3.  EXPERIMENTAL WORK


The experimental setup for the drop test is shown in Figure 1.
The square sample plate is clamped in a fixture to produce a circular
plate with a diameter of three inches. The plies and orientations
of the laminate are

\[ (0^\circ_7,\ 45^\circ_7,\ 90^\circ_7,\ 0^\circ_7) \]

A schematic diagram of the impactor and the clamped circular plate and
typical stress versus time curves are shown in Figure 2.


The PVDF sensors of 28 micron (1 micron = 10**-6 meter) thickness
are imbedded at every seven-ply interval, where adjacent plies change
orientation. Two thin lead wires of diameter 41 standard wire gauge are
soldered onto each PVDF sensor, extended out to the edge of the
plate and connected to the digital oscilloscope. The velocity
measurements are made by monitoring the pressure sensed by two
consecutive sensors with a high resolution digital oscilloscope capable
of sampling at 50 nanosecond intervals. Attenuation and energy of the
stress pulses were monitored by a four channel digital oscilloscope
having a sampling interval of 500 nanoseconds.


The stress waves are generated by dropping spherical steel balls
of six different sizes (0.039", 0.219", 0.250", 0.344", 0.500" and
0.563" diameters). The resulting stress versus time curves for
each of these balls are presented in Figure 3. The waveforms are
recorded on the high speed digital waveform oscilloscope as shown
in Figure 4. A typical waveform is shown in Figure 5a, and the
repeatability of the waveforms from the PVDF sensors over eight
consecutive drop ball impacts is demonstrated in Figure 5b.


4.  RESULTS 

Figure 6 shows the front part of the plot of two waves generated
by the drop of a one millimeter diameter steel ball. The two
complete waveforms are also shown in this figure. The time difference
between the start of the stress waves shown in Figure 6 is the time
that the wave has taken to propagate from the first sensor to the
next sensor through a thickness of seven plies (0.041"). The
velocity is computed by dividing this distance by the time interval.


Figure 7 shows the amplitude decay of the stress wave as it
propagates through the three consecutive sensors located at seven
ply intervals. It can be seen that as the wave propagates, the
peak amplitude decays. These tests are repeated with six different
sizes of steel balls. For each ball, five measurements of the
waveform decay are made. This figure depicts the waveform recorded
from the drop of a single size ball (1 mm) to demonstrate the
repeatability of the tests.

The energy absorption by the plate due to impact can be
computed from the difference between the initial kinetic energy
and the rebound kinetic energy. The energy absorption E_a can be
compared with the energy of the stress wave E_w recorded by the
first PVDF sensor using

\[ E_w = \frac{AC}{E} \int \sigma^2 \, dt \]

where A is the equivalent affected area, C is the wave speed, σ is
the stress, t is time and E is Young's modulus. The discrepancy
between E_a and E_w is due to the attenuation of the wave, which is
due to the three effects mentioned above. Such discrepancy for
different sizes of the balls will enable one to estimate the
equivalent affected area.
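
The comparison between the absorbed energy and the stress-wave energy can be sketched numerically. The example below is illustrative only: the pulse shape, the ball mass and velocities, and the affected area are hypothetical values chosen for the sketch, and the integral of the squared stress is evaluated with the trapezoidal rule. Only the wave speed and modulus are taken from the text (converted to SI units).

```python
import math

def absorbed_energy(mass_kg, v_impact_m_s, v_rebound_m_s):
    """Kinetic energy lost by the ball between impact and rebound (J)."""
    return 0.5 * mass_kg * (v_impact_m_s ** 2 - v_rebound_m_s ** 2)

def wave_energy(area_m2, wave_speed_m_s, modulus_pa, stress_pa, dt_s):
    """Stress-wave energy (A*C/E) * integral(sigma**2 dt), trapezoidal rule (J)."""
    integral = sum(0.5 * (s0 ** 2 + s1 ** 2) * dt_s
                   for s0, s1 in zip(stress_pa, stress_pa[1:]))
    return area_m2 * wave_speed_m_s / modulus_pa * integral

# Hypothetical half-sine pulse: 0.1 MPa peak, 10 microseconds long, 50 ns sampling.
DT = 50e-9
N_STEPS = 200
PEAK = 1.0e5
pulse = [PEAK * math.sin(math.pi * i / N_STEPS) for i in range(N_STEPS + 1)]

# Hypothetical affected area; speed and modulus are SI conversions of
# 6833 ft/sec and 0.8x10**6 psi.
AREA = 1.0e-4
SPEED = 6833.0 * 0.3048
MODULUS = 0.8e6 * 6894.76

e_wave = wave_energy(AREA, SPEED, MODULUS, pulse, DT)
# Hypothetical ball: about the mass of a 1 mm steel sphere, assumed velocities.
e_abs = absorbed_energy(mass_kg=4.0e-6, v_impact_m_s=2.0, v_rebound_m_s=1.0)
print(f"wave energy = {e_wave:.3e} J, absorbed energy = {e_abs:.3e} J")
```

Because the affected area enters the wave energy linearly, matching the two energies for several ball sizes is one way the area could be estimated, as the text suggests.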


An interesting result is obtained by plotting the duration of
the pulse versus the size of the spherical indentor. It can be seen
that the experimental data fit the following equation, which can be
compared with the equation by Sousar.


Finally, the waveform plots of stress versus time for the three
imbedded sensors are shown in Figure 8. One vertical division
represents 2.015 Pascal and a horizontal division represents 1.953
micro-sec. The first curve (from the first sensor) is replotted in
Figure 9. Note that the curve drops after the first peak, which is
an indication that the reflected wave from the free back surface has
arrived as a tensile wave. A reconstruction of the superposition of
tensile and compressive waves as a function of time can be seen in
Figure 9. Finally, the same superposition of compressive and tensile
waves as sensed by the second and third sensors is shown in Figure 10.
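
The timing of the drop after the first peak can be estimated from the plate geometry. The sketch below is a rough check, assuming (this is not stated explicitly in the text) that the three sensors sit 7, 14 and 21 plies below the impact face of the 28-ply plate, that each seven-ply set is 0.041 inch thick, and that the wave travels at the measured 6833 ft/sec. All names are illustrative.

```python
SET_THICKNESS_IN = 0.041          # thickness of one seven-ply set (inches)
N_SETS = 4                        # four sets of seven plies
WAVE_SPEED_IN_S = 6833.0 * 12.0   # measured speed converted to in/sec

def reflection_delay_s(sensor_index):
    """Delay between the direct front and the back-surface reflection.

    sensor_index counts sensors from the impact face (1, 2 or 3); the wave
    must travel from the sensor to the free back surface and return.
    """
    remaining_sets = N_SETS - sensor_index
    return 2.0 * remaining_sets * SET_THICKNESS_IN / WAVE_SPEED_IN_S

delays_us = [reflection_delay_s(k) * 1e6 for k in (1, 2, 3)]
for k, d in zip((1, 2, 3), delays_us):
    print(f"sensor {k}: reflection arrives about {d:.1f} microseconds after the front")
```

Under these assumptions the reflected tensile wave reaches the first sensor about 3 microseconds after the direct compressive front, that is, roughly one and a half horizontal divisions of 1.953 microseconds.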


5.  CONCLUSIONS 

The embedded PVDF sensors are found to be effective for studying the
wave propagation in the thickness direction of the laminated plates.
The interference from the reflected wave can be minimized or even
eliminated by reducing the size of the impactor balls. The method
provides a way to determine the Young's modulus in the thickness
direction from the measured wave speed. This method would also
allow an estimation of the wave attenuation in the thickness direction.


Appendix A1:

The PVDF Sensors

The PVDF sensor being used has the following properties:

Thickness          = 28x10**-6 m (= 1.102 x 10**-3 inch)
Lead wires         = 41 swg
Readout            : digital oscilloscope
Speed              = 50x10**-9 sec/div
Number of channels = 2 (for velocity study)
                   = 4 (for attenuation study)

Lamina thickness (seven plies) = 1.04x10**-3 m = 0.041 inch
Sensor to lamina thickness = 1/37.2


A schematic diagram of the PVDF film is shown in Figure A2. The PVDF
sensor measurement principle is shown in Figure A1. The applied stress
σ can be found from

\[ \sigma = \frac{Q}{A d} \]

where Q is the charge, A is the sensor area and d is the charge
sensitivity. Further,

\[ \frac{d\sigma}{dt} = \frac{1}{A d} \frac{dQ}{dt} \]

and the current I can be found from

\[ \frac{dQ}{dt} = I = \frac{v}{R} \]

where v is voltage and R is resistance. Finally,

\[ \frac{d\sigma}{dt} = \frac{v}{R A d} \]

Integrating, one obtains,

\[ \sigma(t) = \frac{1}{R A d} \int_0^t v \, dt \]

For application purposes,

\[ R \approx 10^{6} \ \text{ohms} \]
\[ d = 33 \times 10^{-12} \ (\text{coulomb/m}^2)/(\text{N/m}^2) \]
\[ A \approx 1.88 \times 10^{-4} \ \text{m}^2 \]

Thus, one obtains

\[ \sigma = 1.6119 \times 10^{8} \int_0^t v \, dt \quad (\text{Pascal}) \]
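
The conversion from a sampled voltage trace to stress can be sketched as below. This is a minimal illustration rather than the authors' data reduction: it uses the values of R, d and A quoted in this appendix, integrates the voltage with the trapezoidal rule, and applies it to a hypothetical constant 1 V trace sampled every 50 ns.

```python
R_OHM = 1.0e6      # load resistance (ohms)
D_COEF = 33e-12    # charge sensitivity ((coulomb/m**2)/(N/m**2))
A_M2 = 1.88e-4     # sensor area (m**2)

# Calibration constant 1/(R*A*d), in Pascal per volt-second.
K = 1.0 / (R_OHM * A_M2 * D_COEF)

def stress_history(voltage, dt_s):
    """Stress (Pa) at each sample, from cumulative trapezoidal integration of v dt."""
    sigma = [0.0]
    accumulated = 0.0
    for v0, v1 in zip(voltage, voltage[1:]):
        accumulated += 0.5 * (v0 + v1) * dt_s
        sigma.append(K * accumulated)
    return sigma

# Hypothetical trace: constant 1 V for 1 microsecond, sampled every 50 ns.
trace = [1.0] * 21
sigma = stress_history(trace, 50e-9)
print(f"K = {K:.4e} Pa/(V*s), final stress = {sigma[-1]:.2f} Pa")
```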
Appendix A2: Material Specification of the Test Composite Plate

Manufacturer:       Fiberite Corporation
Fiber:              G-40-700 Graphite
Resin:              Fiberite 974 epoxy
Number of plies:    Eight
Thickness/ply:      0.00537 inch
Total thickness:    0.038 inch
Density:            0.055 lb-mass/cubic inch
Fiber volume:       55%

Major Poisson's ratio is 0.25

E11c = longitudinal compressive modulus = 21.4 Msi
E11t = longitudinal tensile modulus = 24.0 Msi
E22t = transverse tensile modulus = 1.47 Msi
E45t = tensile modulus 45 degree = 3.22 Msi
FLt  = longitudinal ultimate tensile strength = 380 ksi
FLc  = longitudinal ultimate compressive strength = 150 ksi
FTt  = transverse ultimate tensile strength = 8.70 ksi
F45t = ultimate tensile strength 45 degree = 27.8 ksi
G12  = shear modulus = 0.[illegible]7 Msi
1   Experimental setup for drop ball stress wave measurement

2   Schematic diagram of the imbedded sensors and the clamped
    circular plates and typical stress versus time as
    sensed by each of the imbedded sensors

3   Stress parameter versus time for different ball diameters

4   The two waveforms as sensed by the two imbedded sensors
    recorded on the high speed digital waveform recording
    oscilloscope

5a  An enlarged photograph from the oscilloscope showing the
    delay in the starting times from the two sensors

5b  Waveforms from the PVDF sensors show repeatability
    of the test in eight consecutive drop ball impacts

6   Front part of the plot of two waves generated by the drop of
    a one millimeter diameter steel ball.

7   Amplitude decay of the stress wave as it propagates through
    the three consecutive sensors located at seven ply intervals

8   Stress versus time for the three imbedded sensors

9   The wave sensed by the first sensor is replotted to
    show superposition of compressive and tensile waves

10  Superposition of wave as sensed by all three sensors

A1  PVDF sensor measurement principle and typical voltage vs time
    and stress versus time curves

A2  Schematic piezopolymer PVDF film
Figure 3  Stress parameter versus time for different ball diameters
(plot title: WAVE THRU GR/EP PLATE R13)

Figure 6  Front part of the plot of two waves generated by the drop of
a one millimeter diameter steel ball.

Figure 8  Stress versus time for the three imbedded sensors


ABSTRACT

The existing long-term fatigue test procedure requires the
breaking of 20 to 30 identically prepared specimens and takes one month
to complete. Thus, manufacturers do not perform a conventional
fatigue test in spite of its obvious utility. Therefore, there is
a definite need for an accelerated fatigue test which can be
completed in approximately one day. The accelerated test procedure
proposed in this project was developed on the basis of data
obtained during the conventional fatigue test performed under UES
mini grant S-210-9MG-038 in 1989, and as a continuation of research
started by the principal investigator at the Flight Dynamics
Directorate, Wright-Patterson Air Force Base, in Summer 1990. The
proposed accelerated test will make it possible to monitor the
stability of the manufacturing process and to make preliminary
evaluations of design changes.

The additional long-term tests were run to check the
compatibility of the new and previous test results, gather more
statistics, and increase the reliability of the baseline information.

The experimental fatigue investigation was conducted with
polycarbonate specimens of different thicknesses, both with and
without stress concentrators. The influence of the frequency on
the fatigue life of these specimens was also investigated.

INTRODUCTION


It  appears  (based  on  preliminary  tests)  that  the  fatigue 
properties  of  polycarbonate  sheets  vary  significantly  from  sheet  to 
sheet,  and  probably  within  the  same  sheet.  Therefore,  it  is  very 
important  to  have  a  mechanism  which  permits  (quick  and  continuous) 
quality  control  of  polycarbonate  sheets  and  detects  deviations  in 
the  manufacturing  process;  in  other  words,  a  mechanism  to  control 
the  stability  of  a  manufacturing  process.  The  accelerated  fatigue 
tests  can  be  utilized  for  such  control,  and  the  test  procedure 
developed  in  this  project  can  be  useful  for  this  purpose.  The 
accelerated  fatigue  tests  would  also  be  useful  for  preliminary 
evaluations of new designs. For example, the Flight Dynamics
Directorate is developing a frameless canopy design, and the
accelerated fatigue test used in this project can be recommended
for a preliminary estimate of the fatigue resistance of the frameless
canopy polycarbonate.

The accelerated fatigue test procedure proposed in this
project was partially developed on the basis of data obtained
during the long-term fatigue test performed under UES mini grant
S-210-9MG-038 in 1989, and as a continuation of research started by
the principal investigator at the Flight Dynamics Laboratory,
Wright-Patterson Air Force Base, in Summer 1990.

OBJECTIVES


The  major  objectives  of  the  study  were: 

1.  Development of a detailed procedure for the accelerated fatigue
test for coupons cut from a structural polycarbonate sheet of the F-16
canopy laminate.

2.  Conducting  additional  long-term  fatigue  tests  using  the 
procedure  given  in  reference  1.  Additional  fatigue  tests  are 
necessary  to  get  reliable  parameters  for  the  accelerated  fatigue 
tests. 

3.  Conducting  accelerated  fatigue  tests  based  on  the  procedure 
developed  and  using  statistics  gained  from  long-term  fatigue  tests. 

The objectives of the initial investigations, intended as a basis for
possible future detailed research, were:

1.  Determination of the dependence of the fatigue resistance of
polycarbonate sheet on the thickness of the sheet.

2.  Determination of fatigue lives of specimens tested at different
frequencies.

LOCATY'S PROCEDURE FOR ACCELERATED FATIGUE TEST


The theory and description of Locaty's accelerated fatigue
test were given in [1] and [2] and are repeated here in concise form
for convenience.

The method is based on the concept of cumulative fatigue
damage, or the Palmgren-Miner rule,

\[ \sum_i \frac{n_i}{N_i} = 1 \]

where n_i is the number of cycles for which the specimen was loaded in
the specified test regime, and N_i is the number of cycles which the
specimen could potentially sustain in accordance with the fatigue
curve obtained from the results of the long-term fatigue tests
of the same type of specimen.

The loading program and the treatment of results are presented
in Figures 1 and 2. Figure 1 shows three fatigue curves (for
example, 5%, 50%, and 95% probability of failure) obtained from a
long-term fatigue test. During an accelerated test the specimen
is cycled, for example, for 50,000 cycles at the first load level; then
the load is increased and the specimen is again tested for 50,000
cycles. The procedure is repeated until the specimen fails.
Then, using Figure 1, the magnitudes of

\[ \sum_i \left(\frac{n_i}{N_i}\right)_A, \quad \sum_i \left(\frac{n_i}{N_i}\right)_B, \quad \sum_i \left(\frac{n_i}{N_i}\right)_C \]

are determined. With these three parameters (or, if necessary, a
greater number of points) and knowing the corresponding stresses,
we can find the coordinates of the points which result in the curve
shown in Figure 2. Now, if according to an accepted hypothesis
fatigue strength corresponds to a definite value (for example,
\( \sum_i n_i/N_i = 1 \)), it is possible to determine the magnitude of
fatigue strength (Figure 2).

Figure 1.  Loading program for the accelerated test. Three
fatigue curves A, B, C are obtained from a long-term
fatigue test and correspond, for example, to 5%,
50%, and 95% probability of failure.


Figure 2.  A diagram for graphical determination of fatigue
strength.

EQUIPMENT, SPECIMENS, AND PROCEDURE OF LONG-TERM FATIGUE TESTS

The conventional and accelerated flexure fatigue tests were
conducted on an MTS machine using the four point MTS flexure
system to provide bending (Figure 3). The loading diagram is
shown in Figure 4. The specimen geometry is given in Figure 5. The
testing machine, fixtures, specimens with stress concentrators,
loading diagram, and load spans were exactly the same as those
described in [1] and [2], to ensure the compatibility of the
previous and new tests.

The specimen machining procedure, conventional fatigue test
procedure, and failure criteria were also similar to those described in
[1] and [2].


Figure 3.  MTS flexure fatigue fixture

Figure 5.  Specimen geometry


LONG-TERM AND ACCELERATED FATIGUE TESTS


The  long-term  fatigue  test  regimes  are  given  in  Table  1,  and 
the  programs  and  parameters  of  the  accelerated  tests  are  given  in 
Figures  8,9  and  10. 


Table 1

Testing Regimes (Long-term test; variable bending).

Amplitude load, lb          Amplitude     Min./max.      Frequency,
Plate with    Solid         stress, psi   stress ratio   Hz
a hole        plate

180           240           1440          0.2            12
-             360           2160          0.2             8
360           480           2880          0.2             6
-             600           3600          0.2             5
540           720           4320          0.2             4
750           990           5760          0.2             3


The results of the long-term fatigue tests from [1] and [2]
are shown in Figures 6 and 7 and are repeated in this report for
comparison purposes. The results of the long-term fatigue tests
completed in 1991 are shown in Figure 8. From these results
it can be assumed that the 1989 material had greater fatigue
strength than the polycarbonate sheet received in 1990 and tested
in 1990 and 1991, but the scatter is significantly less in
the latest tests. The possible difference in strength between two
polycarbonate sheets of the same type shows once more that
quick control of fatigue characteristics is very useful for verifying
the initial product quality.

The results and programs of the accelerated tests are given
on loading diagrams in Figures 8, 9, 10 and in Table 2. The
treatment of the test data is shown in Table 3. The testing lives
n_i are taken from the accelerated test program which is given, for
example, in Figure 9, and the expected lives N_i for the curves A, B,
and C (5%, 50%, and 95% probability of survival) are taken from
Figure 8. The repeatability of the results is quite reasonable
and, therefore, the experimentally determined sum of relative lives
can probably be recommended as a basic parameter to
confirm whether the specimens tested during the control procedure
belong to the same population. The accelerated tests never took
more than 6 hours. It is probably possible to decrease
the testing time further by trying different combinations of the load
increment and the number of testing cycles at each load level, but this
was beyond the scope of this initiation research project.

TABLE 2.  Accelerated test results.

Specimen   Max. load       Number of cycles    Number of steps   Number of cycles
number     increment, lb   at one load level   until failure     until failure

1          200             15000               4                 56000
2          200             15000               5                 74200
3          200             15000               4                 50500
4          200             15000               3                 53900

TABLE 3.  Accelerated test result treatment.

Amplitude       Curve A               Curve B               Curve C
stress, psi     Ni, cycles   n/Ni     Ni, cycles   n/Ni     Ni, cycles   n/Ni

1280            660000       0.023    550000       0.027    280000       0.054
1920            480000       0.031    320000       0.047    240000       0.063
2560            250000       0.060    135000       0.111    105000       0.143
3200            110000       0.100     85000       0.129     68000       0.162

Sum of n/Ni                  0.214                 0.314                 0.422
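
The sums of relative lives in Table 3 can be checked directly from the tabulated expected lives. The sketch below is an illustration under one stated assumption: the cycles applied at the last load level (11,000) are inferred here from the 56,000 total cycles of specimen 1 and 15,000 cycles at each of the first three levels, since the table does not list them explicitly. Names are illustrative.

```python
# Cycles applied at each load level; the last value is inferred as
# 56000 - 3 * 15000 (an assumption, see the text above).
CYCLES_PER_STEP = [15000, 15000, 15000, 11000]

# Expected lives N_i (cycles) read from Table 3 for the three fatigue curves.
EXPECTED_LIVES = {
    "A": [660000, 480000, 250000, 110000],
    "B": [550000, 320000, 135000, 85000],
    "C": [280000, 240000, 105000, 68000],
}

def relative_life_sum(applied_cycles, expected_lives):
    """Palmgren-Miner sum of n_i / N_i over the load levels."""
    return sum(n / N for n, N in zip(applied_cycles, expected_lives))

sums = {curve: relative_life_sum(CYCLES_PER_STEP, lives)
        for curve, lives in EXPECTED_LIVES.items()}
for curve, total in sums.items():
    print(f"curve {curve}: sum of n/N = {total:.3f}")
```

The computed sums agree with the tabulated totals of 0.214, 0.314 and 0.422 to within rounding of the individual entries.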

The failure damage and failure mechanism during the long-term and
accelerated fatigue tests completed in 1991 were similar to
those described in [1] and [2]. The cracks always
started at the bottom tensile zones of the specimens. In all
specimens with stress concentrators the cracks propagated from the
hole edges toward the specimen sides. Usually a minute crack spot
preceded the crack formation and propagated ahead of the crack tip
during the entire damage process.

ADDITIONAL FATIGUE TESTS


The investigation of the fatigue lives of specimens with
different thicknesses and the investigation of the influence of
testing frequency on the final fatigue life were conducted in
addition to the major goal of the project (development of the
accelerated fatigue test procedure).

A comparison of the fatigue lives for the 0.5" and 0.25"
thickness specimens tested under the same stress is given in
Figure 12. The test results show that the life of the thinner
material is shorter. It was observed that the cracks propagated with
almost equal speed across the specimens and into their depth.
Since the width of the specimen from the hole edge to the side is
0.5", a crack penetrates the full depth of a 0.25" specimen well
before it reaches the side, so the shorter life of the thinner
specimens is to be expected. The results could be important in the
selection of the appropriate canopy material.

Table 4 reflects the results of fatigue tests conducted at
different frequencies. The results show that the properties of the
material tested are clearly time dependent. The number of cycles
until complete separation strongly depends on frequency, but the
specimen life in hours was almost the same for specimens tested
at the same load level and different frequencies. This is an
interesting observation regarding the optimal usage and efficiency of
polycarbonate parts.

(Figure 12 axis labels: amplitude stress 2880 psi and 4200 psi)

Figure 12.  Comparison of fatigue lives of the specimens with
different thickness. Variable bending. Specimen
with stress concentration. Frequency 5 Hz. Five
specimens tested in each group.


TABLE 4

Fatigue lives of specimens tested at different frequencies.
Specimens with stress concentration. Amplitude load is 240 lb.

Specimen   Frequency,   Number of cycles   Time until   Average failure
number     Hz           until failure      failure, h   time, h

1          8            119300             4.14         4.69
2          8            151000             5.24

3          5             92000             5.11         5.31
4          5             99000             5.5
5          5            115000             6.39
6          5            100000             5.57
7          5             72000             4.01

8          2             40900             5.68         4.70
9          2             33700             4.68
10         2             61300             3.74
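
The conversion between cycles to failure and time to failure used in Table 4 is simply cycles divided by frequency. The short sketch below illustrates it for one specimen at each frequency (specimens 1, 3 and 8); the function name is illustrative.

```python
def hours_to_failure(cycles, frequency_hz):
    """Test duration in hours for a given cycle count and loading frequency."""
    return cycles / frequency_hz / 3600.0

# (specimen number, frequency in Hz, cycles until failure) from Table 4.
rows = [(1, 8, 119300), (3, 5, 92000), (8, 2, 40900)]
times = {specimen: hours_to_failure(cycles, freq) for specimen, freq, cycles in rows}
for specimen, hours in times.items():
    print(f"specimen {specimen}: {hours:.2f} h")
```

Although the cycle counts differ by almost a factor of three between 8 Hz and 2 Hz, the resulting times all fall between about 4 and 6 hours, which is the observation made in the text.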
CONCLUSIONS AND RECOMMENDATIONS

The project considered in this report is a continuation of the
study started by the principal investigator at the Flight Dynamics
Laboratory, Wright-Patterson Air Force Base, in 1989 and continued
in 1990 and 1991. Therefore, the conclusions given below are
overall conclusions based on the results given in this report and
in [1] and [2].

1.  The 0.5" thickness polycarbonate specimens have a significant
life until complete separation after the appearance of the minute
cracks.

2.  The  massive  minute  cracks  usually  appear  only  on  the  specimens 
tested  at  high  load  levels.  Therefore,  it  is  possible  to  make  a 
certain  judgment  about  load  history  by  analyzing  the  specimens 
after  failure. 

3.  The influence of a stress concentration on the fatigue life of
the polycarbonate specimens tested is very strong. The specimens
without stress concentration have a much longer fatigue life. From
this point of view, frameless canopies without holes for fasteners
can be very promising. However, it is important to keep in mind
that the scatter of the test results for the solid specimens is
greater than that for the specimens with holes.

4.  The results of the accelerated tests have good
repeatability, and the procedure used can be recommended to control
the stability of the manufacturing process. The procedure can also be
useful, for example, for a preliminary comparison of the material for
the frameless canopy and the conventional polycarbonate sheet which is
used today.

5.  The search for the optimal parameters of the accelerated test
can be continued. The author believes that the testing time can be
decreased.

6.  The fatigue tests showed that the fatigue strength of the
polycarbonate sheets varies significantly from sheet to sheet and
probably from manufacturer to manufacturer. Therefore, quick
control of fatigue characteristics can be recommended as a
compulsory procedure.

7.  The polycarbonate sheet thickness (at least in a range from
0.25" to 0.5") has a very strong influence on the fatigue life of
this material.

8.  The properties of the specimens are strongly time dependent.
The final specimen life in hours does not depend on the testing
frequency.

9.  The investigation of fatigue resistance of the canopy materials
in different environmental conditions is strongly recommended.

ABSTRACT


Interply shear fatigue behavior of nylon cord-reinforced rubber matrix composites has been
investigated  to  assess  the  mechanisms  of  local  damage  accumulation  and  delamination 
failure  of  bias  aircraft  tire  carcass.  In  the  case  of  flat  coupon  specimens  of  finite  width, 
interply  shear  strain  was  found  to  induce  localized  failure  in  the  forms  of  cord-matrix 
debonding  and  matrix  cracking  which  were  eventually  developed  into  the  delamination. 
One  unique  feature  of  aircraft  tire  carcass  composites  was  a  relatively  long  period  of  time 
sustained  after  the  onset  of  delamination.  Fatigue  lifetime  profile  of  aircraft  tire  carcass 
composites  was  found  to  be  dependent  on  the  level  of  frequency  (speed).  A  more 
dominant  role  of  creep  effect  due  to  the  viscoelastic  properties  of  constituent  materials  is 
suspected  for  reduced  fatigue  lifetime  at  a  lower  frequency.  The  process  of  damage 
accumulation  in  aircraft  tire  carcass  composites  was  accompanied  by  a  continuous  increase 
of  temperature  and  cyclic  strain.  In  determining  the  rate  of  cyclic  strain  increase  for  the 
composites,  the  effect  of  creep  caused  by  viscoelastic  properties  of  constituent  materials 
appears  to  be  as  important  as  the  effect  of  material  flow  and  subsequent  crack  growth  from 
the  debonded  cord  ends.  The  fatigue  life  of  aircraft  tire  carcass  composites  was  inversely 
proportional  to  the  rate  of  cyclic  strain  increase.  However,  the  extent  of  dynamic  creep  at
gross  failure,  which  is  defined  as  the  increase  of  cyclic  strain  beyond  initial  elastic 
deformation,  was  roughly  independent  of  stress  amplitude.  The  use  of  internally- 
pressurized  tube  specimens  of  angle-plied  composites  was  explored  to  simulate  the 
delamination  of  aircraft  tire  carcass  in  shoulder  region  with  no  cut  ends  of  cords. 
However,  under  the  externally  applied  axial  tension,  the  tubular  specimens  exhibited 
extensive  interply  shear  deformation  superseded  by  straining  of  cord  reinforcement. 
I.  INTRODUCTION


Our  research  effort  ultimately  aims  at  the  laboratory  prediction  of  the  durability  of  aircraft 
tires  based  on  the  study  of  deformation  and  fracture  behavior  of  cord-rubber  composite 
elements.  The  effort  has  been  initiated  for  the  Landing  Gear  Systems  Group  (LGSG)  of 
the  Flight  Dynamics  Directorate  at  Wright  Research  Laboratory  to  develop  the  capabilities 
of  predicting  long-term  performance  and  life  expectancy  of  aircraft  tires  in  a  cost-effective 
manner.  Compared  with  other  types  of  pneumatic  tires,  aircraft  tires  are  subjected  to 
unusual  combinations  of  speed  and  load  (1,2).  For  example,  the  baseline  tire  used  by 
LGSG  in  their  current  durability  study  is  a  49X17/26PR  bias  construction  and  is  rated  at  a 
speed  of  358  km/hr  (224  mph)  and  a  load  of  173  500  N  (39  000  lbf).  Extreme 
combinations  of  speed  and  load  result  in  high  cyclic  frequencies,  large  deflections  and 
significant  heat  generation  due  to  hysteretic  loss  in  aircraft  tires.
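
To give a rough sense of the cyclic frequencies involved, the sketch below converts the rated speed into a rotation rate. It assumes, which the text does not state, that the leading 49 in the size designation is the nominal outside diameter in inches, that the tire rolls without slip, and that each material element is loaded once per revolution. The function name is illustrative only.

```python
import math

def rotation_frequency_hz(speed_km_per_hr: float, diameter_inch: float) -> float:
    """Revolutions per second of a tire rolling without slip."""
    speed_m_per_s = speed_km_per_hr * 1000.0 / 3600.0
    circumference_m = math.pi * diameter_inch * 0.0254  # inches to metres
    return speed_m_per_s / circumference_m

# Rated speed of 358 km/hr from the text; 49 inch diameter is an assumption.
freq = rotation_frequency_hz(358.0, 49.0)
print(f"{freq:.1f} revolutions per second")
```

Under these assumptions the rotation rate comes out at roughly 25 revolutions per second, which is above the 1 to 10 Hz range used in the coupon tests described later.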

As  confirmed  by  field  experience  (3),  these  conditions  cause  damage  in  critical  sub-regions 
of  tires  such  as  shoulder,  bead,  lower  sidewall  or  tread  which  eventually  develops  into 
catastrophic  failures  of  whole  tires.  The  failure  of  the  carcass  ply  in  the  shoulder  area, 
often  called  a  ply  separation,  occurs  in  the  form  of  delamination  which  involves  crack 
propagation  mainly  in  the  rubber  matrix  and  cord-matrix  interface.  So-called  bead  area 
cracking  and  lower  sidewall  break  also  involve  crack  propagation  in  the  rubber  matrix  of 
carcass  ply,  but  there  are  strong  indications  of  fiber  fracture  as  well.  The  processes  of 
damage  accumulation  and  structural  failure  of  tire  carcass  in  the  shoulder,  bead  and  lower 
sidewall  areas  are  attributed  to  a  combination  of  mechanical  overloading  and  heat  generation 
along  with  the  resultant  deterioration  of  constituent  materials.  On  the  other  hand,  tread 
groove  cracking  and  subsequent  tread-carcass  separation  of  aircraft  tires  are  attributed  to  the 
presence  of  strong  centrifugal  force  resulting  from  unusually  high  speed  particularly  when 
so-called  standing  wave  is  present  (1-5). 

As  discussed  so  far,  a  basic  understanding  of  tire  failure  mechanisms  has  been  established,  at
least  in  a  qualitative  sense.  In  the  case  of  tread  failure,  its  major  cause,  the  occurrence  of
standing  waves,  can  be  avoided  by  a  proper  design  of  tires  (4).  However,  as  far  as  the
carcass  ply  is  concerned,  the  tasks  of  identifying  critical  operating  conditions  (speed,  load, 
underinflation  and  tire  deflection)  responsible  for  the  failure  and  predicting  the  useful  life 
expectancy  of  an  aircraft  tire  are  still  difficult  at  best.  Accelerated  testing  based  on
dynamometers  provides  a  valuable  means  of  evaluating  the  structural  durability  and  life 
expectancy  of  aircraft  tires.  But  it  is  too  costly  to  rely  solely  on  the  dynamometer  test 
results  in  specifying  the  number  of  takeoff  and  landing  cycles  a  tire  may  endure  before 
replacement  (1).  Besides,  the  results  of  these  dynamometer  tests  reflect  merely  the
sensitivity  of  each  specific  tire  design  and  construction  to  a  specific  combination  of  test 
conditions,  unless  underlying  mechanisms  of  material  property  degradation  and  damage 
accumulation  are  identified. 

Past  approaches  to  characterize  the  property  degradation  profile  of  the  aircraft  tires  have 
utilized  coupon  specimens  cut  from  the  carcass  region  (3,6,7).  This  type  of  durability 
analysis  provides  useful  information  on  failures  caused  by  the  deterioration  of  material 
properties  such  as  ply-to-ply  adhesion  strength.  Some  estimates  assume  that  tire  failure 
occurs  at  a  50  percent  reduction  of  carcass  ply  adhesion  strength.  However,  these  works 
did  not  show  how  the  deterioration  of  material  properties  leads  to  damage  accumulation  and 
eventual  structural  failure  of  tire  carcass.  Our  research  program  plans  to  clarify  these  issues 
by  investigating  the  mechanisms  of  local  damage  accumulation,  material  property 
degradation  and  structural  failure  for  the  cord-rubber  composites  which  represent  the 
aircraft  tire  carcass.  In  the  study,  the  mechanisms  of  damage  accumulation,  material 
property  degradation  and  structural  failure  of  cord-rubber  composites  will  be  examined 
under  the  laboratory  loading  conditions  which  in  turn  simulate  individual  elements  of
complex  loading  for  the  actual  tire  carcass. 

The  above-described  research  program  has  begun  in  the  1990  Summer  Faculty  Research 
Project  (SFRP)  with  an  emphasis  on  failure  modes  in  the  shoulder  area  (8).  In  the  1990 
SFRP,  an  experimental  method  was  developed  to  simulate  carcass  delamination  process  in 
the  flat  coupon  specimens  of  angle-plied  cord-rubber  composites.  For  a  better 
determination  of  the  failure  modes,  SFRP  utilized  mainly  the  model  composite  coupons 
reinforced  by  steel  wire  cables  of  large  diameter.  Under  uniaxial  cyclic  loading  which 
represents  circumferential  tension  in  the  footprint  region  of  bias  tires,  angle-plied  cord- 
rubber  composite  specimens  were  found  to  exhibit  extensive  interply  shear  deformation 
which  eventually  leads  to  delamination  type  failures.  Although  the  use  of  the  coupon 
specimens  with  free  edges  allows  a  direct  observation  of  the  mechanisms  of  damage 
accumulation  and  delamination,  the  cut  ends  of  cords  exposed  at  the  specimen  edge  also  act 
as  built-in  defects  in  the  process  of  damage  initiation  (9,10).  This  fact  is  expected  to 
complicate  our  future  interpretation  of  composite  test  results  in  correlation  with  tire
durability  data. 

As  a  follow-up  study  of  SFRP,  a  current  Research  Initiation  Program  has  expanded  the 
scope  of  work  to  include  detailed  and  more  realistic  assessment  of  interply  shear  fatigue 
behavior  of  the  nylon  cord-reinforced  composite  coupons  which  represent  the  actual  aircraft
tire  carcass.  The  study  has  allowed  initial  screening  of  the  operating  parameters  (e.g. 
frequency,  stress  amplitude,  minimum  cyclic  stress,  strain  range,  dynamic  creep  rate, 
temperature,  heat-up  rate,  etc.)  which  can  influence  the  mechanisms  of  fatigue  damage
accumulation  and  delamination  failure  of  the  composites.  The  mechanism  of  material 
property  degradation  has  not  been  fully  investigated  because  of  time  limitations.  Finally,  in 
addition  to  the  flat  coupons,  the  use  of  internally-pressurized  tube  specimens  of  angle-plied 
composites  was  explored  to  simulate  the  condition  of  aircraft  tire  carcass  which  has  no  cut 
ends  of  cords  in  shoulder  region  but  delaminates  under  biaxial  tension.


II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 

(a)  To  assess  interply  shear  deformation  and  fracture  behavior  of  aircraft  tire  carcass 
composites  under  fatigue  loading,  (b)  to  identify  the  operating  parameters  which  can 
influence  the  mechanisms  of  fatigue  damage  accumulation,  material  property  degradation 
and  delamination  failure  of  angle-plied  cord-rubber  composites,  and  (c)  to  develop  a  new 
biaxial  fatigue  test  method  involving  internally-pressurized  tube  specimens  without  the  cut 
ends  of  cords.


III.  EXPERIMENTS

As  in  the  case  of  the  1990  SFRP,  the  current  study  used  mainly  flat  coupon  specimens 
press-molded  from  calendered  ply  stocks  (Table  1).  The  specimen  consists  of  a  cord- 
rubber  composite  representing  typical  bias  aircraft  tire  carcass  using  a  +/-  38  deg 
reinforcement  angle,  1260/2  nylon  cord  and  a  proprietary  rubber  compound  based  matrix. 
The  aircraft  tire  carcass  composite  was  found  to  have  a  lower  modulus  and  a  lower  strength 
than  the  model  composite  which  will  be  often  quoted  in  this  report  as  a  reference  system 
(8).  The  model  composite  uses  a  +/-  19  deg  cord  angle  and  steel  wire  cables  of  1.62  mm
diameter  circular  cross-section,  because  the  large  cord  diameter  and  the  cord  angle  chosen 
maximize  the  interply  shear  strain,  a  major  contributing  factor  in  composite  failure.  The 
composite  coupon  specimens  had  free  edges  with  the  cut  ends  of  reinforcing  cords 
exposed.  To  avoid  tension-bending  coupling,  the  coupon  specimens  were  constructed  with 
a  symmetric  ply  lay-up.  The  end  tabs  were  added  to  the  specimens  to  prevent  failure  near 
the  gripping  region. 

In  simulation  of  the  circumferential  tension  imposed  to  an  aircraft  tire  in  the  footprint 
region,  composite  coupon  specimens  were  subjected  to  both  static  and  cyclic  uniaxial 
tensile  loading.  Cyclic  testing  was  performed  under  a  broad  range  of  stress  amplitude  and 
three  different  levels  of  frequency  (1,  5  and  10  Hz)  to  generate  a  series  of  S-N  (stress  range
vs  number  of  cycles  to  failure)  curves.  Minimum  cyclic  stress  was  kept  constant  at  0.2  ksi. 
Throughout  the  testing,  heat  generation  due  to  hysteretic  loss  was  closely  monitored  relying 
on  the  surface  temperature  of  the  specimens.  In  both  static  and  cyclic  testing,  local  strain 
was  estimated  by  measuring  the  displacement  of  line  markings  drawn  on  the  specimen 
edge. 

Although  a  systematic  variation  was  not  attempted,  the  following  four  different  length-to- 
width  ratios  were  employed  in  testing  of  flat  coupons  of  aircraft  tire  carcass  composites: 
5.3  (0.75  inch  wide  /  4  inch  long  or  1  inch  wide  /  5.25  inch  long);  4  (1  inch  wide  /  4  inch 
long);  3.3  (0.75  inch  wide  /  2.5  inch  long);  0.5  (2.5  inch  wide  /  1.25  inch  long).  Among
them,  the  specimens  of  a  length-to-width  ratio  of  0.5  were  tested  to  observe  failure  mode  of
the  composites  when  no  single  cord  reinforcement  remains  ungripped. 

In  addition  to  the  flat  coupons,  tube  specimens  with  no  cut  ends  of  cords  were  prepared 
from  an  identical  cord-rubber  composite  ply  stock  representing  bias  aircraft  tire  carcass  (+/- 
38  deg  cord  angle,  1260/2  nylon  cord  and  a  proprietary  rubber  compound  based  matrix). 
In  this  case,  a  symmetric  ply  lay-up  was  not  used.  Some  tube  specimens  were  prepared 
with  a  pre-delaminated  area  by  inserting  Teflon  sheet  between  the  plies.  The  composite 
specimens  with  or  without  pre-delamination  were  internally  pressurized  and  tested  under 
static  tension  along  the  longitudinal  axis. 
IV.  RESULTS  AND  DISCUSSIONS


IV.  1.  Failure  Modes  of  Aircraft  Tire  Carcass  Composites

As  shown  in  our  previous  study  (8)  as  well  as  numerous  studies  in  the  past  (9-13),  the
angle-plied  cord-rubber  composites  exhibit  a  large  interply  shear  strain  under  static  or  cyclic 
tension.  Interply  shear  strain  develops  in  angle-ply  laminates,  because  the  constituent  plies 
exhibit  in-plane  shear  deformation  of  opposite  direction  but  the  action  is  prevented  by 
mutual  constraint  due  to  interply  bonding.  Compared  with  the  case  of  fiber-reinforced 
plastic  composites  (14),  cord-rubber  composites  exhibit  unusually  high  level  of  interply 
shear  strain  which  results  from  the  load-induced  change  of  reinforcement  angle  allowed  by 
extreme  compliance  of  rubber  matrix.  The  previously  reported  data  indicate  that,  at  an  axial 
strain  of  say  10  percent  under  static  tension,  an  interply  shear  strain  of  70  percent  develops 
in  the  nylon  cord-reinforced  aircraft  tire  carcass  composites  and  120  percent  in  the  model 
composites  (Figures  5  and  6  of  Ref.  8). 

Such  a  large  interply  shear  strain  was  found  to  readily  induce  localized  failure  in  the  form 
of  cord-matrix  debonding  under  cyclic  as  well  as  static  tension.  Debonding  was  started 
around  the  cut  ends  of  reinforcing  cords  at  the  edge  of  the  finite  width  coupons,  which  is 
justified  since  the  maximum  interply  shear  strain  occurs  at  the  edge  of  the  specimen.  A 
critical  level  of  interply  shear  strain  for  cord-matrix  debonding  corresponded  to  the  axial 
stress  of  approximately  20  to  30%  of  ultimate  strength  of  the  composites.  Under  static 
tension,  cord-matrix  debonding  developed  into  matrix  cracking  as  the  strain  increases.  At 
higher  strain,  the  axial  stress-strain  curves  exhibited  strain  hardening  type  response  for 
both  aircraft  tire  carcass  composites  (Figure  1)  and  model  composites  (Figure  8  of  Ref.  8). 
Debonding  and  matrix  cracking  were  widened  with  increasing  strain  and  eventually 
developed  into  the  delamination  leading  to  gross  fracture  of  the  composites.  The  same 
sequence  of  failure  modes  was  observed  under  cyclic  loading  as  long  as  minimum  stress 
remains  in  tensile  regime. 

Our  unpublished  results  indicate  that  the  described  sequence  of  failure  modes  is  reversed  in 
the  model  composites  when  minimum  cyclic  stress  is  near  zero.  The  delamination  process 
appears  to  precede  cord-matrix  debonding  in  this  situation.  One  interesting  observation  on 
the  aircraft  tire  carcass  composites  was  that  the  development  into  the  delamination  occurs 
when  a  length-to-width  (L/W)  ratio  of  the  specimen  is  greater  than  1.28  (cot  38  deg)  and
therefore  some  of  the  reinforcing  cords  remain  ungripped.  In  the  case  of  aircraft  tire 
carcass  composite  specimen  of  a  L/W  ratio  of  0.5,  the  absence  of  any  ungripped
reinforcing  cords  led  to  a  final  failure  mode  of  fiber  fracture  instead  of  the  delamination 
under  static  tension. 
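
The geometric condition quoted above can be checked with a short sketch. It assumes a simplified picture in which every cord is straight and inclined at the nominal cord angle to the loading axis, so that a cord crosses the full specimen width over an axial distance of W cot(angle); the function name and this idealization are illustrative and are not taken from the original analysis.

```python
import math

def has_ungripped_cords(length: float, width: float, cord_angle_deg: float) -> bool:
    """True if some cords run from free edge to free edge without reaching a grip.

    A cord inclined at cord_angle_deg to the loading axis crosses the full
    width over an axial distance of width * cot(angle), so cords can remain
    ungripped only when length / width exceeds cot(angle).
    """
    critical_ratio = 1.0 / math.tan(math.radians(cord_angle_deg))
    return length / width > critical_ratio

# (width, length) in inches for the coupon geometries of the experimental section.
coupons = [(0.75, 4.0), (1.0, 5.25), (1.0, 4.0), (0.75, 2.5), (2.5, 1.25)]
results = [
    (length / width, has_ungripped_cords(length, width, 38.0))
    for width, length in coupons
]
for ratio, ungripped in results:
    print(f"L/W = {ratio:.2f}: ungripped cords present = {ungripped}")
```

Only the widest coupon (L/W = 0.5) falls below the critical ratio in this idealized check, which matches the specimen reported to fail by fiber fracture.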


IV.  2.  Fatigue  Behavior  of  Aircraft  Tire  Carcass  Composites  and  Frequency  Effects

Since  the  minimum  cyclic  stress  was  kept  at  0.2  ksi,  safely  in  the  tensile  regime,  all  tested
specimens  (L/W  =  3.3  to  5.3)  of  nylon  cord-reinforced  aircraft  tire  carcass  composites
exhibited  a  normal  sequence  of  failure  modes,  i.e.  cord-matrix  debonding  developing  into
the  delamination  as  in  the  model  composites.  One  unique  feature  of  aircraft  tire  carcass 
composites  was  a  relatively  long  period  of  time  sustained  after  the  onset  of  partial 
delamination.  This  tendency  was  more  pronounced  at  lower  stress  amplitudes.  As  a 
result,  the  S-N  curve  for  aircraft  tire  carcass  composites  is  linear  on  a  log-log  scale 
allowing  the  determination  of  a  power  law  factor  (Figures  2  and  3),  while  the  S-N  curve 
for  the  model  composites  is  linear  on  a  semi-log  scale  (Figure  9  of  Ref.  8).  As  in  the  case
of  the  model  composites,  the  critical  load  for  the  onset  of  cord-matrix  debonding  seems  to
constitute  a  threshold  level  for  semi-infinite  fatigue  life,  i.e.  endurance  limit,  of  the  aircraft 
tire  carcass  composites.  However,  the  presence  of  the  endurance  limit  is  less  clear-cut, 
because  prolonged  fatigue  life  after  the  onset  of  delamination  requires  testing  beyond  10^7
cycles. 
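
A log-log linear S-N curve corresponds to a power law of the form S = A * N**b. The sketch below shows one way such a fit could be made by ordinary least squares on the logarithms. The function name, the symbols A and b, and the data are all hypothetical; the numbers are synthetic and are not measurements from this study.

```python
import math

def fit_power_law(stress_range, cycles_to_failure):
    """Least-squares fit of S = A * N**b on log10(S) versus log10(N)."""
    xs = [math.log10(n) for n in cycles_to_failure]
    ys = [math.log10(s) for s in stress_range]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    b = sxy / sxx                        # slope on the log-log plot
    a = 10.0 ** (y_mean - b * x_mean)    # intercept converted back to stress units
    return a, b

# Synthetic, noise-free illustration: S = 5.0 * N**-0.1 (arbitrary units).
cycles = [1e3, 1e4, 1e5, 1e6]
stress = [5.0 * n ** -0.1 for n in cycles]
coefficient, exponent = fit_power_law(stress, cycles)
print(f"A = {coefficient:.3f}, b = {exponent:.3f}")
```

With real data the scatter in fatigue life would normally call for confidence bounds on the exponent, which this sketch does not attempt.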

The  effect  of  frequency  on  the  fatigue  behavior  of  aircraft  tire  carcass  composites  has  been 
studied  using  the  specimens  of  three  (eventually  four)  different  L/W  ratios  as  shown 
below. 
Although  testing  has  not  been  completed,  fatigue  lifetime  profile  of  aircraft  tire  carcass
composites  seems  to  be  dependent  on  the  level  of  frequency  (speed).  Compared  with  the
case  of  5  Hz,  consistently  shorter  fatigue  lifetime  is  observed  under  the  frequency  of  1  Hz
at  a  given  stress  range  or  maximum  stress  (Figures  3  and  4).  To  a  much  lesser  extent, 
shorter  fatigue  lifetime  is  also  observed  under  the  frequency  of  10  Hz  compared  with  the 
case  of  5  Hz  (Figures  5  and  6).  If  our  future  test  results  confirm  a  further  reduction  of 
fatigue  lifetime  at  a  higher  frequency  of  20  Hz,  the  phenomenon  could  be  attributed  to  a 
greater  rate  of  heat  generation  due  to  hysteretic  loss.  In  contrast,  a  more  dominant  role  of 
creep  effect  due  to  the  viscoelastic  properties  of  constituent  materials,  which  will  be 
discussed  in  the  next  section,  is  suspected  for  distinctly  shorter  fatigue  lifetime  at  a  lower 
frequency  of  1  Hz.  Static  creep  loading  experiments  are  planned  to  assess  the  contribution 
of  viscoelastic  properties  of  constituent  materials  to  the  damage  accumulation  process  of 
composites. 


IV.  3.  Fatigue  Damage  Accumulation  Process  of  Aircraft  Tire  Carcass  Composites 

As  observed  in  the  model  composites  (Figure  10  of  Ref.  8),  the  process  of  damage 
accumulation  in  aircraft  tire  carcass  composites  is  accompanied  by  a  continuous  increase  of 
temperature  and  cyclic  strain.  The  change  of  cyclic  strain  (either  maximum  strain  or  strain 
range)  undergoes  three  stages  after  an  initial  stepwise  increase.  At  first,  the  strain  increases 
rapidly  but  at  a  progressively  lower  rate.  When  plotted  against  the  logarithm  of  the  time, 
this  first  region  often  forms  a  linear  part  of  a  sigmoidal  shape  curve  (Figures  7,  8  and  9). 
In  the  second  region,  the  cyclic  strain  increases  at  a  constant  rate.  The  region  forms  an 
increasingly  steeper  (upward  curvature)  portion  of  sigmoidal  shape  curve  when  plotted 
against  the  logarithm  of  the  time.  Throughout  the  first  and  second  regions  which  consume 
a  major  part  of  fatigue  life,  the  damage  accumulation  occurs  in  the  forms  of  cord-matrix 
debonding  and  matrix  cracking.  Towards  the  end  of  the  second  region,  partial 
delamination  appears  at  the  specimen  edge.  In  the  final  third  region,  the  cyclic  strain 
increases  rapidly  at  a  progressively  higher  rate  eventually  leading  to  a  catastrophic  failure  of 
gross  delamination. 

The  first  region  of  the  cyclic  strain  vs  time  curve  was  found  to  be  considerably  longer  in  the 
aircraft  tire  carcass  composites  compared  with  the  case  of  model  composites.  In  the  model 
composites,  the  first  region  takes  less  than  20%  of  the  fatigue  lifetime.  On  the  whole,  the 
curves  of  cyclic  strain  vs  time  for  both  composites  strikingly  resemble  a  static  creep  curve 
of  four-element  spring-dashpot  model  for  typical  polymeric  materials  (Figures  10  and  11) 
(15).  The  model  postulates  that  the  steepest  part  of  the  sigmoidal  shape  curve  (i.e.  the  first 
region)  occurs  within  a  period  of  time  around  the  retardation  time.  At  periods  much  longer
than  the  retardation  time,  the  only  response  to  stress  is  believed  to  be  due  to  the  viscous 
flow.  Therefore  the  prolonged  presence  of  the  first  region  suggests  that,  in  determining  the 
rate  of  cyclic  strain  increase  for  nylon  cord-reinforced  aircraft  tire  carcass  composites,  the 
effect  of  creep  caused  by  viscoelastic  properties  of  constituent  materials  is  as  important  as 
the  effect  of  material  flow  and  subsequent  crack  growth  from  the  debonded  cord  ends. 
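
For reference, the static creep response of a four-element model can be sketched as follows. The sketch assumes the common arrangement of a Maxwell element (spring and dashpot in series) connected in series with a Kelvin-Voigt element (spring and dashpot in parallel); the parameter names and numerical values are hypothetical and serve only to show the shape of the curve, not to represent the tested composites.

```python
import math

def four_element_creep_strain(t, stress, e1, eta1, e2, eta2):
    """Creep strain at time t under constant stress.

    e1, eta1: spring modulus and dashpot viscosity of the Maxwell element.
    e2, eta2: spring modulus and dashpot viscosity of the Kelvin-Voigt element.
    """
    retardation_time = eta2 / e2
    elastic = stress / e1                     # instantaneous step in strain
    retarded = (stress / e2) * (1.0 - math.exp(-t / retardation_time))
    viscous = stress * t / eta1               # steady flow, dominant at long times
    return elastic + retarded + viscous

# Hypothetical parameters in consistent arbitrary units.
params = dict(stress=1.0, e1=10.0, eta1=1000.0, e2=5.0, eta2=50.0)
for t in (0.0, 1.0, 10.0, 100.0, 1000.0):
    print(t, round(four_element_creep_strain(t, **params), 4))
```

In this idealization the retarded term saturates a few retardation times after loading, after which the strain grows linearly through the viscous term alone.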

As  shown  in  Figures  7-9,  the  fatigue  life  of  aircraft  tire  carcass  composites  was  found  to  be 
inversely  proportional  to  the  rate  of  cyclic  strain  increase  (referred  to  as  dynamic  creep  rate  in
our  previous  study).  The  fatigue  life  of  composites  was  also  found  to  be  inversely 
proportional  to  the  cyclic  strain  at  gross  failure  (Figures  12,  13  and  14),  although  some 
difficulty  is  encountered  in  establishing  a  clear  trend  with  the  maximum  cyclic  strain  rather 
than  the  strain  range.  One  interesting  fact  was  that  the  extent  of  dynamic  creep  at  gross 
failure,  which  is  defined  as  the  increase  of  cyclic  strain  beyond  initial  elastic  deformation,  is 
roughly  independent  of  stress  range.  Under  the  frequency  of  1  Hz,  most  specimens  were 
found  to  fail  when  the  maximum  cyclic  strain  reaches  approximately  30%  above  the  level 
imposed  by  initial  stepwise  increase  in  strain  (Figure  15).  Although  it  may  be  premature  to 
propose  a  formal  criterion  without  additional  data  at  different  levels  of  frequency,  the 
observation  strongly  suggests  the  existence  of  a  critical  level  of  dynamic  creep  for  gross 
failures  of  the  angle-plied  cord-rubber  composites  and  possibly  for  the  tire  carcass. 
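
If a critical level of dynamic creep does exist, a recorded strain history could be screened against it as in the sketch below. The function, the threshold argument and the strain history are hypothetical. The threshold is treated here as an absolute increment in maximum cyclic strain above the value after the initial stepwise increase, which is only one possible reading of the 30% figure quoted above.

```python
def time_to_critical_creep(times, max_strains, critical_creep):
    """Return the first time at which dynamic creep reaches critical_creep.

    Dynamic creep is taken as the maximum cyclic strain minus its value
    right after the initial stepwise increase (the first record).
    Returns None if the critical level is never reached.
    """
    initial_strain = max_strains[0]
    for t, strain in zip(times, max_strains):
        if strain - initial_strain >= critical_creep:
            return t
    return None

# Hypothetical strain history (hours, strain in percent), not measured data.
hours = [0.0, 1.0, 2.0, 4.0, 8.0, 16.0]
strain_percent = [20.0, 28.0, 33.0, 38.0, 44.0, 52.0]
print(time_to_critical_creep(hours, strain_percent, critical_creep=30.0))
```

A relative threshold (for example 1.3 times the initial strain) would need only a change to the comparison inside the loop.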

Although  more  data  are  needed  before  drawing  a  final  conclusion,  it  appears  that  the 
correlation  between  the  fatigue  life  and  the  rate  of  cyclic  strain  increase  is  independent  of 
the  frequency  and  therefore  provides  a  universal  means  of  life  prediction  (Figures  12,  13 
and  14).  However,  the  correlation  between  the  fatigue  life  and  the  rate  of  temperature 
increase  seems  to  be  dependent  upon  the  frequency  used  (Figures  16,  17  and  18).  These 
facts  suggest  that,  unless  very  high  frequency  (above  10  Hz)  loading  is  applied,  the  origins 
of  dynamic  creep  and  damage  accumulation  of  composites  may  be  predominantly 
mechanical  fatigue  rather  than  thermal  fatigue.  In  our  future  work,  a  more  accurate 
determination  of  temperature  distribution  throughout  the  specimen  will  be  attempted  by 
embedded  thermocouple  technique.  Aside  from  these  questions  on  the  roles  in  fracture 
mechanisms,  the  results  demonstrate  the  measurement  of  local  strain  change  and  heat 
generation  as  a  viable  experimental  technique  for  real-time  monitoring  of  the  damage 
accumulation  process. 
IV.  4.  Development  of  Test  Methodologies  for  Composite  Tube

In  the  current  study  of  aircraft  tire  carcass  composites,  the  use  of  the  coupon  specimens 
with  free  edges  allows  a  direct  observation  of  the  processes  of  damage  accumulation  and 
delamination  failure.  At  the  same  time,  monitoring  of  the  line  markings  drawn  on  the 
specimen  edge  provides  a  straightforward  means  of  determining  interply  shear  strain  under 
circumferential  tension.  However,  the  cut  ends  of  cords  exposed  at  the  specimen  edge  act 
as  built-in  defects  in  the  process  of  damage  initiation.  Besides,  the  variation  of  meridional
tension  due  to  the  inflation  pressure  cannot  be  simulated  in  the  case  of  flat  coupon 
specimens  under  uniaxial  loading.  Considering  the  above-described  limitations,  a  new 
specimen  configuration  of  internally-pressurized  tube  of  cord-rubber  composite  was 
designed  with  an  internal  diameter  of  0.5  inch  and  a  gage  length  of  4  inches. 

Under  the  externally  applied  axial  tension,  the  air-inflated  tube  specimens  exhibited 
extensive  interply  shear  deformation  and  strain  hardening  behavior  as  in  the  case  of  flat 
coupons  (Figures  19  and  20).  One  interesting  observation  was  that,  even  with  the  presence 
of  pre-delaminated  area,  straining  of  reinforcing  cords  eventually  becomes  a  dominant 
mode  of  deformation  leading  to  the  slippage  of  the  specimen  from  the  grips.  Since  no
ungripped  reinforcing  cords  exist  in  the  tube  specimen  subjected  to  axial  tension
(regardless  of  gage  length  variation),  the  prevention  of  grip  slippage,  if  possible,  is  expected
to  lead  to  the  fracture  of  reinforcing  cords  rather  than  delamination.  As  discussed
previously,  in  the  case  of  aircraft  tire  carcass  composite  coupon  of  a  L/W  ratio  of  0.5  which 
is  less  than  1.28  (Cot  38  deg),  the  absence  of  any  ungripped  reinforcing  cords  led  to  a  final 
failure  mode  of  fiber  fracture  instead  of  the  delamination  under  static  tension. 

The  initial  test  results  involving  the  inflated  tube  specimens  of  aircraft  tire  carcass 
composites  clearly  indicate  that  the  occurrence  of  delamination  cannot  be  simulated  under 
static  tension  applied  in  the  axial  direction.  Therefore  our  future  study  will  be  concentrated
on  the  use  of  cyclic  tension  and/or  out-of-plane  bending  mode  to  induce  localized
delamination  and  to  avoid  straining  of  the  reinforcing  cords  as  a  dominant  mode  of 
deformation.  The  use  of  out-of-plane  bending  mode  of  deformation  for  the  inflated  tube 
specimen  will  become  essentially  an  upgrading  of  so-called  Mallory  tube  testing  (16)  which 
has  been  used  mainly  for  the  purpose  of  product  quality  control.  The  assessment  of 
deformation  and  fracture  behavior  of  carcass  composites  in  biaxial  stress  will  certainly 
complement  our  continuing  research  work  of  observing  damage  accumulation  process  of 
composites  under  simpler  uniaxial  tension. 
V.  CONCLUDING  REMARKS


The  described  research  effort  ultimately  aims  at  the  laboratory  prediction  of  the  durability  of 
aircraft  tire  based  on  the  study  of  deformation  and  fracture  behavior  of  cord-rubber 
composite  elements.  As  a  major  step  toward  the  goal,  our  research  program  plans  to 
investigate  the  mechanisms  of  local  damage  accumulation,  material  property  degradation 
and  structural  failure  of  the  angle-plied  cord-rubber  composites  which  represent  the  aircraft 
tire  carcass.  These  mechanisms  will  be  studied  under  the  laboratory  loading  conditions 
which  in  turn  simulate  individual  elements  of  complex  loading  for  the  tire  carcass.  The 
program  has  begun  in  the  1990  Summer  Faculty  Research  Project  (SFRP)  with  an 
emphasis  on  carcass  delamination  process  in  the  shoulder  area  using  the  model  composites. 
As  a  follow-up  study  of  SFRP,  the  current  Research  Initiation  Program  has  expanded  the 
scope  of  work  to  include  detailed  assessment  of  interply  shear  fatigue  behavior  of  nylon 
cord-reinforced  composites. 

In  angle-plied  cord-rubber  composites,  unusually  high  level  of  interply  shear  strain  was 
found  to  readily  induce  localized  failure  in  the  form  of  cord-matrix  debonding  under  cyclic 
as  well  as  static  tension.  As  the  strain  increases,  cord-matrix  debonding  developed  into 
matrix  cracking.  Debonding  and  matrix  cracking  were  further  widened  and  eventually 
developed  into  the  delamination  leading  to  gross  fracture  of  the  composites.  One  unique 
feature  of  aircraft  tire  carcass  composites  was  a  relatively  long  period  of  time  sustained  after 
the  onset  of  delamination.  As  in  the  case  of  the  model  composites,  the  critical  load  for  the 
onset  of  cord-matrix  debonding  seemed  to  constitute  the  endurance  limit  of  the  aircraft  tire 
carcass  composites.  Fatigue  lifetime  profile  of  aircraft  tire  carcass  composites  was 
dependent  on  the  level  of  frequency  (speed).  Compared  with  the  case  of  5  Hz,  consistently 
shorter  fatigue  lifetime  was  observed  under  the  frequency  of  1  Hz  at  a  given  stress  level.  A 
more  dominant  role  of  creep  effect  due  to  the  viscoelastic  properties  of  constituent  materials 
is  suspected  for  reduced  fatigue  lifetime  at  a  lower  frequency. 

The  process  of  damage  accumulation  in  aircraft  tire  carcass  composites  is  accompanied  by  a 
continuous  increase  of  temperature  and  cyclic  strain.  The  change  of  cyclic  strain  undergoes 
three  stages  after  an  initial  stepwise  increase.  At  first,  the  strain  increases  rapidly  but  at  a 
progressively  lower  rate.  In  the  second  region,  the  cyclic  strain  increases  at  a  constant  rate. 
Throughout  the  first  and  second  regions  which  consume  a  major  part  of  fatigue  life,  the 
damage  accumulation  occurs  in  the  forms  of  cord-matrix  debonding  and  matrix  cracking. 
Towards  the  end  of  the  second  region,  partial  delamination  appears.  The  first  region  of  the
cyclic  strain  vs  time  curve  was  found  to  be  considerably  longer  in  the  aircraft  tire  carcass
composites.  The  fact  suggests  that,  in  determining  the  rate  of  cyclic  strain  increase  for  the 
composites,  the  effect  of  creep  caused  by  viscoelastic  properties  of  constituent  materials  is 
as  important  as  the  effect  of  material  flow  and  subsequent  crack  growth  from  the  debonded 
cord  ends. 

The  fatigue  life  of  aircraft  tire  carcass  composites  was  found  to  be  inversely  proportional  to 
the  rate  of  cyclic  strain  increase.  One  interesting  fact  was  that  the  extent  of  dynamic  creep 
at  gross  failure,  which  is  defined  as  the  increase  of  cyclic  strain  beyond  initial  elastic
deformation,  is  roughly  independent  of  stress  range.  Although  it  may  be  premature  to 
propose  a  formal  criterion  without  additional  data  at  different  levels  of  frequency,  the 
observation  strongly  suggests  the  existence  of  a  critical  level  of  dynamic  creep  for  gross 
failures  of  angle-plied  cord-rubber  composites  and  possibly  for  the  tire  carcass.  The 
correlation  between  the  fatigue  life  and  the  rate  of  cyclic  strain  increase  is  independent  of 
the  frequency  and  therefore  provides  a  universal  means  of  life  prediction.  However,  the 
correlation  between  the  fatigue  life  and  the  rate  of  temperature  increase  seems  to  be  more 
dependent  upon  the  frequency  used. 
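
The inverse relation between fatigue life and the rate of cyclic strain increase, combined with a roughly constant dynamic creep at gross failure, can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the steady second-region strain rate holds over the whole life, so that the life is approximately the critical creep divided by that rate. The function name and every numerical value below are hypothetical and are not data from this study.

```python
def predicted_cycles_to_failure(critical_creep, strain_rate_per_cycle):
    """Estimate cycles to gross failure as critical creep / steady creep rate.

    critical_creep: assumed critical dynamic creep at gross failure (strain).
    strain_rate_per_cycle: assumed steady rate of cyclic strain increase
        (strain per cycle) taken from the second region of the curve.
    """
    if critical_creep <= 0 or strain_rate_per_cycle <= 0:
        raise ValueError("both inputs must be positive")
    return critical_creep / strain_rate_per_cycle


# Hypothetical values chosen only to show the inverse proportionality.
critical_creep = 0.05
for rate in (1e-6, 2e-6, 4e-6):
    cycles = predicted_cycles_to_failure(critical_creep, rate)
    print(f"rate = {rate:.1e} per cycle -> about {cycles:.0f} cycles")
```

Doubling the assumed strain rate halves the predicted life, which is the inverse proportionality described above. A real criterion would need the additional frequency data that the text calls for.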

Finally,  in  addition  to  the  flat  coupons,  the  use  of  internally-pressurized  tube  specimens  of 
angle-plied  composites  was  explored  to  simulate  the  condition  of  aircraft  tire  carcass  which 
has  no  cut  ends  of  cords  in  shoulder  region  but  delaminates  under  biaxial  tension.  Under 
the  externally  applied  axial  tension,  the  tubular  specimens  exhibited  extensive  interply  shear 
deformation  and  strain  hardening  behavior  as  in  the  case  of  flat  coupons.  However, 
straining of reinforcing cords eventually became a dominant mode of deformation since no
ungripped reinforcing cords exist in the tubular specimen subjected to axial tension.
Testing of a flat coupon with a very small length-to-width ratio demonstrated that the absence
of any ungripped reinforcing cords leads to a final failure mode of fiber fracture instead of
delamination.
VI. RECOMMENDATIONS


The  following  recommendations  can  be  made  for  our  continuing  study  on  the  deformation 
and  fracture  behavior  of  nylon  cord-reinforced  aircraft  tire  carcass  composites: 

(1)  Continue  fatigue  testing  until  statistical  confidence  in  the  fatigue  lives  is  obtained. 
Define the S-N curves for the onset of cord-matrix debonding and delamination as well as
gross  fracture. 

(2)  Continue  the  study  of  damage  accumulation  mechanisms  with  a  systematic  variation 
of  frequency,  temperature  and  the  ratio  of  minimum-to-maximum  stress.  Perform  a 
microscopic  examination  of  damage  accumulation  modes. 

(3)  Confirm  the  existence  of  a  critical  level  of  dynamic  creep  at  gross  failure  for  aircraft 
tire  carcass  composites.  Check  the  degree  of  reversibility  of  local  strain  encountered  during 
dynamic  creep. 

(4)  Assess  the  contribution  of  viscoelastic  properties  of  constituent  materials  to  the 
dynamic  creep  process  of  composites  by  performing  static  loading  experiments  at  various 
temperatures.  Determine  how  the  process  of  damage  accumulation  interacts  with  material 
property  changes. 

(5)  Define  the  exact  role  of  hysteretic  heating  in  determining  the  fatigue  lifetime  of 
composites  by  performing  isothermal  cyclic  testing.  Cyclic  testing  in  isothermal  condition 
will  resolve  the  issue  of  the  possible  interaction  between  hysteretic  heating  and  the 
progressive  increase  of  strain  due  to  damage  accumulation. 

(6)  Assess  the  failure  modes  of  internally-pressurized  composite  tube  under  cyclic 
tension  and/or  out-of-plane  bending  mode  to  induce  localized  delamination  and  to  avoid 
straining  of  the  reinforcing  cords  as  a  dominant  mode  of  deformation. 
TABLE 1

Specifications of Aircraft Tire Carcass Composites
(Cord: 1260/2 Nylon cord)
(Matrix: Proprietary rubber compound)

Cord Angle                      -38, +38, +38, -38 deg
Cord Modulus                    2.07 GPa (300 x 10^3 psi)
Matrix Modulus                  5.51 MPa (800 psi)
Cross-Sectional Area of Cord    0.342 mm^2 (5.3 x 10^-4 inch^2)
Total Width of Specimen         25.4 mm (1 inch)
Total Thickness of Specimen     6.35 mm (0.25 inch)
Figure 1

Stress-Strain Curves for Aircraft Tire Carcass Composites

Figure 2

Stress Range vs Fatigue Lifetime (S-N) Curve for Aircraft Tire Carcass Composites
(1 Hz Frequency, L/W = 3.3, Semi-Log Scale)

Figure 3

Stress Range vs Fatigue Lifetime (S-N) Curve for Aircraft Tire Carcass Composites
(1 Hz Frequency, L/W = 3.3, Log-Log Scale)

Figure 4

Stress Range vs Fatigue Lifetime (S-N) Curve for Aircraft Tire Carcass Composites
(5 Hz Frequency, L/W = 3.3)

Figure 5

Stress Range vs Fatigue Lifetime (S-N) Curve for Aircraft Tire Carcass Composites
(5 Hz Frequency, L/W = 5.3)

Figure 6

Stress Range vs Fatigue Lifetime (S-N) Curve for Aircraft Tire Carcass Composites
(10 Hz Frequency, L/W = 5.3)
[Plot axis label: Maximum Deflection (inch)]

Figure 7

Maximum Cyclic Deflection vs Number of Cycles (1 Hz Frequency)

Figure 8

Maximum Cyclic Deflection vs Number of Cycles (5 Hz Frequency)

[Plot legend: 5 Hz, 4" L., 1.1-0.2 ksi; 5 Hz, 4" L., 0.9-0.2 ksi; 5 Hz, 2.5" L., 1.0-0.2 ksi]

Figure 9

Maximum Cyclic Deflection vs Number of Cycles (10 Hz Frequency)


Figure  10 

Creep  of  a  Four-Element  Model  Plotted  on  a  Linear  Time  Scale  (from  Ref.  15) 
Figure 11

Creep of a Four-Element Model Plotted on a Log Time Scale (from Ref. 15)

Figure 12

Cyclic Strain Range vs Number of Cycles (1 Hz Frequency)

Figure 15

Dynamic Creep at Gross Failure vs Stress Range (1 Hz Frequency)

Figure 16

Specimen Temperature Rise vs Number of Cycles (1 Hz Frequency)

Figure 17

Specimen Temperature Rise vs Number of Cycles (5 Hz Frequency)

Figure 18

Specimen Temperature Rise vs Number of Cycles (10 Hz Frequency)
1.0 ABSTRACT


The  influence  of  viscoelastic  constrained-layer  damping  treatment  on  parametric 
resonances of single-degree-of-freedom (SDOF) systems and autoparametric resonances of
multiple-degree-of-freedom  (MDOF)  nonlinear  systems  possessing  autoparametric  coupling  was 
investigated.  The  results  show  that  commercially  available  aluminium-backed  treatment  is 
effective  in  suppressing  parametric  resonances  in  SDOF  systems  by  moving  the  regions  of 
parametric  resonance  away  from  the  frequency  axis.  In  the  MDOF  systems  the  damping  affects 
the  highly  nonlinear  response  characteristics  of  each  mode  and  the  nonlinear  coupling  between 
modes; it can suppress the modulation between modes. In general, the effect of increased
application  of  damping  treatment  is  to  contract  the  regions  of  nonlinear  modal  interaction  and,  in 
some  cases,  actually  suppress  the  nonlinear  modal  coupling  entirely  with  a  sufficient  amount  of 
damping  treatment.  Experimental  results  include  slow  swept-sine  excitations  at  constant 
amplitude  and  slow  swept-amplitude  excitations  at  constant  frequency.  Particular  attention  was 
paid  to  the  nonlinear  resonances  and  the  modal  interaction  regions  bounded  by  the  Hopf 
bifurcation. 
2.0 INTRODUCTION


Parametric resonances are not uncommon in structural vibrations. Zavodney [1] and
Zavodney et al. [2-4] have provided a summary of parametric resonance in SDOF nonlinear
structural  systems.  Damping  is  a  common  method  of  controlling  or  reducing  vibration  for 
externally  excited  resonances.  However,  damping  plays  quite  a  different  role  in  parametrically 
excited  systems.  When  systems  exhibit  nonlinear  behavior,  the  analysis  is  more  complicated; 
the nonlinearity and the damping together affect the system response, so it is not always obvious
what part the damping plays in the response. Zavodney and Shihada [5] investigated the influence
of  linear  viscous  damping  on  the  fundamental  and  principal  parametric  resonances  of  SDOF 
systems  possessing  quadratic  and  cubic  nonlinearities.  The  results  showed  that  linear  viscous 
damping  plays  a  significantly  different  role  than  in  the  externally  excited  oscillator;  if  reduction  in 
amplitude  at  resonance  is  the  objective,  then  in  some  cases  certain  critical  levels  of  damping 
must be exceeded; otherwise one can increase the damping by an order of magnitude but realize
less  than  5%  reduction  in  the  response  amplitude. 

When  the  structure  possesses  more  than  one  mode,  as  is  usually  the  case,  there  is  the 
possibility  of  nonlinear  modal  coupling.  When  modes  are  coupled,  it  is  possible  to  exchange 
energy  from  a  directly  excited  mode  to  another  mode.  The  end  result  is  that  more  than  one 
mode  is  participating  in  the  response  and,  hence,  the  structure  is  vibrating  at  other  frequencies  in 
addition to the frequency of the excited mode, which is not always the same as the excitation
frequency.  As  a  result  the  analysis  becomes  even  more  complicated.  Attempting  to  find 
appropriate  mathematical  models  for  these  behaviors  is  extremely  difficult.  One  particular 
mathematical  model  may  describe  one  type  of  behavior,  but  when  the  excitation  frequency  is 
changed by only 0.1 Hz, the behavior is something else, qualitatively and quantitatively; i.e., a
bifurcation  has  occurred.  The  nonlinear  coupling  responsible  for  this  behavior  is  further 
enhanced  if  the  structure  possesses  an  internal  resonance;  an  internal  resonance  occurs 
whenever  any  natural  frequencies  are  commensurate  (i.e.,  in  a  2:1,  3:1,  3:2,  etc.  ratio).  When  a 
structure  simultaneously  possesses  an  internal  resonance  and  appropriate  nonlinear  coupling 
terms, it is possible for one mode to parametrically excite another mode; this phenomenon is
called an autoparametric resonance. When this happens, the nonlinear effects are greatly
intensified and completely dominate the response. Nayfeh and Zavodney [6] and Balachandran
and Nayfeh [7] provided a theoretical model and experimental results showing that such behavior
can  lead  to  long-time  responses  that  are  not  steady-state;  mathematically  a  Hopf  bifurcation  has 
occurred. 
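
The internal-resonance condition described above (natural frequencies in a 2:1, 3:1, or 3:2 ratio) can be checked numerically when tuning a structure. The following Python sketch is only an illustration: the function name, the 2% tolerance, and the sample frequencies are assumptions and do not come from the experiments reported here.

```python
from math import gcd


def internal_resonance_ratio(f_low, f_high, max_order=3, rel_tol=0.02):
    """Return (p, q) if f_high : f_low is within rel_tol of p : q, else None.

    Only reduced ratios with q < p <= max_order are tested, which gives
    2:1, 3:1 and 3:2 for the default max_order.
    """
    if f_low <= 0 or f_high <= 0:
        raise ValueError("frequencies must be positive")
    measured = f_high / f_low
    best = None
    best_err = rel_tol
    for q in range(1, max_order):
        for p in range(q + 1, max_order + 1):
            if gcd(p, q) != 1:
                continue  # skip ratios that are not in lowest terms
            err = abs(measured - p / q) / (p / q)
            if err <= best_err:
                best, best_err = (p, q), err
    return best


# Hypothetical natural frequencies in Hz.
print(internal_resonance_ratio(10.0, 20.1))   # close to 2:1
print(internal_resonance_ratio(10.0, 17.0))   # no low-order ratio nearby
```

In practice the acceptable tolerance depends on the damping and excitation level, so the fixed 2% used here is only a placeholder.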

The  objective  of  this  project  was  to  conduct  experiments  on  nonlinear  SDOF  and  MDOF 
flexible  structures  to  study  the  effects  of  damping  on  parametric  and  autoparametric  resonance. 
The  experiments  were  performed  on  structures  fabricated  from  prismatic  beams  and  lumped 
masses. These types of structures were easy to prepare and tune; i.e., by adjusting the position
of  the  masses  the  resonant  frequencies  could  be  changed.  This  was  essential  for  the  MDOF 
structure  when  an  internal  resonance  was  desired. 


3.0  SINGLE-DEGREE-OF-FREEDOM  STRUCTURE 

3.1  Theoretical  Analysis 

Many  structural  elements  can  be  modelled  as  slender  beams  and  concentrated  masses. 
When the support undergoes motion, the beam is subject to vibration, either external or
parametric, or both. In this section the governing equations are derived for in-plane flexural
vibration  of  a  thin  elastic  prismatic  cantilever  beam  subject  to  parametric  vibration  at  the  base. 
The  nonlinear  terms  arising  from  the  curvature  and  coupling  effects  are  retained.  Galerkin's 
method  is  used  to  discretize  the  governing  nonlinear  partial  differential  equation  of  motion.  The 
linear  eigenvalue  problem  is  solved  to  determine  the  eigenvalues,  and  the  eigenfunction  is  used 
to  determine  the  coefficients  of  the  time  modulation  equation.  A  multiple-scales  perturbation 
solution  is  obtained  for  the  temporal  modulation  equation. 
The governing equation of motion is derived using Euler-Bernoulli beam theory. The
beam, shown in Figure 1, is cantilevered at the oscillating support, has a length L, and carries a
concentrated  mass  m  at  an  arbitrary  distance  s  =  d  along  the  neutral  axis  of  the  beam.  We 
assume  that  the  thickness  of  the  beam  is  so  small  compared  with  the  length  that  the  effects  of 
shearing  deformation  and  rotatory  inertia  of  the  beam  can  be  neglected.  Since  we  are 
investigating  parametric  resonances  (transverse  vibration),  we  will  not  consider  axial  resonances 
of  the  beam  since  the  frequency  of  excitation  will  be  far  below  the  first  axial  resonance.  If  the 
beam  is  kept  relatively  short  (<  30  beam  widths),  the  transverse  vibration  is  purely  in  plane  (if  the 
lumped  mass  is  symmetrical  with  the  centerline);  if  the  excitation  frequency  is  far  below  the  first 
torsional  mode,  then  we  can  neglect  the  torsional  modes  of  the  beam  in  the  analysis.  These 
assumptions  are  consistent  with  observations  in  the  laboratory.  Also,  we  do  not  observe  any 
combination  or  internal  resonances.  When  the  mass  is  removed  and  the  length  is  increased  by 
an  order  of  magnitude,  combination  resonances  do  occur. 

According to the Euler-Bernoulli beam theory, the bending moment at any cross-section s is
given by

$$M(s) = EI\,\kappa(s), \qquad (1)$$

where E is the elastic (Young's) modulus, I is the cross-sectional moment of inertia, and $\kappa^{-1}$ is the
radius of curvature at section s. From Figure 1,

$$\kappa(s) = \phi_s \qquad (2)$$

and

$$\sin\phi = v_s, \qquad (3)$$

where $\phi(s)$ is the slope. The subscript s denotes a partial derivative with respect to s, and the
subscript t or the overdot denotes a partial derivative with respect to time t. Differentiating (3),
substituting into (2) and (1), and expanding the radical, we obtain

$$M(s) = EI\,v_{ss}\left[1 + \tfrac{1}{2}v_s^2 + \tfrac{3}{8}v_s^4 + \cdots\right]. \qquad (4)$$

For  the  analysis  presented  here,  up  to  third-order  terms  will  be  retained. 

The  moment  in  the  beam  is  the  result  of  three  sources: 

$$M(s) = M_1 + M_2 + M_3, \qquad (5)$$

where $M_1$ is the moment at s due to the lateral inertia of the beam element $d\xi$ and the mass m,
$M_2$ is the moment due to the longitudinal inertia of the beam element $d\xi$ and the mass m, and $M_3$
is the moment at s caused by the angular acceleration of the mass m due to its mass moment of
inertia J. In the derivation that follows, the following nomenclature will be used:

x, y - Newtonian Cartesian reference frame
g - acceleration of gravity
s - reference variable along beam
$\xi$ - variable of integration along beam
$d\xi$ - differential length of beam element
L - beam length
w - beam width
t - beam thickness
d - position of mass center of mass m
$\rho$ - mass density of homogeneous beam per unit length
c - coefficient of viscous damping
m - mass of concentrated weight on beam
J - polar moment of inertia of mass m
$v(\xi,t)$ - lateral displacement of beam element $d\xi$
$u(\xi,t)$ - longitudinal displacement of beam element $d\xi$
$\phi(s)$ - angle with respect to vertical of beam at s
$\kappa(s)$ - curvature of beam at s
I - cross-sectional moment of inertia of beam
E - modulus of elasticity of beam

Using the sign convention indicated in Figure 1, the total moment of the beam section
from s to L due to the lateral (inertial) motion is

$$-\int_s^L \rho\,\ddot{v}(\xi,t)\left[\int_s^{\xi}\cos\phi(\eta,t)\,d\eta\right]d\xi. \qquad (6)$$

The moment due to m is similarly obtained as

$$-m\,\ddot{v}(d,t)\int_s^{d}\cos\phi(\xi,t)\,d\xi, \qquad (7)$$

and the moment of the assumed viscous damping is

$$-\int_s^L c\,\dot{v}(\xi,t)\left[\int_s^{\xi}\cos\phi(\eta,t)\,d\eta\right]d\xi. \qquad (8)$$

Combining these and using the Dirac delta function to locate the mass, we obtain

$$M_1 = -\int_s^L \left\{[\rho + m\,\delta(\xi - d)]\,\ddot{v} + c\,\dot{v}\right\}\left[\int_s^{\xi}\cos\phi\,d\eta\right]d\xi. \qquad (9)$$


In a similar fashion, we obtain

$$M_2 = \int_s^L \left\{\rho\,[\ddot{u} - g] + m\,\delta(\xi - d)\,[\ddot{u} - g]\right\}\left[\int_s^{\xi}\sin\phi\,d\eta\right]d\xi \qquad (10)$$

and

$$M_3 = -J\,\ddot{\phi}(d,t) = -\int_s^L J\,\delta(\xi - d)\,\ddot{\phi}\,d\xi. \qquad (11)$$

The longitudinal displacement caused by the shortening effect is

$$u(\xi,t) = \xi - \int_0^{\xi}\cos\phi(\eta,t)\,d\eta, \qquad (12)$$

which, when combined with the displacement z(t) of the base, yields the total axial displacement

$$u(\xi,t) = \xi - \int_0^{\xi}\cos\phi(\eta,t)\,d\eta + z(t). \qquad (13)$$


To obtain the governing differential equation, we differentiate (4) and (5) twice with
respect to s. This involves differentiating (9), (10), and (11) and using Leibnitz' rule. The second
time derivative of the axial displacement $u(\xi,t)$ is obtained by substituting for $\cos\phi$ from (3) and
differentiating (13) with respect to t twice to yield

$$\ddot{u} = \tfrac{1}{2}\int_0^{\xi}\left(v_\eta^2\right)_{tt}\,d\eta + \ddot{z}(t) + \cdots. \qquad (14)$$

Differentiating (9)-(11) twice with respect to s and using (14) yields
32M, 


3s2 


{  [  p  +  m  8(s  -  d) )  v(s,t)  +  c  v(s.t) } 
•  a  , L

•  [p  +  m  8(s  -  d)]  V  -  ^  (Nvs)  +  vs  vss  f  [p  +  m  5(£  -  d)]  vd$ 

J  s 

*  h  lJ5<s'd)  ^  1  +  5vs  +  -J  +  vs  vs  } 


'  12  1  L 

+  1  -2VS  -  ...  cv  +  vsvss  f  c  v  d£  =  0  .  (20) 

s 

This field equation is subject to the following boundary conditions:

$$v(0,t) = 0, \qquad (21)$$

$$v_s(0,t) = 0, \qquad (22)$$

$$v_{ss}(L,t) = 0, \qquad (23)$$

$$v_{sss}(L,t) = 0. \qquad (24)$$


The  governing  problem  (20)-(24)  is  nonlinear  and  does  not  admit  a  closed-form  solution. 
Therefore,  an  approximate  solution  was  sought  that  satisfies  both  the  equation  and  the  boundary 
conditions.  Since  the  boundary  conditions  are  spatial  and  independent  of  time,  the  solution  of  the 
nonlinear  problem  is  assumed  to  take  the  form 
$$v(s,t) = \sum_n r\,\psi_n(s)\,G_n(t), \qquad (25)$$

where r is a scaling factor, $\psi_n(s)$ is the shape function of the nth linear mode, and $G_n(t)$ is the time
modulation of the nth mode.

The undamped linear free vibration problem is governed by

$$EI\,v_{ssss} + [\rho + m\,\delta(s - d)]\,\ddot{v} = 0$$

and is subject to the same boundary conditions given by (21)-(24). Without loss of generality, we
will solve explicitly for the first mode, with the understanding that the eigenfunction of the nth
mode, and its associated eigenvalue, correspond to the nth characteristic.

To  solve  this  problem,  the  beam  is  separated  into  two  parts  at  the  concentrated  mass. 
The  appropriate  boundary  conditions  are  imposed  on  the  two  sections  to  determine  the 
coefficients  of  the  general  solution. 

The general solution can be stated as the composite function

$$\begin{aligned}
\psi(s) = {} & C_1\left\{[\sin ks - \sinh ks] - A\,[\cos ks - \cosh ks]\right\} \\
& + C_1\,U(s-d)\,\{\,h_1[\sin k(s-d) - \sinh k(s-d)] \\
& \quad + h_3[\cos k(s-d) - \cosh k(s-d)] \\
& \quad - A h_2[\sin k(s-d) - \sinh k(s-d)] \\
& \quad - A h_4[\cos k(s-d) - \cosh k(s-d)]\,\},
\end{aligned}$$

where $C_1$ is an arbitrary constant and k is the characteristic root of the frequency equation


4pJL* 

-£g~  lhi  h4  *  h2  N 


*(^] 


r  f  .  kd  .  .  kdl  . 

I  h,  Ism  —  +  sinh  ~J  +  h2  Ic 
o,  a  f  *  kd  u  kdl  A

+  2k4  ll  -  cos  —  cosh  —I  =  0 


(28) 


The frequency of oscillation is given by

$$\omega^2 = k^4\,\frac{EI}{\rho}. \qquad (29)$$

Since  a  finite  number  of  modes  is  being  considered,  this  continuous  system  can  be 
discretized  by  any  of  the  variational  methods  available  such  as  Rayleigh-Ritz.  When  the 
assumed  comparison  function  is  the  eigenfunction  in  particular,  the  procedure  is  known  as 
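Galerkin's method. It requires multiplying each term by the eigenfunction $\psi(s)$ and then
integrating from 0 to L. When this procedure is applied to (20) we obtain

$$\ddot{G} + 2\epsilon\zeta\dot{G} + \theta^2(1 - \epsilon f\cos\Omega t)\,G + \epsilon\alpha G^3 + \epsilon\kappa_1 G\dot{G}^2 + \epsilon\kappa_2 G^2\ddot{G} - 2\epsilon\zeta\nu\,G^2\dot{G} = 0, \qquad (30)$$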


where 


C  - 


gH?i 

2ep(H,,  +  pH12  +  jX2  H13] 


e2  = 


f  (r)4  [H3i  +  uHJ  -  g  (H33  +  pHM) 

(H,,  +  pH12  +  \\2  HJ 


+  pH,z  +  jA2  H,J 
[HS1  +  |iHM  +  X2  jHgJ  X2
1  =  cJH,,  +  hH12  +  jX2  H13] 


[H81-Hb  +  hHm-hHmHX*H6S]X2 
£[H11  +  nHt2  +  iX2H13] 


ia  Hti 
v  *  X.2  TJ- 
**21 


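(31)

The $H_{ij}$ are defined in Appendix C.

A new time scale is introduced as

$$\tau = \theta t. \qquad (32)$$

The time derivatives become

$$\frac{d}{dt} = \theta\,\frac{d}{d\tau} \quad\text{and}\quad \frac{d^2}{dt^2} = \theta^2\,\frac{d^2}{d\tau^2}. \qquad (33)$$

Hence, equation (30) is transformed into

$$G_{\tau\tau} + 2\epsilon\hat{\zeta}G_\tau + [1 - \epsilon f\cos(\phi\tau)]\,G + \epsilon\hat{\alpha}G^3 + \epsilon\kappa_1 G G_\tau^2 + \epsilon\kappa_2 G^2 G_{\tau\tau} - 2\epsilon\hat{\zeta}\nu\,G^2 G_\tau = 0, \qquad (34)$$

where

$$\hat{\zeta} = \frac{\zeta}{\theta}, \quad \hat{\alpha} = \frac{\alpha}{\theta^2}, \quad\text{and}\quad \phi = \frac{\Omega}{\theta}. \qquad (35)$$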


This  equation  contains  cubic  nonlinearities  and  nonlinear  damping,  and  hence  it  does  not  lend 
itself  to  a  closed-form  solution.  It  must  be  analyzed  by  a  perturbation  or  a  numerical  technique. 
Moreover, the coefficients are evaluated according to (31) and (C1) and are dependent on $\psi(s)$.
When the amplitude of vibration gets large, $\psi(s)$ may no longer approximate the mode shape,
and consequently the coefficients of the time modulation equation have increasing error.
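
Since the time modulation equation must be analyzed by a perturbation or a numerical technique, a small numerical experiment can illustrate the principal parametric resonance. The Python sketch below integrates a simplified stand-in for (34) that keeps only the linear damping, the parametric term, and the cubic stiffness term; the nonlinear inertia and nonlinear damping terms are dropped for brevity. All coefficient values, the function name, and the hand-written fourth-order Runge-Kutta integrator are assumptions made for illustration.

```python
import math


def simulate(eps_f, eps_zeta=0.01, eps_alpha=0.1, phi=2.0,
             g0=0.01, t_end=600.0, dt=0.02):
    """Integrate G'' + 2*eps_zeta*G' + (1 - eps_f*cos(phi*t))*G + eps_alpha*G**3 = 0
    with classical RK4 and return the peak |G| over the final 10% of the run."""
    def accel(t, g, v):
        return -(2.0 * eps_zeta * v
                 + (1.0 - eps_f * math.cos(phi * t)) * g
                 + eps_alpha * g ** 3)

    steps = int(round(t_end / dt))
    t, g, v = 0.0, g0, 0.0
    peak = 0.0
    for i in range(steps):
        k1g, k1v = v, accel(t, g, v)
        k2g = v + 0.5 * dt * k1v
        k2v = accel(t + 0.5 * dt, g + 0.5 * dt * k1g, v + 0.5 * dt * k1v)
        k3g = v + 0.5 * dt * k2v
        k3v = accel(t + 0.5 * dt, g + 0.5 * dt * k2g, v + 0.5 * dt * k2v)
        k4g = v + dt * k3v
        k4v = accel(t + dt, g + dt * k3g, v + dt * k3v)
        g += dt * (k1g + 2.0 * k2g + 2.0 * k3g + k4g) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        t += dt
        if i >= 0.9 * steps:
            peak = max(peak, abs(g))
    return peak


# Excitation at twice the normalized natural frequency (phi = 2).
print("strong excitation:", simulate(eps_f=0.2))   # grows to a finite amplitude
print("weak excitation:  ", simulate(eps_f=0.02))  # decays toward the trivial response
```

With these assumed coefficients the trivial response loses stability once the excitation term outweighs the damping, and the cubic term then limits the growth. Below that level the small initial disturbance simply decays.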

A first-order uniform solution of (34) is sought using the method of multiple scales in the
form

$$G(\tau;\epsilon) = u_0(T_0,T_1) + \epsilon\,u_1(T_0,T_1) + \cdots. \qquad (36)$$

The detuning $\sigma$ was introduced into the normalized natural frequency of the system as

$$1 = \left(\tfrac{1}{2}\phi\right)^2 + \epsilon\sigma \qquad (37)$$


because  the  frequency  of  the  response  for  a  principal  parametric  resonance  is  exactly  one  half 
that  of  the  excitation.  Following  the  standard  procedure  of  eliminating  secular  terms,  we  obtain 


1  f  f  dP 

G(t)  =  a  cos  (g  <)>x  +  p)  +  +  32 


f4g 

<02 


COS  [3(j  <t*T  +  P)] 


r v  1  1 

+  ^  a2  sin  (3(j <(«  +  P)1  )  +  ...  , 


(38) 


where a and $\beta$ are given by

1 

a  =  -  e£  (4  va2>  a  +  sin2p  .  (39) 

£  efa 

ap  =  -  -  (0  +  aea2)a  +  ^  cos2p  (40) 

Periodic solutions of (34) correspond to the fixed points (i.e., constant solutions) of the
modulation equations (39) and (40), which in turn correspond to $a' = 0$ and $\beta' = 0$. By squaring
and summing (39) and (40) and after imposing $a' = 0$ and $\beta' = 0$, we obtain


a2  =  °  .  aj  [-  f±  Vr2"" 4qs  ] 


(41) 


where 
q  =  1  (1  -ea)C2v2  +  a*  ,

r  =  2a(ae  +  e£2v)  -  2v£2  , 
s  =  4(1  -  e<r)C2  +  a2  -  Jf2  . 

ae  =  J(3a  +  (1  -  ectKk,  -  3<2)]  ,  (42) 


where $\phi^2$ was replaced with $4(1 - \epsilon\sigma)$ from (37). A trivial fixed point is unstable if and only if


f  >  fCTjt  =  ,  (43) 

otherwise  it  is  stable. 

A  nontrivial  fixed  point  is  stable  if  and  only  if  the  real  parts  of  both  eigenvalues  of  the 
coefficient  matrix  in  (44)  and  (45)  are  less  than  or  equal  to  zero: 


a,  =  e 


-  c  * ;  ;*4  ♦  5  sin2pc 


fe{  ]„ 

®i  +  a0cos2pojp1  , 


(44) 


Pi 


f 


3i  + 


T  sinZPolP, 


(45) 


3.2  Experimental  Analysis 

Experiments were performed on the structure shown in Figure 1. The excitation was a
base  displacement  (along  the  axis  in  the  vertical  direction)  at  a  frequency  nearly  twice  that  of  the 
first  flexural  mode  of  the  cantilever  beam.  The  resonant  response  that  ensues  is  called  a 
principal  parametric  resonance.  The  experiments  consisted  of  frequency  sweeps  at  constant- 
amplitude  acceleration  and  amplitude  sweeps  at  constant  frequency.  The  amplitude  was  held 
constant  by  a  computer-controlled  feedback  loop;  as  the  frequency  was  changed,  the 
corresponding  excitation  amplitude  required  adjustment  to  keep  the  acceleration  level  constant. 
The  excitation  level  was  chosen  as  large  as  possible  without  causing  excessive  amplitudes  of 
displacement;  a  maximum  level  of  0.350  g's  was  selected.  The  response  was  measured  by 
strain  gages  mounted  on  the  beam. 
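
The need to adjust the excitation amplitude during a constant-acceleration frequency sweep follows from the harmonic relation between acceleration and displacement amplitudes, a = (2 pi f)^2 X. The Python sketch below shows the displacement amplitude a controller would have to command at each frequency. The function name and the sweep frequencies are assumptions for illustration; only the 0.350-g level is taken from the text.

```python
import math

G0 = 9.80665  # standard gravity, m/s^2


def displacement_amplitude(accel_g, freq_hz):
    """Base displacement amplitude (m) giving a harmonic acceleration of
    accel_g (in g) at freq_hz, from a = (2*pi*f)**2 * X."""
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    omega = 2.0 * math.pi * freq_hz
    return accel_g * G0 / omega ** 2


# Hypothetical sweep frequencies in Hz at the 0.350-g level.
for f in (8.0, 10.0, 12.0):
    x_mm = 1000.0 * displacement_amplitude(0.350, f)
    print(f"{f:5.1f} Hz -> {x_mm:.3f} mm")
```

Because the required displacement falls with the square of the frequency, a sweep at fixed displacement would not hold the acceleration level constant.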
The first beam, called beam 1, was a very flexible steel beam. Its dimensions and
properties are listed in Table 1 in Appendix A. Using these values, a theoretical frequency-response
curve was computed and is shown in Figure 2. This figure shows
hardening  behavior  because  the  curve  bends  over  to  the  right.  The  theoretical  amplitude 
response  for  this  beam  is  shown  in  Figure  3  for  a  direct  excitation  to  the  principal  parametric 
resonance.  It  shows  nonlinear  behavior  because  the  amplitude  curve  bends  over.  If  the  system 
were  linear,  the  line  would  be  perfectly  vertical. 

The  time  history  of  the  table  acceleration  and  the  response  displacement  (strain  gage)  is 
shown  in  Figure  4.  From  this  figure  it  is  easy  to  see  that  the  response  is  occurring  at  exactly 
one-half  the  frequency  of  the  excitation.  It  also  shows  that  the  excitation  amplitude  is  constant 
throughout  the  experiment.  The  frequency  response  for  beam  1  for  two  levels  of  excitation  is 
shown  in  Figure  5.  The  amplitude  response  for  beam  1  is  shown  in  Figure  6  for  an  excitation 
frequency  of  2.000.  Both  of  these  figures  are  in  agreement  with  the  theory. 

The flexibility of beam 1 was such that it allowed large amplitudes and hence suffered
fatigue cracking; it failed prematurely (meaning that a complete battery of experiments could not
be performed on it before it failed). Therefore, a thicker, longer, and wider beam (beam 2) was selected
for the damping treatment experiments. Its dimensions and properties are also shown in Table 1
in  Appendix  A. 

The  theoretical  frequency  response  for  beam  2  for  two  excitation  levels  is  shown  in 
Figure 7. These curves are quite similar to the ones obtained for beam 1. The experimental
frequency  response  of  the  undamped  structure  (here  undamped  means  before  any  damping 
treatment  was  applied)  for  two  levels  of  excitation  is  shown  in  Figure  8.  It  shows  that  the  system 
is  softening  because  it  bends  to  the  left.  This  behavior  was  neither  expected  nor  predicted. 
These  experiments  were  conducted  by  increasing  the  frequency  of  excitation  very  slowly  while 
simultaneously  keeping  the  table  acceleration  constant.  The  arrows  indicate  jumps;  for  example, 
when the excitation level is 0.350 g's (denoted by the circles), the response jumps up to the large
amplitude and then slowly decreases as the frequency is increased. When the frequency is
decreased, the response follows the same curve and extends it into an overhang. The response
eventually  jumps  down  to  the  trivial  response.  The  response  for  the  0.250-g  excitation  level  is 
qualitatively  the  same  but  at  a  lower  amplitude. 

Additional  experiments  at  the  same  levels  of  excitation  were  performed  with  one  and  two 
strips  of  viscoelastic  damping  treatment  applied,  as  shown  in  Figures  9  and  10.  For  the  0.250-g 
excitation  level,  the  principal  parametric  resonance  was  completely  suppressed  with  only  two 
strips of damping treatment applied. Two 0.10-inch strips of damping treatment cover only 5%
of  the  surface  area  of  beam  2  between  the  support  and  the  lumped  mass. 

The  regions  of  parametric  resonance  in  the  excitation-amplitude  versus  the  excitation- 
frequency  plane  were  also  determined.  These  experiments  were  performed  by  repeating  the 
frequency  response  experiments  for  different  levels  of  excitation  and  plotting  only  the  bifurcation 
points  where  either  the  trivial  response  becomes  unstable  (during  both  a  sweep  up  and  a  sweep 
down) or a nontrivial response jumps to the trivial response. An example is shown in Figure 11.
This  figure  shows  three  boundaries  which  divide  the  domain  into  two  regions;  the  inner-most 
region  (bounded  by  the  center  curve  and  the  right  curve)  represents  the  loss  of  trivial  stability  for 
the  linear  system.  The  additional  curve  on  the  far  left  represents  the  extension  of  the  instability 
region  caused  by  the  overhang  due  to  the  softening  nonlinearity  in  the  system.  Because 
nontrivial  responses  exist  at  excitation  frequencies  below  those  predicted  by  linear  theory,  the 
system  is  said  to  possess  a  subcritical  instability.  The  amount  of  overhang  is  reduced  for 
increased  damping  levels,  as  shown  in  Figures  12  and  13. 

A  summary  of  the  bifurcation  boundaries  showing  only  the  region  where  nontrivial 
responses  exist  is  shown  in  Figure  14.  As  the  equivalent  linear  viscous  damping  coefficient 
increases,  the  instability  regions  move  away  from  the  frequency  axis.  The  bifurcation  boundaries 
predicted  from  the  theory  using  the  experimentally  measured  damping  coefficients  are  shown  in 
Figure  16.  The  values  of  the  equivalent  damping  coefficient  were  obtained  from  the  free 
response  data  shown  in  Figure  15.  The  Eigensystem  Realization  Algorithm  (ERA)  was  used  to 
estimate the damping coefficients and natural frequencies. The theoretical curves are compared
with the corresponding experimental data in Figures 17, 18, and 19. The agreement is excellent.
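
For readers who want a quick cross-check of an equivalent viscous damping coefficient from free-response data, the Python sketch below uses the logarithmic decrement. This is a simpler, swapped-in technique and is not the Eigensystem Realization Algorithm used in this work. The function name and the synthetic peak data are assumptions for illustration.

```python
import math


def damping_ratio_from_peaks(peaks):
    """Estimate the viscous damping ratio from successive positive peak
    amplitudes of a free decay, each one period apart (log decrement)."""
    if len(peaks) < 2 or min(peaks) <= 0:
        raise ValueError("need at least two positive peak amplitudes")
    n_periods = len(peaks) - 1
    delta = math.log(peaks[0] / peaks[-1]) / n_periods
    return delta / math.sqrt(4.0 * math.pi ** 2 + delta ** 2)


# Synthetic peaks generated for an assumed damping ratio of 0.02.
zeta_true = 0.02
ratio = math.exp(-2.0 * math.pi * zeta_true / math.sqrt(1.0 - zeta_true ** 2))
synthetic_peaks = [ratio ** i for i in range(6)]
print(damping_ratio_from_peaks(synthetic_peaks))
```

The logarithmic decrement assumes a single lightly damped linear mode, so it is best suited to small-amplitude free decay where the nonlinear terms are negligible.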

3.3  Discussion  of  Results 

Principal  parametric  resonances  can  be  attenuated  by  the  application  of  constrained- 
layer  damping  treatment.  The  experiments  show  that  just  a  small  quantity  of  the  material  can  be 
effective.  The  theoretical  model  did  a  good  job  in  predicting  the  stability  boundaries  for  the 
parametric  resonance,  but  was  unable  to  predict  the  softening  behavior  of  beam  2.  It  also  did  not 
predict  the  jump  down.  More  sophisticated  modelling  to  account  for  damping  and  higher  order 
perturbation  analysis  may  be  required. 

In  summary,  increased  application  of  the  damping  treatment  reduces  the  amplitude  of  the 
resonant  response,  and  reduces  the  region  of  parametric  instability.  The  only  way  to  completely 
suppress  a  parametric  resonance  is  to  move  the  instability  region  far  enough  away  from  the  axis 
so that it is completely removed from the area of interest.


4.0  MULTIPLE-DEGREE-OF-FREEDOM  STRUCTURE 

4.1  Theoretical  Analysis 

Several researchers have investigated this structure and developed mathematical
models of it. Following Haddow et al. [10], the displacements of $m_1$ and $m_2$ are
given by

$$ v_1(x,t) = d_1\,\phi_{11}(x)\,u_1(t) + d_1\,\phi_{12}(x)\,u_2(t) \tag{46} $$

$$ v_2(y,t) = d_1\,\phi_{21}(y)\,u_1(t) + d_1\,\phi_{22}(y)\,u_2(t) \tag{47} $$

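where the $u_i$ are given by

$$
\begin{aligned}
\ddot{u}_1 &+ 2\epsilon\zeta_1\dot{u}_1 + \omega_1^2 u_1 + \epsilon X_{11}(\dot{u}_1)^2 + \epsilon X_{12}\dot{u}_1\dot{u}_2 + \epsilon X_{13}(\dot{u}_2)^2 \\
&+ \epsilon Y_{11}u_1\ddot{u}_1 + \epsilon Y_{12}u_1\ddot{u}_2 + \epsilon Y_{13}u_2\ddot{u}_1 + \epsilon Y_{14}u_2\ddot{u}_2 \\
&+ 2\epsilon Z_{11}F\Omega\,u_1\cos\Omega t + 2\epsilon Z_{12}F\Omega\,u_2\cos\Omega t = 2FK_1\cos\Omega t
\end{aligned}
\tag{48}
$$

$$
\begin{aligned}
\ddot{u}_2 &+ 2\epsilon\zeta_2\dot{u}_2 + \omega_2^2 u_2 + \epsilon X_{21}(\dot{u}_1)^2 + \epsilon X_{22}\dot{u}_1\dot{u}_2 + \epsilon X_{23}(\dot{u}_2)^2 \\
&+ \epsilon Y_{21}u_1\ddot{u}_1 + \epsilon Y_{22}u_1\ddot{u}_2 + \epsilon Y_{23}u_2\ddot{u}_1 + \epsilon Y_{24}u_2\ddot{u}_2 \\
&+ 2\epsilon Z_{21}F\Omega\,u_1\cos\Omega t + 2\epsilon Z_{22}F\Omega\,u_2\cos\Omega t = 2FK_2\cos\Omega t
\end{aligned}
\tag{49}
$$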

The constants $X_{ij}$, $Y_{ij}$, and $Z_{ij}$ are defined in reference [10]. The two cases of particular interest
are $\Omega \approx \omega_1$ and $\Omega \approx \omega_2$, which correspond to direct excitation of the first mode and of
the second mode, respectively.

First  Mode  Excitation 

For  the  case  of  an  internal  resonance  and  excitation  to  the  first  mode,  the  detuning 
parameters  are  defined  as 

$$ \omega_2 = 2\omega_1 + \epsilon\sigma_1 \quad\text{and}\quad \Omega = \omega_1 + \epsilon\sigma_2 . \tag{50} $$

From a multiple-scales perturbation analysis, the following amplitude- and phase-modulation
equations are obtained:

$$ \dot{a}_1 + \mu_1 a_1 - a_1 a_2\sin\gamma_1 - F\sin\gamma_2 = 0 , \tag{51} $$

$$ a_1\dot{\alpha}_1 + a_1 a_2\cos\gamma_1 + F\cos\gamma_2 = 0 , \tag{52} $$

$$ \dot{a}_2 + \mu_2 a_2 + a_1^2\sin\gamma_1 = 0 , \tag{53} $$

$$ a_2\dot{\alpha}_2 + a_1^2\cos\gamma_1 = 0 . \tag{54} $$

$F$ is proportional to the amplitude of excitation and

$$ \gamma_1 = \sigma_1(\epsilon t) - 2\alpha_1 + \alpha_2 , \tag{55} $$

$$ \gamma_2 = \sigma_2(\epsilon t) - \alpha_1 . \tag{56} $$

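The steady-state solutions correspond to

$$ \dot{a}_1 = \dot{a}_2 = 0 \quad\text{and}\quad \dot{\gamma}_1 = \dot{\gamma}_2 = 0 , \tag{57} $$

and, when imposed on equations (51)-(56), yield

$$ \dot{\alpha}_1 = \sigma_2 , \tag{58} $$

$$ \dot{\alpha}_2 = 2\sigma_2 - \sigma_1 , \tag{59} $$

$$ a_2 = a_1^2 \big/ \left[\mu_2^2 + (2\sigma_2 - \sigma_1)^2\right]^{1/2} , \tag{60} $$

$$ a_1^6 + 2\left[\mu_1\mu_2 - \sigma_2(2\sigma_2 - \sigma_1)\right]a_1^4 + \left[\mu_2^2 + (2\sigma_2 - \sigma_1)^2\right]\left[(\sigma_2^2 + \mu_1^2)\,a_1^2 - F^2\right] = 0 . \tag{61} $$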
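
As a numerical illustration of the steady-state relations, equation (61) can be treated as a cubic in $a_1^2$: its real, positive roots give the admissible first-mode amplitudes, and $a_2$ then follows from (60). The sketch below assumes NumPy is available and that the damping is nonzero. The function name `first_mode_steady_state`, the tolerance, and the parameter values in the usage line are hypothetical and are not taken from the report.

```python
import numpy as np


def first_mode_steady_state(mu1, mu2, sigma1, sigma2, F, tol=1e-9):
    """Return a sorted list of (a1, a2) steady-state pairs from (60)-(61).

    Equation (61) is solved as a cubic in z = a1**2.  Only real, positive
    roots are kept, because a1 is an amplitude.
    """
    nu = 2.0 * sigma2 - sigma1           # shorthand for (2*sigma2 - sigma1)
    delta = mu2**2 + nu**2               # bracket shared by (60) and (61)
    # z^3 + 2[mu1*mu2 - sigma2*nu] z^2 + delta (sigma2^2 + mu1^2) z - delta F^2 = 0
    coeffs = [
        1.0,
        2.0 * (mu1 * mu2 - sigma2 * nu),
        delta * (sigma2**2 + mu1**2),
        -delta * F**2,
    ]
    branches = []
    for z in np.roots(coeffs):
        if abs(z.imag) < tol and z.real > 0.0:
            a1 = float(np.sqrt(z.real))
            a2 = float(a1**2 / np.sqrt(delta))   # equation (60)
            branches.append((a1, a2))
    return sorted(branches)


# Illustrative (non-report) values: heavy damping, perfect tuning, weak forcing.
print(first_mode_steady_state(1.0, 1.0, 0.0, 0.0, 0.1))
```

With these illustrative values only one branch is returned. Where the frequency-response curve overhangs, the cubic can have up to three positive roots, and each would still need the stability check described in the text.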

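The stability of these solutions is obtained by perturbing each steady-state solution and studying
the behavior of the disturbance. Substituting

$$ a_1 = a_{10} + a_{11} , \quad a_2 = a_{20} + a_{21} , \tag{62} $$

$$ \gamma_1 = \gamma_{10} + \gamma_{11} , \quad \gamma_2 = \gamma_{20} + \gamma_{21} \tag{63} $$

into equations (51)-(54) yields

$$ \dot{a}_{11} + (\mu_1 - a_{20}\sin\gamma_{10})\,a_{11} - a_{10}\sin\gamma_{10}\,a_{21} - a_{10}a_{20}\cos\gamma_{10}\,\gamma_{11} - F\cos\gamma_{20}\,\gamma_{21} = 0 , \tag{64} $$

$$ (\sigma_2 + a_{20}\cos\gamma_{10})\,a_{11} + a_{10}\cos\gamma_{10}\,a_{21} - a_{10}a_{20}\sin\gamma_{10}\,\gamma_{11} - a_{10}\dot{\gamma}_{21} - F\sin\gamma_{20}\,\gamma_{21} = 0 , \tag{65} $$

$$ 2a_{10}\sin\gamma_{10}\,a_{11} + \dot{a}_{21} + \mu_2 a_{21} + a_{10}^2\cos\gamma_{10}\,\gamma_{11} = 0 , \tag{66} $$

$$ 2a_{10}\cos\gamma_{10}\,a_{11} + (2\sigma_2 - \sigma_1)\,a_{21} + a_{20}\dot{\gamma}_{11} - a_{10}^2\sin\gamma_{10}\,\gamma_{11} - 2a_{20}\dot{\gamma}_{21} = 0 . \tag{67} $$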

Since these equations are linear and have constant coefficients, it follows that

$$ \dot{a}_{n1} = \lambda a_{n1} \quad\text{and}\quad \dot{\gamma}_{n1} = \lambda\gamma_{n1} \quad (n = 1, 2) , \tag{68} $$

where $\lambda$ is an eigenvalue of the coefficient matrix obtained by substituting (68) into (64)-(67). Any
solution of (58)-(61) is stable if the real parts of all eigenvalues are less than zero.
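
The eigenvalue test can be sketched numerically by writing the disturbance equations (64)-(67) in the form $M\dot{x} + Kx = 0$ with $x = [a_{11}, a_{21}, \gamma_{11}, \gamma_{21}]$, so that the eigenvalues are those of $-M^{-1}K$. The sketch below assumes NumPy, nonzero steady-state amplitudes, and steady-state phases recovered from (51)-(54) with the rates (58)-(59). The function names and the numerical values in the usage line are hypothetical.

```python
import numpy as np


def first_mode_eigenvalues(a10, a20, mu1, mu2, sigma1, sigma2):
    """Eigenvalues of the disturbance equations (64)-(67) about (a10, a20).

    F*sin(gamma20) and F*cos(gamma20) are obtained from (51)-(52) at steady
    state, so the forcing amplitude F does not have to be passed in.
    """
    nu = 2.0 * sigma2 - sigma1
    s1 = -mu2 * a20 / a10**2                  # sin(gamma10), from (53)
    c1 = -nu * a20 / a10**2                   # cos(gamma10), from (54)
    Fs2 = mu1 * a10 - a10 * a20 * s1          # F*sin(gamma20), from (51)
    Fc2 = -(sigma2 * a10 + a10 * a20 * c1)    # F*cos(gamma20), from (52)

    # Coefficients of the time derivatives, one row per equation (64)-(67).
    M = np.array([
        [1.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0, -a10],
        [0.0, 1.0, 0.0, 0.0],
        [0.0, 0.0, a20, -2.0 * a20],
    ])
    # Coefficients of the undifferentiated disturbances.
    K = np.array([
        [mu1 - a20 * s1, -a10 * s1, -a10 * a20 * c1, -Fc2],
        [sigma2 + a20 * c1, a10 * c1, -a10 * a20 * s1, -Fs2],
        [2.0 * a10 * s1, mu2, a10**2 * c1, 0.0],
        [2.0 * a10 * c1, nu, -a10**2 * s1, 0.0],
    ])
    return np.linalg.eigvals(-np.linalg.solve(M, K))


def is_stable(eigenvalues):
    """True when every eigenvalue has a negative real part."""
    return bool(np.all(np.real(eigenvalues) < 0.0))


# Illustrative (non-report) values; a20 = a10**2 satisfies (60) for these parameters.
print(is_stable(first_mode_eigenvalues(0.1, 0.01, 1.0, 1.0, 0.0, 0.0)))
```

The amplitudes passed in should satisfy (60) and (61); the routine does not verify this, and $M$ becomes singular if either amplitude is zero.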

Second  Mode  Excitation 

For the case of internal resonance and an excitation to the second mode, we put

$$ \omega_2 = 2\omega_1 + \epsilon\sigma_1 \quad\text{and}\quad \Omega = \omega_2 + \epsilon\sigma_2 \tag{69} $$

and obtain the following amplitude- and phase-modulation equations:

$$ \dot{a}_1 + \mu_1 a_1 - a_1 a_2\sin\gamma_1 = 0 , \tag{70} $$

$$ a_1\dot{\alpha}_1 + a_1 a_2\cos\gamma_1 = 0 , \tag{71} $$

$$ \dot{a}_2 + \mu_2 a_2 + a_1^2\sin\gamma_1 - F\sin\gamma_2 = 0 , \tag{72} $$

$$ a_2\dot{\alpha}_2 + a_1^2\cos\gamma_1 + F\cos\gamma_2 = 0 . \tag{73} $$

$F$ is again proportional to the excitation amplitude and

$$ \gamma_1 = \sigma_1(\epsilon t) - 2\alpha_1 + \alpha_2 , \tag{74} $$

$$ \gamma_2 = \sigma_2(\epsilon t) - \alpha_2 . \tag{75} $$

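The steady-state solutions correspond to

$$ \dot{a}_1 = \dot{a}_2 = 0 \quad\text{and}\quad \dot{\gamma}_1 = \dot{\gamma}_2 = 0 , \tag{76} $$

and, when substituted into (70)-(73), yield

$$ a_1\left(\mu_1 - a_2\sin\gamma_1\right) = 0 , \tag{77} $$

$$ a_1\left[\tfrac{1}{2}(\sigma_1 + \sigma_2) + a_2\cos\gamma_1\right] = 0 , \tag{78} $$

$$ \mu_2 a_2 + a_1^2\sin\gamma_1 - F\sin\gamma_2 = 0 , \tag{79} $$

$$ \sigma_2 a_2 + a_1^2\cos\gamma_1 + F\cos\gamma_2 = 0 . \tag{80} $$

There are two possible solutions to these equations: the solution which corresponds to the linear
problem, given by

$$ a_1 = 0 \quad\text{and}\quad a_2 = F \big/ \sqrt{\sigma_2^2 + \mu_2^2} , \tag{81} $$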

and  the  nonlinear  solution  which  corresponds  to 
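
The stability of the solutions for a direct excitation to the second mode is obtained in a
similar fashion. Disturbances are defined in (62) and (63); the governing equations are found to
be

$$ \dot{a}_{11} - a_{10}\sin\gamma_{10}\,a_{21} - a_{10}a_{20}\cos\gamma_{10}\,\gamma_{11} = 0 , \tag{84} $$

$$ a_{10}\cos\gamma_{10}\,a_{21} - \tfrac{1}{2}a_{10}\dot{\gamma}_{11} - a_{10}a_{20}\sin\gamma_{10}\,\gamma_{11} - \tfrac{1}{2}a_{10}\dot{\gamma}_{21} = 0 , \tag{85} $$

$$ 2a_{10}\sin\gamma_{10}\,a_{11} + \dot{a}_{21} + \mu_2 a_{21} + a_{10}^2\cos\gamma_{10}\,\gamma_{11} - F\cos\gamma_{20}\,\gamma_{21} = 0 , \tag{86} $$

$$ 2a_{10}\cos\gamma_{10}\,a_{11} + \sigma_2 a_{21} - a_{10}^2\sin\gamma_{10}\,\gamma_{11} - a_{20}\dot{\gamma}_{21} - F\sin\gamma_{20}\,\gamma_{21} = 0 . \tag{87} $$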

The behavior of the eigenvalues $\lambda$ determines the stability of the $a_j$. The nature of the response
predicted  from  these  equations  will  be  discussed  later. 

4.2  Experimental  Analysis 

Experiments  were  performed  on  a  multiple-degree-of-freedom  (MDOF)  structure.  The 
structure used for these experiments is shown in Figure 20; it has been studied by several
researchers [6, 7, 9, 10]. It was chosen because it was easy to fabricate and could be easily tuned (to
create  an  internal  resonance)  by  adjusting  the  lengths  of  the  beams  and  position  of  m2.  The 
laboratory setup is shown schematically in Figure 21. The shaker was an Unholtz-Dickie model
200, which has a capacity of 1100-lb force. This ensured that there was no feedback distortion to
the shaker table from the structure. Data were obtained from strain gages attached to the structure
and  conditioned  by  a  bridge  amplifier.  The  signals  were  monitored  on  a  digital  oscilloscope  and 
captured  by  a  Hewlett-Packard  data  acquisition  system  attached  to  a  PC.  The  experiments 
consisted  of  the  following: 

a)  random  excitation  to  determine  resonant  frequencies  and  damping  coefficients 

b)  harmonic  excitation  at  constant  frequency  and  amplitude  to  determine  steady-state 
response  levels 

c)  harmonic  excitation  at  constant  amplitude  of  acceleration  and  small  changes  in  the 
frequency  of  excitation  to  effectively  cause  a  slow  frequency  sweep 

d)  harmonic  excitation  at  constant  frequency  and  small  changes  in  the  amplitude  of 
excitation  to  effectively  cause  a  slow  amplitude  sweep 

e)  impulse  response  resulting  from  a  sudden  release  from  a  deformed  position 
approximating  the  first  mode  shape,  the  second  mode  shape,  and  a  combination  of  the 
two 

f)  impulse  response  resulting  from  an  impact  to  the  lower  mass. 

The  nature  of  the  response  was  quite  complicated  at  times,  so  a  variety  of  the  above  techniques 
was  used  throughout  the  experimental  phase. 

The  procedure  used  for  applying  damping  treatment  to  the  SDOF  structure  was  used  for 
the  MDOF  structure:  thin  strips  of  0.10-inch  width  damping  treatment  were  applied  in  increasing 
numbers  to  each  beam,  increasing  the  damping  of  the  structure. 

The responses of this structure, to both impulse and harmonic excitation, were very
complicated. The primary source of the quadratic modal-coupling nonlinearity in this structure
was the asymmetrical geometry. When an internal resonance existed, the nonlinear
coupling was significantly intensified and would dominate the response; when this happened, the
response amplitudes were three to four times larger than the linear response. The internal-resonance
detuning is a measure of the difference between the frequency of the second mode and two times the
frequency of the first mode; the detuning parameter is defined by equation (50) or (69). Two
amounts of internal detuning were attempted: nearly perfect tuning and slightly positive detuning.

Because  the  results  of  the  SDOF  experiments  showed  that  a  parametric  resonance  could 
be  suppressed  with  a  small  amount  of  damping  treatment,  the  experiments  on  the  MDOF 
structure  were  performed  at  very  high  levels  of  excitation.  Studies879 10  have  shown  that  under 
some conditions, nonlinear responses can be achieved with very small levels of excitation (on the
order of 20 milli-g). In the experiments performed here, the excitation levels were on the order
of 100 milli-g, five times that required to elicit a nonlinear response. This large excitation level
was  used  to  extend  the  range  for  which  damping  treatment  could  be  applied;  in  other  words, 
more  damping  treatment  could  be  applied  before  the  nonlinear  motion  was  expected  to  be 
suppressed. 

For  both  the  tuned  and  slightly  detuned  models,  damping  treatment  was  applied  in  five 
steps.  With  each  application  of  damping  treatment  the  model  was  adjusted  (i.e.,  the  length  of  the 
lower  beam)  to  return  it  to  the  original  amount  of  detuning.  This  was  a  painstaking  procedure 
because  it  sometimes  required  more  than  a  dozen  adjustments  in  the  length  of  the  lower  beam 
just  to  get  the  results  reported  here. 

The  labels  in  the  subsequent  figures  correspond  to  the  following  amounts  of  damping 
treatment: unmarked or "0" for none, "I" for one strip on each beam, "II" for two strips on
each beam (17% of the surface area), "III" for four strips on each beam, "IV" for six strips on
each beam (50%), and "V" for 100% coverage.


(Nearly)  Perfectly  Tuned  Structure 

The  structure  shown  in  Figure  20  was  tuned  such  that  the  first  resonance  occurred  at 
4.012  Hz  and  the  second  resonance  occurred  at  8.040  Hz;  this  corresponded  to  an  internal 
resonance detuning of +0.016 Hz, or +0.20% (of the second resonant frequency). This was as
close as practically possible to perfect tuning on a real structure; a tuning closer to 0% could be
accomplished at the expense of more painstaking effort (i.e., moving the lower beam into the
clamped support in 0.001-inch increments and repeating the experiments until the desired
detuning  is  achieved).  After  each  application  of  damping  treatment,  the  model  was  retuned  to  get 
nearly  0.0%  detuning. 
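
The detuning figures quoted above follow directly from the two measured resonant frequencies. A minimal sketch of that arithmetic is given below, assuming the percentage is taken relative to the second resonant frequency as stated; the function name `internal_detuning` is hypothetical.

```python
def internal_detuning(f1_hz, f2_hz):
    """Return (detuning in Hz, detuning as a percentage of f2) for a 2:1 internal resonance."""
    detuning_hz = f2_hz - 2.0 * f1_hz
    return detuning_hz, 100.0 * detuning_hz / f2_hz


# Resonant frequencies reported for the nearly perfectly tuned structure.
hz, pct = internal_detuning(4.012, 8.040)
print(f"{hz:+.3f} Hz, {pct:+.2f}%")   # prints "+0.016 Hz, +0.20%"
```

The same function can be applied after each retuning step to check how far the structure has drifted from the target detuning.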

The  frequency  response  function  for  four  of  these  cases  is  shown  in  Figures  22  through 
26.  The  structure  was  forced  with  low-level  random  excitation;  the  input  acceleration  was 
measured  by  an  accelerometer  mounted  on  the  base  clamp  and  the  response  displacement  was 
measured  by  the  strain  gages.  The  roots  were  estimated  by  a  complex  exponential  curve  fit. 
The frequency-response curve reconstructed from the estimated roots is shown as a dashed
line in the figures. The accuracy of the curve fit is verified by the closeness to the experimental
data  near  resonance.  The  curves  do  show  a  trend  of  increasing  damping  as  evidenced  by  the 
increased  width  of  the  resonance  peaks.  The  estimates  of  the  natural  frequencies  and  damping 
coefficients  are  listed  in  the  figure  captions  and  summarized  in  Table  2  (Appendix  A). 
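
The link between peak width and damping noted above can be illustrated with the half-power bandwidth estimate, $\zeta \approx (f_b - f_a)/(2 f_n)$, where $f_a$ and $f_b$ are the frequencies at which the magnitude falls to $1/\sqrt{2}$ of its peak. This is a simpler stand-in and not the complex exponential curve fit used in the report. The sketch assumes NumPy, a single isolated peak that lies well inside the frequency grid, and a synthetic single-mode response; the function names and the damping ratio of 0.01 are assumptions chosen for illustration.

```python
import numpy as np


def sdof_magnitude(freq, fn, zeta):
    """Displacement-FRF magnitude of a linear single-mode oscillator (unit static gain)."""
    r = freq / fn
    return 1.0 / np.sqrt((1.0 - r**2) ** 2 + (2.0 * zeta * r) ** 2)


def half_power_damping(freq, mag):
    """Estimate (fn, zeta) of one isolated peak from its half-power bandwidth."""
    k = int(np.argmax(mag))
    target = mag[k] / np.sqrt(2.0)
    lo = k
    while lo > 0 and mag[lo] > target:             # walk left to the first sample at or below target
        lo -= 1
    hi = k
    while hi < len(mag) - 1 and mag[hi] > target:  # walk right likewise
        hi += 1
    # Linear interpolation between the two samples that bracket each crossing.
    f_a = np.interp(target, [mag[lo], mag[lo + 1]], [freq[lo], freq[lo + 1]])
    f_b = np.interp(target, [mag[hi], mag[hi - 1]], [freq[hi], freq[hi - 1]])
    fn = freq[k]
    return float(fn), float((f_b - f_a) / (2.0 * fn))


# Synthetic peak placed at the first resonance of the tuned structure (4.012 Hz).
freq = np.linspace(3.5, 4.5, 20001)
fn_est, zeta_est = half_power_damping(freq, sdof_magnitude(freq, 4.012, 0.01))
print(fn_est, zeta_est)
```

The estimate is only meaningful for light damping and well-separated modes; closely spaced or strongly coupled peaks call for a multi-mode fit such as the one the report describes.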

The  frequency  responses  for  the  structure  without  treatment  and  for  three  of  the  five 
cases  of  damping  treatment  are  shown  in  Figures  27  and  28.  This  3-D  perspective  plot  aids  in 
visualizing  the  qualitative  and  quantitative  changes  caused  by  the  increase  in  the  damping.  The 
frequency  axis  is  shown  normalized  with  respect  to  the  excitation  frequency  because  the  addition 
of  the  damping  treatment  caused  the  frequencies  to  shift  slightly.  The  response  amplitude 
(displacement of m1) is represented in units corresponding to the ratio of the lower mass
displacement amplitude to the length of the lower beam. This scaling provides a convenient
comparison  of  all  of  the  results  for  all  of  the  experiments.  For  example,  the  peak  amplitude  of  the 
response  of  the  first  mode  is  approximately  8%  of  the  length  of  the  lower  beam. 

When  the  first  mode  is  directly  excited  with  a  large-amplitude  excitation  at  a  frequency 
near  the  first  resonance,  the  response  is  nonlinear,  as  shown  in  Figure  27.  Figure  27(a)  shows 
the  first  mode  response  and  Figure  27(b)  shows  the  second  mode  response.  The  response  of 
the untreated structure shows the most interesting behavior. As the frequency of excitation is
increased from 0.90, the amplitude initially grows until the second mode is also excited; this
happens at a frequency near 0.97. As the frequency is increased further, the amplitude of the
first mode decreases dramatically, by a factor of almost four. When a Hopf bifurcation occurs near a
frequency of 1.0, no steady-state responses are possible. This region is denoted by dots, which
represent an "average" amplitude. Actual bounds on the modulation are shown later in the
amplitude response curves. As the frequency is increased further, the amplitude of response
increases until it jumps down (at the first mode frequency of 1.072) to the linear response
amplitude. The linear response consists only of the directly excited mode; no other modes are
present. If the frequency is decreased from above, the jump up occurs at a lower frequency (i.e.,
1.044) than did the jump down during a sweep up. This indicates an overhang or double-valued
steady-state response. Theory shows that these two solution branches are connected by an
unstable solution branch; this is shown by the dashed line on the response (note that there are no
data points for this dashed line because they cannot be realized in the laboratory). Further
decreases  in  the  excitation  frequency  cause  the  response  to  follow  the  same  path  as  that 
followed  during  the  sweep  up.  The  frequency  response  is  almost  symmetrical;  because  it  is 
slightly  skewed  to  the  right,  it  indicates  that  a  slight  positive  detuning  of  the  internal  resonance  is 
present. 

The  frequency  response  curve  for  the  undamped  structure  shows  four  distinct  types  of 
motions  that  are  possible  when  the  excitation  frequency  is  near  the  first  mode:  (1)  the  linear 
solution  where  only  the  directly  excited  mode  participates  in  the  response,  (2)  a  region  where 
nonlinear coupling is present and causes two steady-state solutions to coexist (the linear one
and a nonlinear one), (3) only a nonlinear response where the modes achieve one steady-state
amplitude,  and  (4)  a  region  bounded  by  a  Hopf  bifurcation  where  no  steady-state  solutions  exist. 

As  damping  treatment  was  applied  to  the  structure,  response  amplitudes  were 
attenuated.  Certain  trends  can  be  observed  in  Figure  27.  The  second  application  of  damping 
treatment  [II]  consisted  of  two  strips  on  each  beam  (17%).  Both  response  peaks  are  significantly 
attenuated.  The  peak  at  a  frequency  below  the  resonance  is  almost  completely  eliminated  while
the  response  peak  at  the  upper  frequency  is  reduced  in  amplitude  sufficiently  to  eliminate  the 
hysteresis.  The  fourth  application  of  damping  treatment  [IV]  consisted  of  six  strips  (50%)  and 
caused  further  peak  broadening  and  smoothing,  resulting  in  a  frequency  response  that  was  so 
broad  that  it  appeared  as  if  it  had  a  large  damping  coefficient.  The  fifth  application  (100%)  almost 
completely  eliminated  the  modal  coupling;  however,  when  one  compares  the  harmonic  response 
(i.e.,  Figure  27)  to  the  stochastic  response  (Figure  26),  it  is  obvious  that  linear-based  modal 
analysis  techniques  (i.e.,  random  excitation)  cannot  be  used  to  identify  this  type  of  nonlinearity. 
This figure shows that damping treatment can eliminate the nonlinear hysteretic behavior and
attenuate the amplitude of the response.

A  direct  excitation  to  the  second  mode  is  shown  in  Figure  28.  Similar  observations  can 
be  made  from  these  curves.  The  frequency  axes  have  been  nondimensionalized  with  respect  to 
the excitation frequency; hence the second mode has a resonance at 1.0 and the first mode,
representing  a  frequency  nearly  one-half  that  of  the  excitation,  has  a  resonance  at  2.0 
(representing  a  subharmonic  resonance  of  order  1/2).  The  indirectly  excited  mode  (i.e.,  the  first 
mode)  remains  trivial  until  it  is  strongly  (and  nonlinearly)  coupled  to  the  second  mode;  when  it  is, 
the  amplitude  of  the  second  mode  is  drastically  attenuated.  Furthermore,  during  a  sweep  down, 
there is no jump up in the second mode response, only a jump down, because the lower branch
merges with the upper branch; it is not a turning point bifurcation as was the case for a direct
excitation  to  the  first  mode  as  seen  in  Figure  27.  The  first  mode  response  demonstrates  both  a 
jump  up  and  a  jump  down  during  a  frequency  sweep.  For  the  fifth  application  of  damping 
treatment,  the  nonlinear  coupling  between  the  modes  is  completely  suppressed;  the  end  result  is 
that  the  system  behaves  like  a  moderately  damped  linear  system. 

Experiments  were  also  conducted  to  measure  the  amplitude  response  at  selected 
frequencies  of  excitation;  these  curves  are  cuts  across  a  particular  frequency  response  curve 
shown  in  Figures  27  and  28.  For  the  case  of  no  damping  treatment,  a  cut  at  a  frequency  of 
1.004 (first-mode excitation) passes through the modulation region. The amplitude response at
this frequency is shown in Figure 29. As the amplitude of excitation is increased, the system
experiences a Hopf bifurcation and begins to modulate; i.e., energy begins to flow back and forth
between the two modes. As the amplitude of excitation is increased further, the "average" amplitude
tends to increase and the maximum excursions increase.

At  a  frequency  of  1.067  the  first-mode  frequency  response  curve  (for  no  damping 
treatment) in Figure 27(a) shows a double-valued steady-state amplitude. An amplitude sweep
through this region, as shown in Figure 30, reveals a jump phenomenon. When starting on the
lower  branch,  increasing  the  amplitude  of  excitation  causes  a  jump  up  to  the  upper  branch; 
decreasing  the  amplitude  of  excitation  causes  a  jump  down  at  a  lower  value  of  the  amplitude  of 
excitation. 

When the maximum damping treatment was applied, there was essentially one response.
Figure 31 shows an amplitude sweep through this resonance at a frequency of 0.979. Although
the  curves  are  quite  tame,  they  still  show  nonlinear  coupling  because  the  second  mode  is 
excited.  Even  though  the  excitation  is  driving  the  first  mode,  some  of  the  energy  is  channeled 
into  the  second  mode. 

Amplitude-excitation  experiments  were  also  performed  for  a  direct  excitation  to  the 
second mode. Figure 32 shows a cut across the 0.998 frequency line (the 1.999 frequency line
for  the  first  mode)  in  Figure  28  for  the  structure  with  no  damping  treatment  applied.  Although  the 
second  mode  is  directly  excited,  its  response  is  attenuated;  instead,  the  energy  goes  into  the  first 
mode  which  responds  like  a  parametrically  excited  SDOF  system.  An  amplitude  sweep  at  the 
1.057  frequency  line  (2.119  for  the  first  mode)  in  the  region  of  the  overhang  is  shown  in  Figure 
33.  During  the  experiment,  the  amplitude  of  the  excitation  was  increased  slowly.  At  0.32  g's 
rms,  the  linear  solution  became  unstable.  However,  the  divergence  growth  rate  was  so  slow  that 
it  was  possible  to  stay  on  the  unstable  branch  long  enough  to  locate  some  of  the  equilibrium 
points.  These  are  shown  in  the  figure  as  isolated  circles.  However,  by  waiting  long  enough,  the 
response  went  to  the  steady-state  nonlinear  solution  which  is  shown  as  a  solid  line.  During  a 
sweep  down,  the  response  followed  the  upper  curve  rather  than  the  lower  curve.  At  an  excitation 
level of 0.075 g's rms, the response jumped down to the linear response on the lower branch and
remained there. Note that the overhang region in Figure 28 is qualitatively different from the
overhang  region  in  Figure  27;  hence,  it  is  not  surprising  to  see  a  qualitative  difference  in  the 
double  solution  region  of  Figures  30  and  33. 

When  damping  treatment  was  applied  to  the  structure,  the  regions  of  multiple  solutions 
contracted  and  eventually  disappeared.  For  small  levels  of  excitation  the  linear  solution 
appeared; this is shown in Figure 34. At this frequency of excitation (0.992 for the second mode,
1.991 for the first mode), when the excitation level exceeded 0.09 g's, the second mode
saturated. The energy that was put into the system at the second-mode frequency was channeled into the
first mode; essentially, the second mode was parametrically exciting the first mode.
Hence, the first mode exhibits a parametric-type response.
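
The saturation described here can be illustrated with the steady-state conditions (77)-(80). For $a_1 \neq 0$, (77) and (78) fix $a_2 = [\mu_1^2 + \tfrac{1}{4}(\sigma_1 + \sigma_2)^2]^{1/2}$ independently of $F$, and eliminating $\gamma_2$ from (79) and (80) gives a quadratic in $a_1^2$. That closed form is derived for this sketch from (77)-(80) and is not quoted from the report. The function name, the choice of the larger root, and the parameter values are assumptions.

```python
import math


def second_mode_response(mu1, mu2, sigma1, sigma2, F):
    """Return (a1, a2) for direct excitation of the second mode.

    The coupled (a1 != 0) solution of (77)-(80) is returned when it exists;
    otherwise the linear solution (81) is returned.
    """
    nu = 0.5 * (sigma1 + sigma2)
    a2_sat = math.hypot(mu1, nu)                   # from (77) and (78) with a1 != 0
    disc = F**2 - (mu1 * sigma2 + mu2 * nu) ** 2   # discriminant of the quadratic in a1**2
    if disc >= 0.0:
        a1_sq = (sigma2 * nu - mu1 * mu2) + math.sqrt(disc)   # larger root
        if a1_sq > 0.0:
            return math.sqrt(a1_sq), a2_sat
    return 0.0, F / math.hypot(sigma2, mu2)        # equation (81)


# Illustrative (non-report) values with perfect tuning: the coupled branch appears for F > mu1*mu2.
for F in (0.2, 1.0, 2.0):
    print(F, second_mode_response(0.5, 1.0, 0.0, 0.0, F))
```

Below the threshold only the second mode responds and its amplitude grows linearly with $F$; above it, $a_2$ stays fixed while the additional energy appears in $a_1$, which is the qualitative behavior reported for Figure 34. The sketch does not assess which branch is stable where both exist.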


Slightly  Detuned  Structure 

Experiments  for  a  slightly  detuned  structure  were  also  performed.  When  the  damping 
treatment  was  applied,  experiments  were  performed  for  both  the  tuned  and  detuned  case.  One 
way  to  accomplish  the  same  detuning  was  to  increase  the  mass  of  m2;  this  was  accomplished  by 
adding a small screw to m2. In this case, the resonances for the undamped structure occurred at
3.750 Hz and 7.750 Hz, which corresponds to a detuning of +0.25 Hz, or +3.2%. The results of
these experiments are qualitatively the same as those for the tuned structure. Had more time been
available, additional experiments with even more detuning would have been performed. This amount
of detuning would have yielded more significant skewing if a smaller excitation level had been used.
Increased internal-resonance
detuning  causes  increased  skewing  of  the  frequency-response  curves  and  in  some  cases 
increases  the  region  where  modulated  responses  occur. 

The  frequency  response  functions  for  low-level  random  excitation  for  the  five  cases  are 
shown  in  Figures  35  through  40.  The  trends  here  are  similar  to  those  of  the  tuned  structure.  The 
frequency  responses  to  harmonic  excitation  are  shown  in  Figures  41  and  42.  The  features  are 
qualitatively  the  same  as  the  tuned  structure,  except  that  there  is  more  skewing.  Had  the
detuning  been  greater,  the  skewing  would  have  been  more  pronounced.  There  is  one  noticeable 
difference:  in  Figure  41(b)  the  steady-state  response  amplitudes  for  the  undamped  (O)  case 
appear  to  be  discontinuous  before  and  after  the  Hopf  bifurcation.  The  theoretical  response 
curves  will  show  why  this  is  the  case.  The  qualitative  behavior  for  increased  application  of 
damping  treatment  is  the  same  as  that  for  the  tuned  case. 

Experiments  to  measure  the  amplitude  response  at  a  fixed  frequency  were  also 
performed  for  the  slightly  detuned  structure.  These  amplitude-response  curves  represent  a  cut 
across a particular frequency-response curve at a fixed frequency (3rd axis representing the
amplitude  of  the  excitation).  The  frequencies  were  chosen  to  represent  different  qualitative 
regions  in  the  frequency-response  curves.  For  the  case  of  no  damping  treatment,  an  amplitude 
sweep  at  a  frequency  of  3.82  Hz  is  shown  in  Figure  43;  this  frequency  corresponds  to  a 
nondimensional  frequency  of  0.9941  for  the  first  mode  shown  in  Figure  41(a)  and  0.4922  for  the 
second  mode  in  Figure  41(b)  and  is  central  to  the  modulation  region.  The  amplitude  sweep 
shows  nonlinear  coupling  immediately  and  at  about  0.05g  excitation,  a  Hopf  bifurcation  takes 
place  that  causes  the  modulation  between  the  modes.  Since  the  motion  was  quite  large,  the 
experiments  were  not  continued  for  larger  amplitudes  of  excitation  because  of  fear  of  the 
structure  failing  prematurely.  The  amplitude  response  for  a  frequency  of  excitation  in  the 
overhang  region  on  the  right  side  is  shown  in  Figure  44.  The  amplitude  response  for  the 
maximum  amount  of  damping  treatment  applied  to  the  structure  is  shown  in  Figure  45.  Even 
though  the  entire  structure  is  covered  with  damping  treatment,  nonlinear  modal  coupling  still 
occurs.  The  response  amplitudes  are  attenuated. 

When the second mode was excited, an unusual behavior was observed. The frequency
response for a direct excitation to the second mode was shown in Figure 42. When an excitation
frequency of 7.45 Hz (1.9388 for the first mode and 0.9600 for the second mode) was used, the
response was steady state (i.e., constant modal amplitudes) with modal coupling as shown in
Figure 46. This frequency corresponds to the region just left of the modulation region where
modal  interaction  is  quite  aggressive. 

When  the  frequency  was  increased  slightly  to  move  into  the  modulation  region,  an 
interesting  behavior  was  observed,  as  shown  in  Figure  47.  Initially,  mode  2  responds  and  mode 
1  is  trivial,  which  is  essentially  the  linear  response.  When  the  system  bifurcates  (i.e.,  Hopf),  the 
modes  begin  to  interact  and  exchange  energy  as  previously  observed  when  the  first  mode  was 
excited.  However,  as  the  amplitude  of  excitation  is  increased  further,  there  is  an  amplitude  of 
excitation at which the modal interaction ceases! This may have been possible in the previous
cases,  but  since  the  amplitudes  of  response  were  so  large,  experiments  at  the  larger  amplitudes 
of  excitation  were  not  performed.  The  nature  of  the  "average"  amplitudes  (indicated  by  the 
dashed  line)  shown  in  Figure  47  suggests  that  the  second  mode  is  saturated  and  the  first  mode  is 
autoparametrically  excited. 

If the frequency of excitation is increased further, to the right of the
modulation region and into the overhang region, two steady-state responses are possible:
the  linear  and  the  nonlinear.  These  responses  are  shown  in  Figure  48.  The  figure  shows  an 
isolated  data  point  to  the  right  of  the  bifurcation.  This  point  is  the  steady-state  response 
immediately  after  the  excitation  amplitude  was  increased.  Theoretical  analysis  shows  that  this 
point  is  unstable.  By  waiting  long  enough  (about  10  minutes),  the  response  did  go  to  the  stable 
steady-state  amplitude  predicted  by  theory.  This  experiment  shows  that  patience  is  required 
when performing experiments on nonlinear structures because some stable solutions require
much more time than one might think to be realized in the laboratory.

When  damping  treatment  was  applied  to  the  structure,  the  response  was  generally 
attenuated.  The  trends  were  shown  in  the  frequency  domain  in  Figure  42.  For  the  maximum 
amount of damping treatment, the amplitude response is shown in Figure 49. This figure shows
the same behavior as discussed previously.

4.3 Discussion of Results and Comparison with Theory

The  experimental  results  were  compared  with  the  theoretical  results  using  values  of  the 
damping  coefficients  and  natural  frequencies  shown  in  Table  2  (Appendix  A)  obtained  from  the 
random  excitation  experiments  and  complex  exponential  curve  fits.  The  response  of  the 
structure  to  a  direct  excitation  of  the  first  mode  is  given  by  equations  (60)  and  (61);  typical  results 
are  shown  in  Figure  50.  Two  additional  theoretical  curves  were  generated  for  increasing 
amounts  of  damping;  these  are  shown  in  Figures  51  and  52.  These  curves  also  capture  the 
qualitative  behavior  seen  in  the  experimental  results. 

The  steady-state  response  for  a  direct  excitation  to  the  second  mode  is  predicted  by 
equations  (77)-(80).  Curves  were  generated  for  the  response  to  a  direct  excitation  to  the  second 
mode. The case of no damping treatment "O" is shown in Figure 53, case "III" is shown in Figure
54,  and  case  "V"  is  shown  in  Figure  55.  Again,  the  qualitative  features  are  captured  by  the 
theory,  including  the  nature  of  the  bifurcations.  Apparently  the  experiment  had  more  detuning 
than  was  measured  because  the  theoretical  curves  are  more  symmetrical  than  the  experimental 
curves. 

For  the  tuned  structure,  the  results  are  qualitatively  the  same.  Typical  results  for  a  direct 
excitation  to  the  first  and  second  mode  are  shown  in  Figures  56  and  57,  respectively. 

5.0  SUMMARY  and  CONCLUSIONS


Analysis  and  experiments  were  performed  to  study  the  effect  of  commercially  available 
viscoelastic  damping  treatment  on  parametric  and  autoparametric  resonances  in  nonlinear 
systems.  Both  SDOF  and  MDOF  structures  were  used;  they  consisted  of  flexible  beams  and 
masses. 

For the SDOF structure, experiments were conducted at several levels of excitation, and
it was found that the damping treatment was particularly effective in reducing, and even
entirely suppressing, the resonance. The first-order theory predicted the region of the loss
of trivial stability very well, but failed to predict the softening behavior of beam 2 and the
closing (i.e., merging of the two branches) of the frequency response curves.

For  the  2DOF  system,  the  particular  case  of  an  internally  resonant  structure  was 
considered.  The  qualitative  and  quantitative  effects  of  the  damping  treatment  were  determined 
for  very  large  excitation  levels;  it  was  observed  that  increased  amounts  of  damping  treatment 
reduce  the  nonlinear  modal  coupling  and  modulation  regions  and  reduce  the  amplitude  of
response.  For  the  case  of  direct  excitation  to  the  second  mode,  a  large  amount  of  damping 
treatment  was  capable  of  eliminating  the  nonlinear  coupling. 

Further  research  should  investigate  flat  plates  subject  to  parametric  excitation.  The 
formulation  for  the  problem  would  be  similar  to  that  of  the  SDOF  beam,  but  would  allow  multi-modal
interaction  similar  to  the  2DOF  model. 
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APPENDIX  A 


TABLE 1. Dimensions and properties of SDOF beam and lumped mass.

Beam                    1              2
Length (mm)             125.40         360.36
Width (mm)              15.85          19.16
Thickness (mm)          0.559          1.206
[label missing]         6.599x10^-2    2.805
E (N/mm)                0.209x10^6     0.202x10^6
r (gm/mm)               7.762x10^-3    0.1798
n (sec^-1)              1.469x10^-3    1.207x10^-3

Lumped Mass
Length (mm)             31.75          31.50
Height (mm)             9.53           19.32
Width (mm)              9.05           25.02
Mass (gm)               25.40          60.04
[label illegible]       1969           6670
d (mm)                  76.20          346.1
Natural Freq (Hz)       15.038         3.462
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Table  2.  Natural  frequencies  and  damping  coefficients  estimated  from  complex  exponential 
curve  fits  to  the  frequency  response  data  for  low-level  random  excitation  of  the  2DOF  tuned  and 
slightly  detuned  structure. 


Beam: ω1, ω2, [remaining column headings illegible]

Tuned-0: 4.012, 8.040, 0.410, 0.016, 0.191, 0.004, 0.012, 0.022
Tuned-I: 4.091, 8.050, 0.569, 0.022, 0.396, 0.008, 0.006, 0.020
Tuned-II: 4.062, 8.161, 0.485, 0.019, 0.531, 0.010, 0.010, 0.022
Tuned-III: 4.131, 8.220, 0.476, 0.018, 0.863, [illegible], 0.011, 0.026
Tuned-IV: 4.175, 8.283, 0.593, 0.023, 1.135, 0.022
Tuned-V: 4.278, 8.620, 1.425, 0.053, 1.731, 0.007, 0.011
Detuned-0: 3.750, 7.750, 0.183, 0.008, 0.195, 0.004, 0.010, 0.027
Detuned-I: 3.748, 7.802, 0.289, 0.012, 0.318, [illegible], 0.010, 0.024
Detuned-II: 3.795, 7.846, 0.387, 0.016, [illegible], 0.009, 0.024
Detuned-III: 3.863, 7.861, 0.507, 0.021, 0.782, 0.016, 0.009, 0.027
Detuned-IV: 3.906, 8.186, 0.742, 0.031, 1.162, 0.012
Detuned-V: 4.005, 8.324, 0.723, 0.029, 1.560, 0.030, 0.008, 0.023

Frequency  (Hz),  damping  (percent  of  critical) 


*  Estimated  from  sinusoidal-dwell  data  because  random  excitation  data  unavailable. 
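
The detuning percentages quoted in the figure captions can be related to the measured
natural frequencies. The following is a minimal sketch, assuming (the definition is not
stated in this part of the report) that detuning is measured from the 2:1 internal
resonance relative to the second natural frequency, 100 (ω2 - 2 ω1) / ω2. Under that
assumption the Tuned-0 row of Table 2 gives about +0.20%, and the curve-fit frequencies
quoted in the captions of Figures 35 and 40 give about +3.06% and +3.76%, which match the
values quoted in several captions. The function name is hypothetical.

```python
def detuning_percent(f1_hz, f2_hz):
    """Percent detuning from a 2:1 internal resonance.

    Assumed definition (not taken from the report): 100 * (f2 - 2*f1) / f2.
    """
    return 100.0 * (f2_hz - 2.0 * f1_hz) / f2_hz


# Frequency pairs (Hz) taken from Table 2 and from the captions of Figures 35 and 40.
cases = {
    "Tuned-0 (Table 2)": (4.012, 8.040),
    "Detuned-0 (Figure 35)": (3.7599, 7.7576),
    "Detuned-V (Figure 40)": (4.0054, 8.3241),
}

for label, (f1, f2) in cases.items():
    print(f"{label}: {detuning_percent(f1, f2):+.2f}% detuning")
```

A positive value means the second natural frequency lies above twice the first.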
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Appendix  B 


I 


f>1  =  (^22^11  '  ^12^12)  ^  ^  • 


h2  =  ((<22^12  '  ^12^22)  ^  ^  ■ 


^3  =  (^11^12*^12^11^^  ■ 


h4  «  (kn/22  '  ^12^12)  ^  ^ 


D  = 


2^1  +  cos  £  (L-d)cosh  £  (L  - d)J  , 


where 


kt1  =  sin  £  (L-d)  +  sinh  £  (L-d) 


k  k 

k12  =  cos  £  (L-d)  +  cosh  £  (L-d) 


k^  =  sin  £  (L  -  d)  +  sinh  £  (L-d) 


lu  =  -  sin  k  -  sinh  k 
/12  =  -  cos  k  -  cosh  k 


=  sin  k  -  sinh  k  . 


A  = 


mn  _  m2i 

m12  1^22 


where 


(B1) 

(B2) 

(B3) 

(B4) 

(B5) 

(B6) 

(B7) 

(B8) 

(B9) 

(BIO) 

(B11) 

(B12) 
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Figure  1 .  Structure  used  for  the  SDOF  experiments  consisting  of  a  flexible  beam  carrying  a 
lumped  mass.  Constrained-layer  damping  treatment  was  applied  in  thin  0.10-inch  strips  on  both 
sides  of  the  beam  to  incrementally  increase  the  level  of  damping  in  the  structure.  The  base  of 
the  beam  was  clamped  to  a  shaker  head  that  oscillated  in  the  vertical  direction. 

Figure 2. Variation of the steady-state amplitude a* with the nondimensional frequency of
excitation of beam 1: (solid) stable, (dashed) unstable, α = 0.0808, κ1 = 0.4163, κ2 = 0.1716,
ζ = 0.1716, ν = 0.2577, f = 0.01653 (0.230-g level), ε = 1.00.


Figure 3. Variation of the steady-state amplitude a* with the amplitude of excitation f in region
II of Figure 2: (solid) stable, (dashed) unstable, φ = 2.000.
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Table  Acceleration  (center  of  table) 

Figure 5. Variation of the steady-state amplitude a* with the frequency of excitation φ for beam
1 for two levels of excitation amplitude: 0.230 g's and 0.300 g's. The diamond at "a" denotes the
loss of stability of the trivial solution for the 0.300-g acceleration level and the diamond at "b" for
the 0.230-g acceleration level. These diamonds represent the extent of penetration into the
linear unstable region of the trivial solution.


Figure 6. Variation of the amplitude a* with the amplitude of excitation f for φ = 2.000 for
beam 1, shown in Figure 5.

Figure 7. Variation of the steady-state amplitude a* with the nondimensional frequency of
excitation of beam 1: (solid) stable, (dashed) unstable, α = 0.3869, κ1 = 1.8201, κ2 = 0.7379,
ζ = 0.001207, ν = 0.9585.

Figure 8. Variation of the steady-state amplitude a* with the frequency of excitation φ for the
SDOF beam 2 (before damping treatment was applied) for two levels of excitation amplitude:
0.25 g's (▽) and 0.35 g's (o) rms. The bending of the curves to the left was neither expected nor
predicted; it indicates that the system is softening.


Figure 9. Variation of the steady-state amplitude a* with the frequency of excitation φ for two
levels of excitation amplitude for beam 2 with one strip of damping treatment applied: 0.25 g's
(△) and 0.35 g's (o) rms.

Figure 10. Variation of the steady-state amplitude a* with the frequency of excitation φ for two
levels of excitation amplitude for beam 2 with two strips of damping treatment applied: 0.25 g's
(△) and 0.35 g's (o) rms.
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Figure  11.  Bifurcation  diagram  showing  the  instability  regions  in  the  excitation-amplitude  vs. 
excitation  frequency  domain  of  the  principal  parametric  resonance  for  beam  2  before  any 
damping  treatment  was  applied.  The  curve  on  the  far  left  represents  the  boundary  caused  by  the 
overhang  observed  in  Figure  8.  In  this  case,  it  represents  a  sub-critical  instability. 

Figure 12. Bifurcation diagram showing the instability regions of the principal parametric
resonance for beam 2 with one strip of damping treatment. The curve on the far left represents
the boundary caused by the overhang observed ...

Figure  13.  Bifurcation  diagram  showing  the  instability  regions  of  the  principal  parametric 
resonance  for  beam  2  with  two  strips  of  damping  treatment.  The  curve  on  the  far  left  represents 
the  boundary  caused  by  the  overhang  observed  in  Figure  10. 
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Figure  14.  Bifurcation  diagram  summarizing  the  region  where  nontrivial  responses  exist  for 
three  levels  of  damping.  Increased  amounts  of  damping  treatment  cause  the  instability  region  to
migrate  away  from  the  frequency  axis  and  simultaneously  contract. 
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Figure  16.  Bifurcation  boundaries  showing  the  loss  of  stability  of  the  trivial  solution  as  predicted 
by  the  theory.  The  damping  coefficients  were  determined  from  the  free  decays  shown  in  Figure 
15  using  ERA.

Figure 17. Comparison of the bifurcation boundary indicating the loss of stability of the trivial
solution as predicted by theory and measured during the experiment for the beam with no
damping treatment applied: (solid) theory, (△, o) experiment.

Figure 18. Comparison of the bifurcation boundary indicating the loss of stability of the trivial
solution as predicted by theory and measured during the experiment for the beam with one strip
of damping treatment applied: (solid) theory, (△, o) experiment.

Figure 19. Comparison of the bifurcation boundary indicating the loss of stability of the trivial
solution as predicted by theory and measured during the experiment for the beam with two strips
of damping treatment applied: (solid) theory, (△, o) experiment.


Figure  20.  Dual  beam-mass  structure  used  for  the  MDOF  experiments.  By  adjusting  the  length 
of  the  lower  beam  and  the  position  of  the  second  mass,  the  first  and  second  resonant 
frequencies  could  be  adjusted.  The  higher  modes  and  the  out-of-plane  modes  were  not  excited
during  the  experiments;  hence,  a  2DOF  mathematical  model  can  be  used  to  adequately  describe 
the  observed  behavior. 
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Figure  22.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  tuned 
structure  before  damping  treatment  was  applied  [0].  The  estimated  natural  frequencies  are 
4.018  and  8.060  Hz  and  the  damping  coefficients  are  0.016  and  0.004,  respectively.  The 
excitation  was  low-level  random. 


Figure  23.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  tuned 
structure  for  the  first  application  of  damping  treatment  [I].  The  estimated  natural  frequencies  are 
4.091  and  8.050  Hz  and  the  damping  coefficients  are  0.022  and  0.008,  respectively.  The 
excitation  was  low-level  random.
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Figure  24.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  tuned 
structure  for  the  second  application  of  damping  treatment  [II].  The  estimated  natural  frequencies 
are  4.062  and  8.161  Hz  and  the  damping  coefficients  are  0.019  and  0.010,  respectively.  The 
excitation  was  low-level  random. 
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Figure  25.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  tuned 
structure  for  the  third  application  of  damping  treatment  [III].  The  estimated  natural  frequencies 
are  4.131  and  8.220  Hz  and  the  damping  coefficients  are  0.018  and  0.017,  respectively.  The 
excitation  was  low-level  random. 
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Figure  26.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  tuned 
structure  for  the  fifth  application  of  damping  treatment  [V].  The  estimated  natural  frequencies  are 
4.278  and  8.620  Hz  and  the  damping  coefficients  are  0.053  and  0.032,  respectively.  The 
excitation  was  low-level  random. 
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first  mode:  (a)  shows  the  first  mode  response  and  (b)  shows  the 
second  mode  response.  Although  attempts  were  made  to  have  a
perfect  internal  resonance,  there  is  a  small  amount  of  detuning  as 
evidenced  from  the  unsymmetrical  nature  of  the  response  curves. 
The  arrows  indicate  jumps. 


Figure  29.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.20%  detuning  and 
no  damping  treatment  to  a  direct  excitation  to  the  first  mode  at  a  frequency  of  1.004:  (a)  first
mode  amplitude,  (b)  second  mode  amplitude.  This  frequency  corresponds  to  the  modulation 
region  in  Figure  27.  The  curves  approximate  the  bounds  on  the  modulation. 
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Figure  30.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.20%  detuning  and 
no  damping  treatment  to  a  direct  excitation  to  the  first  mode  at  a  frequency  of  1.067:  (a)  first 
mode  amplitude,  (b)  second  mode  amplitude.  This  frequency  corresponds  to  the  double-valued
region  in  Figure  27  (to  the  right  of  the  modulation  region). 
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Figure  31.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.29%  detuning  and 
maximum damping treatment (case [V]) to a direct excitation to the first mode at a frequency of
0.979: (a) first mode amplitude, (b) second mode amplitude. This frequency corresponds to the
resonance region in Figure 27 [V]. These curves should be compared to Figure 27 to see the
effect of the damping treatment: it completely suppresses the modulation and attenuates the
amplitude of the response.
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Figure  32.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.20%  detuning  and 
no  damping  treatment  to  a  direct  excitation  to  the  second  mode  at  a  frequency  of  0.998:  (a)  first 
mode  amplitude,  (b)  second  mode  amplitude.  This  frequency  corresponds  to  the  single-valued 
resonance  region  in  Figure  28. 
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Figure  33.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.20%  detuning  and 
no  damping  treatment  to  a  direct  excitation  to  the  second  mode  at  a  frequency  of  1.057:  (a)  first
mode  amplitude,  (b)  second  mode  amplitude.  This  frequency  corresponds  to  the  overhang 
region  in  Figure  28. 
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Figure  34.  Experimental  amplitude  response  of  the  2DOF  structure  with  +0.29%  detuning  and 
maximum  damping  treatment  (case  [V])  to  a  direct  excitation  to  the  second  mode  at  a  frequency
of  0.992:  (a)  first  mode  amplitude,  (b)  second  mode  amplitude.  This  frequency  corresponds  to 
the  peak  in  Figure  28  (V). 
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Figure  35.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned  structure  before  damping  treatment  was  applied  (case  O).  The  estimated  natural 
frequencies  are  3.7599  and  7.7576  Hz  and  the  damping  coefficients  are  0.008  and  0.004, 
respectively.  The  excitation  was  low-level  random. 
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Figure  36.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned structure for damping treatment I. The estimated natural frequencies are 3.7484 and
7.8015 Hz and the damping coefficients are 0.012 and 0.007, respectively. The excitation was
low-level random.



Figure  37.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned  structure  for  damping  treatment  II.  The  estimated  natural  frequencies  are  3.7954  and 
7.8460  Hz  and  the  damping  coefficients  are  0.016  and  0.010,  respectively.  The  excitation  was 
low-level  random. 
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Figure  38.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned  structure  for  damping  treatment  III.  The  estimated  natural  frequencies  are  3.8633  and 
7.8607  Hz  and  the  damping  coefficients  are  0.021  and  0.016,  respectively.  The  excitation  was 
low-level  random. 

Figure  39.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned structure for damping treatment IV. The estimated natural frequencies are 3.9063 and
8.1856 Hz and the damping coefficients are 0.030 and 0.023, respectively. The excitation was
low-level random.


Figure  40.  Frequency  Response  Function  and  complex  exponential  curve  fit  for  the  slightly 
detuned  structure  for  damping  treatment  V.  The  estimated  natural  frequencies  are  4.0054  and 
8.3241  Hz  and  the  damping  coefficients  are  0.029  and  0.030,  respectively.  The  excitation  was 
low-level  random.

Figure 43. Experimental amplitude response of the 2DOF structure with +3.06% detuning and
no  damping  treatment  to  a  direct  excitation  to  the  first  mode  at  a  frequency  of  0.9941  (for  the  first 
mode  and  0.4922  for  the  second  mode):  (a)  first  mode  amplitude,  (b)  second  mode  amplitude. 
This  frequency  corresponds  to  the  modulation  region  in  Figure  41.  The  curves  approximate  the 
bounds  on  the  modulation. 

Figure 44. Experimental amplitude response of the 2DOF structure with +3.06% detuning and
no damping treatment to a direct excitation to the first mode at a frequency of 1.0488 (for the first
mode and 0.5165 for the second mode): (a) first mode amplitude, (b) second mode amplitude.
This frequency corresponds to the overhang region in Figure 41.
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Figure  45.  Experimental  amplitude  response  of  the  2DOF  structure  with  +3.76%  detuning  and 
maximum damping treatment (case V) to a direct excitation to the first mode at a frequency of
0.9866 (for the first mode and 0.4747 for the second mode): (a) first mode amplitude, (b) second
mode amplitude. This frequency corresponds to the resonance region in Figure 41 [V].

Figure 46. Experimental amplitude response of the 2DOF structure with +3.06% detuning and
no damping treatment (case O) to a direct excitation to the second mode at a frequency of
0.9600 (for the second mode and 1.9388 for the first mode): (a) first mode amplitude, (b) second
mode amplitude. This frequency corresponds to the region just left of the modulation region in
Figure 42 [O].

Figure 47. Experimental amplitude response of the 2DOF structure with +3.06% detuning and
no damping treatment (case O) to a direct excitation to the second mode at a frequency of
0.9909 (for the second mode and 2.0012 for the first mode): (a) first mode amplitude, (b) second
mode amplitude. This frequency corresponds to the modulation region in Figure 42 [O]. Note that
the modulation can be suppressed with sufficiently large amplitudes of excitation; however, with
the large amplitude of excitation to the second mode we get a large response of the first mode
because the second mode is essentially saturated.

Figure 48. Experimental amplitude response of the 2DOF structure with +3.06% detuning and
no damping treatment (case O) to a direct excitation to the second mode at a frequency of
1.0309 (for the second mode and 2.0819 for the first mode): (a) first mode amplitude, (b) second
mode amplitude. This frequency corresponds to the overhang region to the right of the
modulation region in Figure 42 [O]. The arrows indicate jumps. The isolated point to the right of
the bifurcation indicates the temporary stationary value achieved by the structure immediately
after the amplitude of excitation was increased. Because this response is unstable, the structure
eventually settled to the stable nonlinear response.

Figure 49. Experimental amplitude response of the 2DOF structure with +3.76% detuning and
maximum damping treatment (case V) to a direct excitation to the second mode at a frequency of
1.0309 (for the second mode and 2.0819 for the first mode): (a) first mode amplitude, (b) second
mode amplitude. This frequency corresponds to the peak in Figure 42 [V].
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hence  these  could  never  be  realized  in  an  experiment. 
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III:  (a)  first  mode  response,  (b)  second  mode  response,  ( — )  stable,  ( — )  unstable.  The  values  of  the  damping  coefficients  and 
frequencies  were  obtained  from  the  complex  exponential  curve  fit  shown  in  Figure  38: 
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detuned  structure  before  any  damping  treatment  was  applied:  (a)  first  mode  response,  (b)  second  mode  response,  ( — )  stable,  ( — ) 
unstable.  The  theory  captures  the  qualitative  features  of  the  experimental  responses.  The  values  of  the  damping  coefficients  and  natural 
frequencies  were  obtained  from  the  complex  exponential  curve  fit  shown  in  Figure  35.  The  dashed  line  represents  unstable  solutions; 
hence  these  could  never  be  realized  in  an  experiment. 
detuned structure for the maximum amount of damping treatment applied: (a) first mode response, (b) second mode response, (solid)
stable, (dashed) unstable. The estimated values of the damping coefficients and the natural frequencies were obtained from the complex
exponential curve fit shown in Figure 40.
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before  any  damping  treatment  was  applied:  (a)  first  mode  response,  (b)  second  mode  response,  ( — )  stable,  ( — )  unstable.  The  theory 
captures  the  qualitative  features  of  the  experimental  responses.  The  values  of  the  damping  coefficients  and  natural  frequencies  were 
obtained  from  the  complex  exponential  curve  fit  shown  in  Figure  22.  The  dashed  line  represents  unstable  solutions;  hence  these  could 
never  be  realized  in  an  experiment. 

Figure 57. Theoretical frequency response for a direct excitation to the second mode (compare with Figure 28 [O]) for the tuned
structure before any damping treatment was applied: (a) first mode response, (b) second mode response, (solid) stable, (dashed) unstable.
The values of the damping coefficients and natural frequencies were obtained from the complex exponential curve fit shown in Figure 22.
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Abstract: 

The summer 1990 SFRP project is continued to include AM1 semi-empirical
calculations on odd-electron rigid-rod polymer model species.

The justification for the continued series of calculations is that evidence has
been found for odd-electron species in bulk samples of rigid-rod polymer
fibers. Torsional barriers calculated for neutral radical species are about 3
kcal/mol higher than those of the corresponding closed-shell species. Radical
cation and anion species have torsional barrier heights approximately four
times as great as the neutral closed-shell species. These effects are explained
in terms of the π bonding characteristics of the frontier orbitals.

Introduction: 

The rigid-rod polymer poly(p-phenylenebenzobisoxazole), PBO, and
its sulfur analog, poly(p-phenylenebenzobisthiazole), PBZT, have been found to
exhibit exceptional specific strength and modulus, thermooxidative stability,
and environmental resistance when made into films and fibers [1]. The
properties of these compounds are a consequence of the chain stiffness as
well as the molecular stability due to extensive conjugation along the
polymer chain.

At the molecular level, one aspect of chain stiffness is the barrier to
rotation about the carbon-carbon single bonds linking the phenylene group
to the aromatic heterocycle. Using a suitable model compound, this barrier
height can be calculated with semi-empirical molecular orbital techniques
by computing the molecular heat of formation at each of a series of
molecular conformations. Figure 1 shows two model compounds on which
such calculations have been recently reported [2,3]. In both cases heats of
formation were calculated at 10° intervals of rotation about the indicated
carbon-carbon bonds. It should be noted that in this figure the symbol X
stands for O, NH, or S.
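
The scan procedure described above can be sketched in a few lines: given a heat of
formation at each torsional angle, the barrier is the difference between the highest and
lowest values along the scan. This is only an illustration; the function name is
hypothetical, and the energies below are synthetic placeholder values generated from a
simple sin^2 profile, not results from the AM1 calculations.

```python
import math


def torsional_barrier(scan):
    """Return (angle_of_minimum, angle_of_maximum, barrier) for a torsional scan.

    `scan` maps torsional angle in degrees to heat of formation in kcal/mol.
    """
    angle_min = min(scan, key=scan.get)
    angle_max = max(scan, key=scan.get)
    return angle_min, angle_max, scan[angle_max] - scan[angle_min]


# Synthetic heats of formation at 10-degree steps from 0 to 90 degrees.
# The 50.0 kcal/mol baseline and 5.2 kcal/mol amplitude are placeholders.
scan = {
    angle: 50.0 + 5.2 * math.sin(math.radians(angle)) ** 2
    for angle in range(0, 100, 10)
}

angle_min, angle_max, barrier = torsional_barrier(scan)
print(f"minimum at {angle_min} deg, maximum at {angle_max} deg, "
      f"barrier = {barrier:.1f} kcal/mol")
```

With a 10° step the reported minimum and maximum angles are only resolved to the nearest
scan point.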

FIG 1. Model Compounds Used in Previous AM1 Calculations.


Figure 2 shows the model compound, again with X symbolizing O, NH,
or S, on which an extensive series of AM1 calculations was done during the
1990 SFRP project.


FIG 2. Model Compound Structure Used in Summer 1990 SFRP.

In the previous calculations the model compounds started in the planar
conformation and one segment of the molecule was rotated relative to the
original plane of the molecule, as indicated by the arrows in the figures. In
addition to that type of calculation we have examined the situation in which
one heterocyclic system is perpendicular to the plane of the remainder of the
molecule. This structure is shown in Figure 3.


FIG  3.  Perpendicular  Model  Compound 

Since dynamics calculations on PBO and PBZT model structures [4] indicate
that these materials are more flexible than the calculated torsional barrier
heights suggest, we decided to examine torsional barriers for model compounds
representing segments of the polymer chain that are perpendicular to one another.

Figure 4 shows the result of our AM1 calculations on the unsubstituted
PBO model compound.


Fig 4. Graphical Display of AM1 Calculations on PBO Model Compounds.

Note the very small rotational barrier shown in curve C. The relevance of this
result  is  as  follows.  When  a  given  polymer  link  rotates  90°  it  becomes 
perpendicular  to  adjacent  links  in  the  polymer.  According  to  our 
calculations,  the  links  adjacent  to  the  perpendicular  one  then  have  much 
greater  torsional  flexibility.  Table  I  lists  all  the  molecular  species  on  which 
calculations  were  carried  out  and  Table  II  summarizes  the  results  of  these 
calculations. 
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TABLE  I 


STRUCTURAL  KEY  FOR  TABLE  II 
SYMBOL  STRUCTURE 


Table  II  summarizes  the  results  of  these  calculations. 

TABLE II

NUMERICAL SUMMARY OF AM1 CALCULATIONS

STRUCTURAL   TORSIONAL ANGLE (DEG)    TORSIONAL ANGLE (DEG)    BARRIER
TYPE         MIN ENERGY               MAX ENERGY               (KCAL/MOL)
             X=O    X=NH   X=S        X=O    X=NH   X=S        X=O    X=NH   X=S

I            0      30     10         90     90     90         5.2    2.7    2.2
II           45     45     45         0,90   0,90   0,90       0.5    1.3    0.5
III          0      140    30         90     180    90         4      3.1    1.3
IV           20     150    60         90     0      0          1.7    4      1.6
V            40     50     70         90     0      0          1.7    7.1    1.7
VI           20     50     30         90     0      0          2.5    4.7    0.2
VII          80     50     90         90     0      0          4.2    8.5    3.7
VIII         140    130    45         0      180    0          1      3.1    1.1

In the current project we have continued these calculations to include
charged and uncharged odd-electron model species. The significance of
these new calculations relates to the chemical consequences of the
processing of synthetic PBO and PBZT. It has been reported [5] that the
mechanical stress involved in the processing causes radical formation, as
indicated by the presence of ESR signals in the processed polymer films. We
therefore consider whether such chemical alteration of the polymer would cause
a change in its stiffness.

In  Table  III  are  summarized  the  results  of  our  calculations  on  the 
closed-shell,  cation  radical,  and  anion  radical  PBO  and  PBZT  model  species. 

TABLE III

SUMMARY OF AM1 CALCULATIONS
ON MODEL PBO AND PBZT SPECIES

COMPOUND     BARRIER HEIGHT (KCAL/MOL)
             CLOSED SHELL    CATION RADICAL    ANION RADICAL

             5.2             18.8              21.8

             0.5  4.2

             0.3

             0.3  0  1.6

NEUTRAL RADICAL SPECIES
PLANAR  PERPENDICULAR

The substantial increase in barrier height for both the cation radicals
and the anion radicals can be understood in terms of the frontier orbitals for
these species. The HOMO is shown in Figure IV. This orbital is
antibonding across the sigma bond about which rotation is occurring. When
an electron is removed from this orbital (to make the cation radical), the
repulsion across the sigma bonds decreases and rotation about these bonds
is more difficult.

HOMO

Figure IV. HOMO for PBO Model Compound.

The LUMO is shown in Figure V. This orbital is π bonding across the C-C σ bond.
Partial occupancy of it, as occurs in the radical anion, increases the difficulty
of rotation about this sigma bond.

LUMO

Figure V. LUMO for PBO Model Compound.
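
The size of this increase can be checked with simple arithmetic on the first row of
Table III (closed shell 5.2, cation radical 18.8, anion radical 21.8 kcal/mol). The short
sketch below only divides those tabulated values; the variable names are illustrative.

```python
# Barrier heights (kcal/mol) from the first row of Table III.
closed_shell = 5.2
radical_ions = {"cation radical": 18.8, "anion radical": 21.8}

# Ratio of each radical-ion barrier to the closed-shell barrier.
ratios = {name: value / closed_shell for name, value in radical_ions.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f} times the closed-shell barrier")
```

Both ratios fall close to four, in line with the "approximately four times" statement in
the abstract.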

It was necessary to change our model species to model neutral radical
species in the polymer chain. Neutral radicals formed during processing of
the polymer would presumably be formed by breaking a C-C σ bond along the
backbone. Such radicals would be carbon-centered, with the carbon being
either at the end of the phenyl group or the heterocyclic group. The
structures shown in Table III model both of these possibilities for PBO and
PBZT. As can be seen from the data in Table III, the barrier to rotation in the
flat structures is about 3 kcal/mol higher than in closed-shell species. In the
perpendicular structure the barrier to rotation is essentially zero.

In summary, our calculations suggest that if the processing of these
rigid-rod polymers causes radical formation, the resulting polymer chain
segments will be less flexible, even though the radical formation could cause
the chain length to be reduced.

STRUCTURAL ANALYSIS OF POTENTIAL NLO CHROMOPHORES


by 

David  A.  Grossie,  Ph.D. 


ABSTRACT 

Crystallization studies on a collection of benzothiazole and tetrazine derivatives were
conducted, with the intent to produce crystals suitable for single-crystal X-ray
diffraction. The benzothiazole compounds are designed to model a single repeating unit
of a polymer chain, with an electron-withdrawing group available for attachment of a
pendant having potential nonlinear optical (NLO) properties. These compounds were
also examined via molecular mechanics to determine any intra-molecular interactions
that may affect the molecular packing in the crystal lattice. Single-crystal X-ray
diffraction data were collected on Bis(phenyl)tetrazine and the molecular structure
determined. The tetrazine derivative, a molecular analogue of para-terphenylene,
crystallizes in a monoclinic crystal lattice with cell constants of a = 5.448(2), b = 5.119(1),
c = 20.236(7) Å and β = 93.64(2)°. The observed space group is P2₁/n, a centric space
group. The structure was solved and refined, yielding an R-factor of 0.094. The
compound is planar with little distortion in the internal bond distances and angles.

I. INTRODUCTION:


The  Polymer  Branch  of  the  Materials  Laboratory  at  the  Wright  Aeronautical  Laboratory, 
Wright-Patterson  Air  Force  Base  and  the  Department  of  Chemistry  at  Wright  State 
University  are  interested  in  the  synthesis  and  characterization  of  polymeric  materials. 
Basic  research  is  also  conducted  in  the  structure  of  polymeric  materials  and  the 
correlation  of  the  structure  and  physical  properties.  The  emphasis  of  this  area  is  to 
predict  the  properties  of  a  polymer  prior  to  its  synthesis.  In  this  way,  the  synthesis 
problem  can  have  greater  direction  and  produce  new  and  better  materials  with  more 
efficiency.  One  of  the  techniques  used  in  determining  the  structure  of  polymers  is  to 
examine  by  single-crystal  X-ray  diffraction  methods  compounds  that  may  be  used  to 
form  the  backbone,  pendants,  or  cross-links  of  the  polymer.  By  knowing  the  structure  of 
a  small,  repeating  portion  of  the  polymer,  the  polymer  itself  may  be  mathematically 
modeled,  yielding  the  physical  properties.  The  synthesis  of  a  new  material  does  not 
always  produce  a  crystalline,  solid  compound,  so  crystallization  studies  are  used  to  find 
the  optimal  conditions  under  which  good  crystals  may  be  grown.  Once  the  crystalline 
material  has  been  produced  and  a  structure  determination  completed,  examination  of  the 
molecule  via  molecular  mechanics  will  isolate  the  intra-  and  inter-molecular  effects  in 
the  packing  of  the  molecule  in  the  crystal  lattice. 


II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT: 

A  study  of  model  compounds  of  polymeric  materials  that  have  potential  nonlinear  optical 
properties  will  be  conducted.  This  study  will  involve  the  structural  analysis  of 
compounds  by  single-crystal  x-ray  diffraction  techniques,  with  the  intent  to  amass  data 
which  may  be  used  to  correlate  the  observed  structure  and  the  magnitude  of  the  nonlinear 
optical  response.  The  primary  structural  information  that  is  needed  by  the  currently 
accepted  theories  is  the  centricity  of  the  crystal  lattice  in  which  the  compound  of  interest 
crystallizes and the extent of π-orbital conjugation. To this end, the study will examine
compounds with known nonlinear optical properties, and other conjugated systems with
separated electron donor and acceptor groups. In addition, the means of attaching the
NLO-active compound to a polymer chain will be analyzed through the
examination of polymeric models with available electron donor or acceptor
functionalities.

III.

As part of the research effort, attempts were made to grow crystals of various
benzothiazole, thiophene and tetrazine derivatives. In most cases, a small portion of the
compound was placed in a test tube, a small amount of the solvent added, and the
mixture heated to dissolve the solid material. In a few cases, attempts were made to
co-crystallize the compound with another compound that may (via dipole interactions) help
orient the molecules and enhance the crystallinity of the recrystallized material. The
results of the study are tabulated below, followed by the structural formulas of the
compounds studied.


Compound   Solvents                                  Results

#51        Ethanol                                   long, thin crystals
#53        Ethyl acetate                             large, thin platelets
#54        Toluene                                   amorphous powder
           DMF                                       amorphous powder
Thio #1    Ethanol                                   amorphous powder
           Ethanol/toluene-TPO co-crystallization    colorless long, thin crystals
           Ethanol-naphthol co-crystallization       no results
           Ethanol-phenol co-crystallization         no results
Thio #2    Ethyl acetate                             small, thin crystals
Tet #1     Ethyl acetate                             large red crystals
Tet #2     Not soluble

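[Structural formulas of the compounds studied, including #51, Thio #1 and Thio #2; drawings not reproduced. Recoverable fragments show ethyl ester (CO2C2H5), carboxylic acid (COOH) and decyloxy (OC10H21) substituents.]
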


IV. 

a.  Crystalline  samples  of  a  series  of  compounds  from  the  crystallization  study  were 
examined  using  an  optical  microscope  to  determine  the  size  and  quality  of  the  individual 
crystals.  One  of  the  compounds  examined  showed  promise  of  containing  suitable 
crystals  for  diffraction  analysis,  whereas  the  remainder  were  of  insufficient  size  and 
quality to be analyzed. A single crystal of para-Bis(phenyl)tetrazine (Tet #1) was
prepared  for  analysis  by  attaching  it  to  a  thin  glass  fiber  and  placing  it  at  the  center  of  an 
Enraf-Nonius  CAD-4  automated  diffractometer.  Preliminary  x-ray  analysis  of  the 
selected  crystal  was  made.  These  results  are  summarized  in  Table  1,  along  with  the 
parameters  of  the  subsequent  data  collection. 

b. Data collected on the crystalline sample were examined for the presence of
space-group-determining systematic absences using the program LOOK (Chapius, 1984). For
the sample, an appropriate space group was determined: P2₁/n.
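
The kind of check involved can be sketched as follows. For P2₁/n with b as the unique axis, the standard reflection conditions are h0l present only for h + l even (n-glide) and 0k0 present only for k even (2₁ screw axis). The sketch is not the LOOK program, whose internals are not described here; the function names, the 3σ cutoff and the sample intensities are hypothetical.

```python
def allowed_in_p21n(h, k, l):
    """Return True if reflection (h k l) is allowed in P2_1/n (unique axis b)."""
    if k == 0 and (h + l) % 2 != 0:        # n-glide: h0l requires h + l even
        return False
    if h == 0 and l == 0 and k % 2 != 0:   # 2_1 screw axis: 0k0 requires k even
        return False
    return True


def absence_violations(reflections, sigma_cutoff=3.0):
    """List reflections that should be absent but have I > cutoff * sigma(I)."""
    return [(h, k, l) for (h, k, l, intensity, sigma) in reflections
            if not allowed_in_p21n(h, k, l) and intensity > sigma_cutoff * sigma]


# Made-up (h, k, l, I, sigma) values for illustration only, not data from this study.
sample = [(1, 0, 0, 0.4, 0.5), (1, 0, 1, 250.0, 4.0),
          (0, 1, 0, 0.2, 0.6), (0, 2, 0, 90.0, 2.0)]
print(absence_violations(sample))
```

An empty list of violations is consistent with the proposed space group; any strong reflection in a forbidden class would argue against it.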
c. Using the direct methods routines contained in the XTAL system of crystallographic
programs (Grossie, 1991; Hall and Stewart, 1990), the initial structures were determined.
These structures were refined using CRYLSQ (Olthof-Hazekamp, 1990), a full-matrix
least-squares refinement program contained in XTAL (Hall and Stewart, 1990).

d. Figure 1 shows an ORTEP (Johnson, 1971; Davenport, Hall, and Dreissig, 1990)
drawing of the refined structure of p-Bis(phenyl)tetrazine. A listing of the interatomic
distances and angles is presented in Table 2. The molecule is essentially planar, with a
maximum deviation from planarity of 0.011(4) Å.

e. The molecule lies on an inversion center such that each atom at one end is related to
a second at the opposite end. Thus the attached phenyl rings are crystallographically identical.
The phenyl rings show typical distortions, with bond distances ranging from 1.366 to 1.402 Å
and averaging 1.387 Å. The bond angles show similar distortions, with values
ranging from 118.2 to 121.5° and averaging 120°. The tetrazine ring is slightly
distorted, with a widening of the bond angle at C (123.2°) and a narrowing at the two N
atoms (117.9 and 118.9°).
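
The quoted ranges and averages can be checked directly against the phenyl-ring entries of Table 2. The short sketch below does so; the dictionary layout and the helper name `summarize` are illustrative choices, while the numerical values are those listed in Table 2.

```python
from statistics import mean

# Phenyl-ring C-C distances (Å) and internal C-C-C angles (°) from Table 2.
ring_distances = {"C(4)-C(5)": 1.398, "C(4)-C(9)": 1.402, "C(5)-C(6)": 1.369,
                  "C(6)-C(7)": 1.401, "C(7)-C(8)": 1.385, "C(8)-C(9)": 1.366}
ring_angles = {"C(5)-C(4)-C(9)": 118.2, "C(4)-C(5)-C(6)": 120.8,
               "C(5)-C(6)-C(7)": 120.7, "C(6)-C(7)-C(8)": 118.2,
               "C(7)-C(8)-C(9)": 121.5, "C(4)-C(9)-C(8)": 120.5}


def summarize(values):
    """Return (minimum, maximum, mean) of a collection of numbers."""
    vals = list(values)
    return min(vals), max(vals), mean(vals)


d_min, d_max, d_mean = summarize(ring_distances.values())
a_min, a_max, a_mean = summarize(ring_angles.values())
print(f"distances: {d_min:.3f} to {d_max:.3f} Å, mean {d_mean:.3f} Å")
print(f"angles:    {a_min:.1f} to {a_max:.1f}°, mean {a_mean:.1f}°")
```

The six internal angles sum to 719.9°, close to the 720° required of a planar hexagon, which is consistent with the planarity noted above.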

V. 

As a prelude to the X-ray crystal structure analysis, possible molecular structures for
compound 54 were examined by molecular mechanics. The results for the four possible
conformers are tabulated below. Of interest in this compound are the minima observed
when changing the position of the sulfur and nitrogen atoms in the thiazole rings. As the
positions are changed, there is a change in the molecular planarity and the minimum
energy. In experimental determinations of the melting point of this compound, three
distinct melting points are observed. This suggests that three or perhaps all four of
these conformers exist, with the highest melting point corresponding to the lowest
molecular mechanical energy.

X1  X2  X3  X4  X5  X6   Distance   Distance   Torsion   Torsion   MMX
                         Xi-Xg      X2-X4      X4 (°)    X6 (°)    Energy

S   N   N   S   N   S    3.275      3.045       1.81      24.64    83.72
S   N   S   N   N   S    3.295      2.933      21.45     -20.27    85.89
S   N   S   N   S   N    3.126      2.889       3.87      16.02    85.49
S   N   N   S   S   N    3.058      3.009       3.37      14.16    82.95
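
To make the energy ordering explicit, the four MMX energies from the table above can be ranked relative to the lowest one. This is only a bookkeeping sketch: the labels are the X1 to X6 atom assignments as tabulated, and no energy units are assumed beyond those of the table.

```python
# (X1..X6 atom assignment, MMX energy) for the four arrangements tabulated above.
conformers = [("S N N S N S", 83.72),
              ("S N S N N S", 85.89),
              ("S N S N S N", 85.49),
              ("S N N S S N", 82.95)]

lowest = min(energy for _, energy in conformers)

# Energies relative to the most stable arrangement, sorted from low to high.
ranked = sorted(((label, round(energy - lowest, 2)) for label, energy in conformers),
                key=lambda item: item[1])

for label, delta in ranked:
    print(f"{label}   +{delta:.2f}")
```

The spread between the most and least stable arrangements is under 3 energy units, so several of them coexisting in a sample is plausible.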

VI.  RECOMMENDATIONS: 

There  is  a  need  to  improve  the  crystallization  techniques  used  in  recrystallizing  these 
compounds.  The  crystallization  study  needs  to  be  broadened  by  the  use  of  other  methods 
besides  solution  recrystallization.  Co-crystallization  methods  need  to  be  more  fully 
exploited,  using  a  wide  range  of  compounds  for  the  co-crystallization.  Co-crystallization 
has  a  great  deal  of  potential  for  producing  crystalline  material  of  compounds  that  have 
low melting points (close to room temperature) or low solubility. This technique also
provides a means by which a compound that crystallizes in a centrosymmetric lattice may
be crystallized in an acentric lattice, thus allowing the second-order NLO properties of
the compounds to be examined and used.

Additional  crystal  structures  are  needed  to  further  quantify  the  prediction  of  NLO 
properties  and  to  provide  initial  information  for  numerical  calculations  on  known  NLO 
active  materials. 

Currently,  there  are  three  factors  that  are  assumed  to  produce  the  desired  NLO  response, 
with  one  factor  being  quantitative  and  a  second  based  on  a  relative  scale.  These  two 
factors  are  the  centricity  of  the  crystal  lattice  and  the  electron-donating  and  withdrawing 
effects  of  the  commonly  used  functional  groups.  The  third  factor,  the  extent  of 
conjugation within the molecule, is currently unquantified. Molecular planarity is
normally taken as the first clue that a non-fused, π-bonded ring system is conjugated.
Since this information, like the centricity, is directly obtainable from the structural
analysis of a crystalline compound, the extent of conjugation can be quantified by this
process.

The  molecular  packing  of  a  molecule  within  a  crystal  lattice  is  of  great  importance  to  the 
observation of NLO properties in the bulk material. One factor that influences molecular
packing is the shape of the molecule, together with its intra-molecular interactions. These interactions
can be modelled easily using molecular mechanics, work that should be continued. The
inter-molecular attractions also influence the packing. The inter-molecular interactions
can be controlled using guest compounds that interfere with the attractive sites within a
molecule that cause them to come together in the solid state. To study these effects, the
co-crystallization  of  the  potential  NLO  active  material  and  an  appropriate  guest  is 
recommended. 

With the above pieces of data obtained and analyzed, the synthesis of nonlinear optical
materials can proceed by rational design. This will allow the physical properties of the
material to be optimized without compromising the desired nonlinear optical properties.

Table 1. Experimental Details for p-Bis(phenyl)tetrazine.

Formula:                  C14H10N4
Formula weight:           234.2
F(000):                   244
Crystal dimensions:       0.20 x 0.41 x 0.62 mm
Color:                    Red
Radiation:                Mo Kα
Wavelength:               0.71073 Å
Temperature:              23°
Crystal form:             monoclinic
Space group:              P2₁/n
Cell constants:           a = 5.448(2) Å
                          b = 5.119(1) Å
                          c = 20.236(7) Å
                          β = 93.64(2)°
Volume:                   563.2(3) Å³
Z:                        2
Density:                  1.38 g/cm³
Absorption coefficient:   0.95 cm⁻¹
Scan type:                ω/2θ
Scan rate:                1.20 - 5.58°/min
Scan width:               1.00 + 0.344 tan θ
Maximum 2θ:               70.0°
Reflections measured:     2827 total
                          2461 unique
Corrections:              Lorentz-polarization
                          Numerical absorption (range of T: 0.9619-0.9828)
                          Reflection averaging (agreement on I = 4.2%)
Observations:             973 with I > 3σ(I)
Parameters:               82
R:                        0.094
wR:                       0.098
Goodness-of-fit:          4.602
Maximum shift/error:      0.0009
Residual density
  maximum:                0.5 e/Å³
  minimum:                -0.5 e/Å³
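
Several entries of Table 1 are mutually dependent and can be cross-checked from the cell constants and the formula. The sketch below recomputes the monoclinic cell volume, the calculated density and F(000); the function names are illustrative, and the inputs are the tabulated values.

```python
import math

AVOGADRO = 6.02214076e23                   # 1/mol
ATOMIC_NUMBER = {"C": 6, "H": 1, "N": 7}


def monoclinic_volume(a, b, c, beta_deg):
    """Cell volume in Å^3 for a monoclinic cell: V = a * b * c * sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def calc_density(formula_weight, z, volume_a3):
    """Calculated density in g/cm^3 (1 Å^3 = 1e-24 cm^3)."""
    return z * formula_weight / (AVOGADRO * volume_a3 * 1e-24)


def f000(composition, z):
    """Number of electrons in the unit cell."""
    return z * sum(ATOMIC_NUMBER[el] * n for el, n in composition.items())


volume = monoclinic_volume(5.448, 5.119, 20.236, 93.64)
density = calc_density(234.2, 2, volume)
electrons = f000({"C": 14, "H": 10, "N": 4}, 2)
print(f"V = {volume:.1f} Å^3, density = {density:.2f} g/cm^3, F(000) = {electrons}")
```

The recomputed values agree with the tabulated volume, density and F(000), which supports the reading Z = 2 with the molecule on an inversion center.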
Table 2. Interatomic Bond Distances, Angles and Dihedral Angles for
p-Bis(phenyl)tetrazine.

Atoms                      Distance (Å)

N(1)-C(2)                  1.354(5)
N(1)-N(3')                 1.316(5)
C(2)-C(4)                  1.450(5)
C(2)-N(3)                  1.346(5)
C(4)-C(5)                  1.398(5)
C(4)-C(9)                  1.402(5)
C(5)-C(6)                  1.369(6)
C(5)-H(5)                  1.040(5)
C(6)-C(7)                  1.401(6)
C(6)-H(6)                  1.040(6)
C(7)-C(8)                  1.385(6)
C(7)-H(7)                  1.040(7)
C(8)-C(9)                  1.366(6)
C(8)-H(8)                  1.040(6)
C(9)-H(9)                  1.040(5)

Atoms                      Angle (°)

C(2)-N(1)-N(3')            117.9(3)
N(1)-C(2)-C(4)             117.9(3)
N(1)-C(2)-N(3)             123.2(4)
C(4)-C(2)-N(3)             118.9(3)
N(1')-N(3)-C(2)            118.9(3)
C(2)-C(4)-C(5)             120.7(3)
C(2)-C(4)-C(9)             121.1(3)
C(5)-C(4)-C(9)             118.2(4)
C(4)-C(5)-C(6)             120.8(4)
C(4)-C(5)-H(5)             119.1(4)
C(6)-C(5)-H(5)             120.0(4)
C(5)-C(6)-C(7)             120.7(4)
C(5)-C(6)-H(6)             119.8(5)
C(7)-C(6)-H(6)             119.4(5)
C(6)-C(7)-C(8)             118.2(4)
C(6)-C(7)-H(7)             120.9(5)
C(8)-C(7)-H(7)             120.8(5)
C(7)-C(8)-C(9)             121.5(4)
C(7)-C(8)-H(8)             119.0(5)
C(9)-C(8)-H(8)             119.4(5)
C(4)-C(9)-C(8)             120.5(4)
C(4)-C(9)-H(9)             119.2(5)
C(8)-C(9)-H(9)             120.3(4)

Atoms                      Dihedral Angle (°)

N(3')-N(1)-C(2)-C(4)       -179.6(3)
N(3')-N(1)-C(2)-N(3)          0.6(6)
C(2)-N(1)-N(3')-C(2')        -0.5(5)
N(1)-C(2)-C(4)-C(5)          -0.6(6)
N(1)-C(2)-C(4)-C(9)        -179.7(4)
N(3)-C(2)-C(4)-C(5)         179.3(4)
N(3)-C(2)-C(4)-C(9)           0.2(6)
C(2)-C(4)-C(5)-C(6)        -179.2(4)
C(2)-C(4)-C(5)-H(5)          -1.4(6)
C(9)-C(4)-C(5)-C(6)          -0.1(6)
C(9)-C(4)-C(5)-H(5)         177.8(4)
C(2)-C(4)-C(9)-C(8)         178.8(4)
C(2)-C(4)-C(9)-H(9)           1.1(7)
C(5)-C(4)-C(9)-C(8)          -0.3(6)
C(5)-C(4)-C(9)-H(9)        -178.0(5)
C(4)-C(5)-C(6)-C(7)          -0.2(7)
C(4)-C(5)-C(6)-H(6)        -177.2(5)
H(5)-C(5)-C(6)-C(7)        -178.1(5)
H(5)-C(5)-C(6)-H(6)           5.0(8)
C(5)-C(6)-C(7)-C(8)           0.9(7)
C(5)-C(6)-C(7)-H(7)         179.1(5)
H(6)-C(6)-C(7)-C(8)         177.8(5)
H(6)-C(6)-C(7)-H(7)          -4.0(8)
C(6)-C(7)-C(8)-C(9)          -1.3(7)
C(6)-C(7)-C(8)-H(8)         179.2(5)
H(7)-C(7)-C(8)-C(9)        -179.5(5)
H(7)-C(7)-C(8)-H(8)           1.0(8)
C(7)-C(8)-C(9)-C(4)           1.0(7)
C(7)-C(8)-C(9)-H(9)         178.7(5)
H(8)-C(8)-C(9)-C(4)        -179.5(5)
H(8)-C(8)-C(9)-H(9)          -1.8(8)

Figure 1. ORTEP drawing of p-Bis(phenyl)tetrazine. Non-hydrogen atoms are shown at
30% thermal probability. Hydrogen atoms are shown artificially small for
clarity.
Abstract.

This report presents an eddy-current test set-up designed and constructed at the
University of Kentucky during the seven-month period of the project. We have
demonstrated that second-layer cracks can be delineated to an accuracy of about 2
percent in materials of high conductivity using a long transmitter pulse scheme. The
pulse transmitter has been designed and built. In the theoretical analysis of
electromagnetic NDE, the method of moments has been applied to calculate the
variation of the probe impedance due to arbitrarily shaped defects in materials of high
conductivity. A detailed FORTRAN computer program was developed for the use of
this technique.

A small effort was devoted to studying the microstructural state of alumina ceramics
using the technique of Time-Domain Dielectric Spectroscopy. Preliminary results seem
promising for nondestructive evaluation (NDE) of ceramics. We plan to continue this
effort in detail during the course of next year.

INTRODUCTION

Eddy-current testing is used in the field of nondestructive evaluation to perform
in-service inspection of metal (conductor) products. An experienced operator can, in
general, determine the presence of surface flaws and of subsurface flaws, particularly in
nonmagnetic conductors, measure layer thickness, and perform metal sorting (by conductivity).
Despite its simplicity, the eddy-current method is currently the most useful technique for
detecting in-depth flaws, especially in materials of high electrical
conductivity such as metals. A companion technique is ultrasonic detection,
but the eddy-current method is non-contacting and the inspection can be carried
out quickly with no contamination of the surface. Nondestructive techniques such as
microwave scattering, although non-contacting, are restricted to surface
characterization because of the small skin depth of microwaves, especially in
materials of high conductivity.
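
The skin-depth argument can be made quantitative with the standard expression δ = sqrt(2 / (ω μ σ)) for a good conductor. The sketch below evaluates it at a few frequencies; the conductivity of 3.5e7 S/m (roughly that of aluminium) and the chosen frequencies are assumptions for illustration, not values taken from this report.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, H/m


def skin_depth(frequency_hz, conductivity_s_per_m, relative_permeability=1.0):
    """Standard skin depth in metres for a good conductor."""
    omega = 2.0 * math.pi * frequency_hz
    return math.sqrt(2.0 / (omega * MU_0 * relative_permeability
                            * conductivity_s_per_m))


SIGMA = 3.5e7  # S/m, assumed aluminium-like plate

for f in (100.0, 1.0e4, 2.0e5, 1.0e10):
    print(f"f = {f:10.3g} Hz   skin depth = {skin_depth(f, SIGMA) * 1e3:.4g} mm")
```

With these assumed values the penetration is several millimetres at 100 Hz but below a micrometre at 10 GHz, which is why low-frequency or long-pulse eddy-current excitation is needed to reach second-layer flaws.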

During  the  seven-month  period  of  the  minigrant,  the  following  tasks  have  been 
addressed: 

1. Development of an eddy-current pulse technique to explore second-layer cracks
in materials of high conductivity.

2. Development of a computer program for the method of moments analysis to
calculate the variation of probe impedance due to arbitrarily shaped defects in high
conductivity materials.

3. Preliminary study of the microstructural state of sintered alumina by the
technique of Time-Domain Dielectric Spectroscopy.


DETAILS  OF  THE  INVESTIGATION 


1.  EDDY-CURRENT  TEST  SET-UP 

In the literature, eddy-current test systems have been reported for different NDE
tasks [1],[2],[3]. Especially in the area of NDE of a flat metal plate, many contributions
have been published [4]-[8]. Although some significant advances have been made in
electromagnetic NDE, a pulse technique still needs to be developed in order to detect
second-layer cracks or flaws in materials of high conductivity. A pulsed eddy-current
test system for NDE of a conductor plate has been set up in our lab. The details are as
follows.

In order to set up an eddy-current system, a probe first needs to be designed
for the specific task. The important design parameters are the size of the probe (its diameter and
height), the number of turns and the wire diameter of the coils, the core material, and the arrangement of
the driving coil and receiving coil [9]. A probe has been
designed for the eddy-current test system with these factors in mind; some considerations in designing probes are
introduced as follows.

First, the probe designer should keep in mind which parameters will be measured
in electromagnetic NDE, that is, whether reflection or transmission measurement is
used, and whether impedances, output voltages or currents, or output waveforms will
be measured. In the case of reflection measurements, as shown in Fig. 1 (see page 15),
the probe generally consists of two coils, namely a driving coil and a receiving coil.
In through-transmission measurements, however, as shown in Fig. 2 (see page 15),
these two coils are built into two separate probes. In the reflection probe, the
receiving coil usually has two sub-coils with opposite winding so that when the
reflection probe is held in air, the output of the receiving coil is nearly zero,
i.e. self-zeroed. This makes it convenient to interpret measured data, since the outputs of the
receiving probe directly indicate the characteristics of the material under test. However,
it also complicates the design of a reflection probe, since a fine adjustment of the
coil locations must be made.

Our reflection probe has only one coil, because the impedance analyzer we
are using has a high resolution and can be easily zeroed or calibrated by a computer
through HP-IB. Our reflection probe can be used in both reflection and
through-transmission measurements.

In through-transmission measurements, the driving probe usually has the same size
as the receiving probe, and is moved from one point to another as the receiving probe
is moved. A mechanical scanner is therefore required to have two heads to hold and
move the two probes (or, alternatively, to move the heavy plate while the two probes are
fixed). Such a machine was difficult to design and build in the short period of the
grant, so an alternative needed to be investigated. A technique that solves this
problem has been developed in this report. A fixed coil configuration, as shown in Fig.
3 (see page 16), is used in place of a driving probe. In this
configuration, the coils are connected to each other in such a way that the current directions of
two adjacent coils are opposite, in order to enlarge the magnetic field between these
two coils. At first sight, the fact that the magnetic fields produced by these coil
currents may be non-uniform would seem to make it difficult to detect defects in the material under
test. Fortunately, however, this problem can be easily solved. The magnetic fields can
be not only calculated theoretically, but also calibrated with a standard metal plate.
These coils can be made using thick wire that can carry a large current, and
can have different numbers of turns for different purposes. It should also be noted that these coils should be
placed far from other materials of high conductivity to avoid disturbing their magnetic
fields.

The  diameter  of  a  wire  used  to  make  a  coil  depends  on  the  root  mean  square 
current  flowing  in  the  coil.  An  empirical  formula  as  follows  is  useful. 

d  >  -is-. 


where  d  is  the  diameter  of  wires  in  mm. 


The  number  of  turns  of  coils  is  limited  by  the  size  of  probe  and  also  depends  on 
desired  outputs  of  the  probe.  The  coil  of  our  probe  has  400  turns. 

The core material is usually air. If a ferrite rod is used for the core, the
effects of magnetic saturation and hysteresis on the flux should be modeled [10].

For an impedance analyzer used in eddy-current testing, the important characteristics are its
frequency band, resolution, and accuracy. A GR 1693 RLC DIGIBRIDGE is used
in  our  eddy-current  test  system.  It  has  a  frequency  band  from  5  Hz  to  200  kHz,  a 


resolution  of  4-i-  and  an  accuracy  of  ±  0.01  %. 


The pulse generator is one of the important units in pulsed eddy-current test
equipment, and a commercial pulse generator with a suitable frequency band is available
to us. Unfortunately, however, the sine wave signals from the GR 1693 RLC DIGIBRIDGE
have an unstable phase drift since they are produced digitally. Unstable phase differences
between the signals from the GR 1693 and those from the commercial pulse generator
make the readings of the GR 1693 unstable. Thus a pulse circuit using the sine wave
signals from the GR 1693 as its inputs needed to be designed. Fig. 4 (see page 16) shows a
block diagram of our pulse circuit, which is listed in Appendix 1.

In Fig. 4, a follower is used to provide isolation between the GR 1693 and the
pulse circuit assembly. The pulse generator has a variable resistance to change the duration
of the pulse signals. The magnitude of the current to the driving coil mainly depends on the
power amplifier.

A block diagram of our eddy-current test system is shown in Fig. 5 (see page 17).
A mechanical scanner (model ADM-1812/pc, manufactured by Creative Automation
Company) is used and can be controlled to move in the x, y or z direction by a computer
through HP-IB. A program (see Appendix 2) has been developed to control the
mechanical scanner to move in small steps in the x or y direction. The output of the probe is
stored in the computer, processed, and then plotted. The experimental results will
be given in section 3.
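
The scan-and-record loop described above can be sketched as follows. This is not the Appendix 2 program, which is not reproduced here: the serpentine path, the function names, and the `move_to` and `read_probe` callables standing in for the scanner and impedance-analyzer drivers are all hypothetical.

```python
def raster_positions(x_start, y_start, x_steps, y_steps, step):
    """Yield (x, y) positions for a serpentine raster scan in small steps."""
    for j in range(y_steps):
        # Reverse the x direction on every other row to avoid long return moves.
        columns = range(x_steps) if j % 2 == 0 else range(x_steps - 1, -1, -1)
        for i in columns:
            yield (x_start + i * step, y_start + j * step)


def scan(move_to, read_probe, positions):
    """Move the probe to each position and record one reading per point."""
    data = []
    for x, y in positions:
        move_to(x, y)                       # hypothetical scanner command
        data.append((x, y, read_probe()))   # hypothetical analyzer reading
    return data


# Stand-ins for the real instrument drivers, for illustration only.
move_log = []
readings = scan(lambda x, y: move_log.append((x, y)),
                lambda: 0.0,
                raster_positions(0.0, 0.0, 3, 2, 1.0))
print(readings)
```

Keeping the path generation separate from the instrument calls lets the scan pattern be tested without the hardware attached.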
2. IMPEDANCE CALCULATION USING THE METHOD OF MOMENTS


In the area of eddy-current testing, fast and accurate numerical methods
for analyzing the forward problem, namely predicting the probe response to a defect of known
size, still need to be developed, although FEM and BEM are available in NDE. In this
report, the method of moments is applied to the analysis of a forward problem in
eddy-current testing.

Statement of the problem: Let us assume that there are some defects in a flat metal
plate which is thick enough that the field produced by the current in the probe coil
cannot penetrate through it, as shown in Fig. 6 (see page 18). As shown in Fig. 7 (see
page 18), the probe coil of radius 'a' is located horizontally at a height 'h' in
air. The axis of the probe is taken as the z-axis of the coordinate system and the surface
of the metal plate as the x-y plane or the $\rho$-$\varphi$ plane. Some arbitrarily shaped defects
are centered at $(\rho_{ai}, \varphi_{ai}, z_{ai})$ $(i = 1, 2, \ldots, N)$. The medium of the $i$th defect is characterized
by the conductivity $\sigma_{ai}$, the permittivity $\varepsilon_{ai}$, and the permeability $\mu_{ai}$. The metal
plate is characterized by the parameters $\sigma_1$, $\varepsilon_1$ and $\mu_1$.

The electromagnetic field produced by the coil current is disturbed by the defects
because of the discontinuity at the interface between a defect and the metal plate. The
total field is the superposition of the incident field, which exists when there are no defects,
and the scattered field due to the defects. The coil impedance therefore also has two parts,
namely a standard impedance associated with the incident field, and a disturbed
impedance due to the scattered fields of the defects. The standard impedance can be
calibrated or calculated using the method published in [11]. The disturbed impedance
will be calculated in this report using the Method of Moments.

In  order  to  obtain  the  disturbed  impedance,  it  is  necessary  to  calculate  the  scattered 
field  by  defects  in  the  metal  plate.  According  to  the  equivalence  principle  in 
electromagnetic  theory  [12]  ,  the  scattered  field  by  defects  can  be  equivalent  to  the 
field  produced  by  the  equivalent  magnetic  current  sources  on  the  interface  between 
defects  and  the  metal  plate.  Here,  magnetic  currents  need  to  be  impressed  on  the  two 
sides  of  the  interface  in  a  way  that  both  have  the  same  magnitudes  but  the  opposite 
directions  in  order  to  match  the  tangential  component  of  the  electric  fields  on  the 
interface.  The  tangential  component  of  the  magnetic  fields  on  the  interface  will  be 
matched  using  the  following  equation. 

( i ) 

Then the fields both inside and outside the defects are unique according to the
uniqueness theorem in electromagnetic theory [12]. The above equation will be
treated here using the point-matching technique with subsectional bases of the method of
moments, in order to deal with problems with arbitrarily shaped defects. After the
equivalent magnetic currents on the interfaces are obtained by solving equation (1), the
scattered fields and the disturbed impedance can be calculated from the following
equations.


£*  =  Vxf 

(2) 

Z*  =  f  t  dl 

J 

(3) 

om  cod 
where F, an electric vector potential, will be treated in the following section, and in
equation (3) the current flowing in the coil is taken to be unity.


GREEN'S  FUNCTIONS  AND  FIELD  CONSIDERATIONS 

The fields produced by the coil current have been given in reference [11], and can
be written as follows in the metal plate


|  a<p 


ff=VxA. 


where 


*.= ^e>(1V’d-) 

'  toe, 

*0=oW  p0e0 

where  I,  the  coil  current,  will  be  taken  as  unity  in  the  following  analysis  and  z  is 
negative. 

The field produced by a vertical electric dipole in the metal can be obtained from
reference [13], which in air medium is given by:


where 


T(5)= 


The magnetic vector potential produced by a horizontal electric dipole has, in principle, two components: one in the direction of the dipole and the other in the z direction, because of the boundary conditions on the interface between the air and the metal plate. In the problem treated in this report, however, the z-component of the magnetic vector potential due to the horizontal electric dipole is much smaller than the component in the direction of the dipole, since the metal plate has a high conductivity. For example, if the conductivity of the metal plate is of the order of $10^6$, the z-component of the magnetic vector potential will be about $10^{-3}$ of the component in the dipole direction, and hence can be neglected. The component of the magnetic vector potential in the dipole direction can be calculated by equation (6).

Using the duality of electromagnetic fields, we can obtain the electric vector potential from equation (6). Again, the z-component of the electric vector potential due to a horizontal magnetic dipole is neglected for the same reason.
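
As a reminder of what the duality step involves, the standard textbook substitution between an electric-source problem and a magnetic-source problem is restated below. The symbols J, M, A, F, E, H, $\varepsilon$ and $\mu$ follow common usage and are assumed here to match the notation of this report; the block is not a reproduction of the report's own equations.

```latex
% Standard duality: replace each quantity on the left by the one on the right
\begin{aligned}
\mathbf{J} &\to \mathbf{M}, & \mathbf{A} &\to \mathbf{F}, \\
\mathbf{E} &\to \mathbf{H}, & \mathbf{H} &\to -\mathbf{E}, \\
\varepsilon &\to \mu, & \mu &\to \varepsilon .
\end{aligned}
```

Applied to an expression for the magnetic vector potential of an electric dipole, this substitution yields the electric vector potential of the corresponding magnetic dipole.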


ANALYSIS  USING  THE  METHOD  OF  MOMENTS 
We  can  rewrite  equation  (1)  as  follows 

(/-U.-JV)  (7) 

where 

In order to treat a problem with arbitrarily shaped defects, we solve equation (7) for M using the point-matching method in conjunction with subsectional bases.

The magnetic currents on the surface of the nth defect can be expanded in subsectional bases:

(s-sj  («*  1,2, -AO  (8) 


where $\delta(s - s_m)$ is a delta function at $s_m$.

X  *.*<«.)  «= >  (») 

*  *  t 

Substituting  eq  (9)  in  eq  (7),  we  have  : 

’■  <  10> 

ft  I  I 

In the point-matching method, the testing function is taken as a delta function $\delta(s - s_{ij})$. Taking the inner product of $\delta(s - s_{ij})$ with each side of (10), we have

XX^.s^jhX  ««,))=-«*  (11) 




where  H1  ’’  can  be  obtained  from  (5).  #L.rff(  and  fiis-sj)  are 

obtained  as  follows 

The electric vector potential produced by the magnetic current $M_m \delta(s - s_m)$ outside the defect can be written as follows:

f"  J  4(4lp„-p  (12) 


where 


r.(5>= 


r,($)= 


M-p^i  M-i^-o 

HoVnA 

£1^0~£0^'l 

£>Wi 


The magnetic field outside the defect is:

The  electric  vector  potential  inside  the  defect  is  : 

f  *.**=-?*- l4 — ri  (14) 

*  4tc  l/v-r^l 

The  magnetic  field  inside  the  defect  is  given  by  : 

,md,+-i_7(V-F„;'  (15) 

J10*1,,! 

where the subscript "t" means parallel to the interface between the air and the metal plate.

Deduced from (13) and (15), the tangential magnetic field both outside and inside the defect can be written as follows:

where 

P--T 

(».  i;= 

kl 
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PVT 

i\i;= v 

A_ 


"VT 


wjH  V 
LVl 

Here v stands for x, y or z.

(17) 

Substituting the results (16) and (17) in (11), we have


Using matrix notation, we have:

where 

[A,.l’=(A,lAia,-A<j<]' 

W.,H 

Letting $i = 1, 2, \ldots, N$ and $j = 1, 2, \ldots, k_i$, we have

$[A_x]^v [M_x] + [A_y]^v [M_y] + [A_z]^v [M_z] + [B_x]^v [M_x] + [B_y]^v [M_y] + [B_z]^v [M_z] = -[H_v]$    (20)

where 

(A,r=|[A,i;.iA,];,-.[A,i„]r 
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vtx**., 

Eq. (20) can be rewritten as follows:

$[C_x]^v [M_x] + [C_y]^v [M_y] + [C_z]^v [M_z] = -[H_v]$    (21)

v = x, y, z

where

$[C_u]^v = [A_u]^v + [B_u]^v$,  u = x, y, z

By solving equation (21), we can obtain $M_x$, $M_y$ and $M_z$.
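
The point-matching procedure described above reduces the integral equation to a linear system for the unknown current samples. The following is a minimal sketch of that idea in Python, not the FORTRAN program of Appendix 3: the function names `solve_linear_system` and `point_matching`, the scalar `kernel` and the `incident` callable are all hypothetical stand-ins for the dyadic kernels and the incident field of equation (21).

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting (complex-safe)."""
    n = len(b)
    # Work on an augmented copy so the caller's data is left untouched.
    aug = [list(map(complex, row)) + [complex(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Choose the row with the largest pivot magnitude in this column.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) == 0.0:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    # Back substitution.
    x = [0j] * n
    for row in range(n - 1, -1, -1):
        s = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - s) / aug[row][row]
    return x


def point_matching(kernel, points, incident):
    """Enforce sum_m kernel(s_l, s_m) * M_m = -incident(s_l) at every match point s_l."""
    matrix = [[kernel(sl, sm) for sm in points] for sl in points]
    rhs = [-incident(sl) for sl in points]
    return solve_linear_system(matrix, rhs)
```

Because both the basis and the testing functions are delta functions, each matrix entry is a single kernel evaluation and no numerical integration over the subsections is required; this is the main practical attraction of point matching.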


IMPEDANCE  CALCULATION 

After the magnetic currents $M_x$, $M_y$ and $M_z$ are obtained, the impedance of the receiving probe due to them can be calculated as follows.

Letting the current in the receiving coil be equal to unity, we can obtain the impedance by calculating the voltage induced in the receiving coil.


produced by $M_x$, $M_y$ and $M_z$ can be obtained as follows

As shown in fig. 8 (a) (see page 19), a magnetic current source with three components is located at ($x_m$, $y_m$, $z_m$). Then the electric vector potentials $F_{xm}$, $F_{ym}$ and $F_{zm}$ can be obtained as follows

(23)


F.=^  ]  U$P 


where 


TJ&- 


2e,X0 


e.Ve^, 
2^Ao 


TJ®* 


P mriO^m-a  cos^+Ov-a  sin^)2] 2 
The  electric  fields  can  then  be  obtained  from  equation  (23),  which  is 

E.=Y  (V*P.  .)•*  (24) 

The receiving coil is assumed to be horizontally placed, so $E_{zm}$ makes no contribution to the voltage induced in the coil. $E_{xm}$ and $E_{ym}$ are of interest here. As shown in fig. 8(b) (see page 19),

$E_{\varphi m} = -E_{xm} \sin\varphi + E_{ym} \cos\varphi$    (25)

The voltage (or impedance) due to the magnetic current source $M_m$ is

$V_m = \sum_{q=1}^{Q} E_{\varphi m} \, \Delta l$    (26)

where $\Delta l = 2\pi a / Q$ (the coil is divided into Q sub-sections)

The  total  voltage  (  or  impedance  )  is 

( X  X  (v*.»  <27> 

*  "  * 
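
The summation over coil sub-sections can be sketched as below. This is an illustrative Python fragment, not the report's program: the function name `coil_voltage` is hypothetical, and the sub-section length $2\pi a / Q$ is taken from the way the Appendix 3 listing accumulates SEFIA with the factor 2*pi*AA/NNF.

```python
import math


def coil_voltage(e_phi, radius):
    """Sum E_phi * delta_l over the Q equal sub-sections of a circular coil.

    e_phi  -- azimuthal electric field sampled once per sub-section (real or complex)
    radius -- coil radius a, in metres
    """
    q = len(e_phi)
    delta_l = 2.0 * math.pi * radius / q  # arc length of one sub-section
    return sum(e * delta_l for e in e_phi)
```

With the coil current set to unity, the value returned for the field of one current source corresponds to that source's contribution to the probe impedance; summing over all sources gives the total.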

PROGRAM 

The  flow  chart  of  the  FORTRAN  program  using  the  Method  of  Moments  is  shown 
in  fig.  9  (see  page  20).  The  detailed  program  is  listed  in  Appendix  3. 


CALCULATED  RESULTS 
We have used the above program to calculate the variation of the probe impedance with frequency from 60 Hz to 200 kHz when the probe is centered over two holes. Both holes have a circular cross section and a depth of 7.86 mm; their diameters are 6.35 mm and 3.81 mm, respectively. A comparison between the calculated results and the experimental results is shown in fig. 10 (see page 21). The two sets of results are in good agreement.


3.  EXPERIMENTAL  INVESTIGATION 

In this subsection, experimental studies of NDE of an aluminum plate are reported. The experimental set-up, the receiving probe and the pulse current waveform applied to the driving coils are shown in fig. 11 (see page 22). Three machined holes, #1, #2 and #3, in the plate are shown in fig. 12 (see page 23), which also gives the sizes of the plate and of the three holes. The thickness of an individual aluminum foil used in the experiment is 0.2 mm. The operating frequency is 10 kHz.

Figs. 13 through 16 show the eddy-current test results (the magnitudes of the probe impedance). In the experiment, the mechanical scanner moves the probe in steps of 0.1 inch in the horizontal direction and 0.5 inch in the vertical direction, and after each step the impedance reading of the GR 1693 RLC DIGIBRIDGE is stored in the computer. Accordingly, in each of figures 14 through 16 there are 30 points in the horizontal direction and 5 rows in the vertical direction. As shown in the figures, the impedance magnitude exhibits a peak when the probe is moved across the hole (defect). In figure 13 the hole is not covered by a layer of aluminum foil, so the impedance peaks are large under both low and high resolution, whereas in figures 14 and 15 the peaks are reduced in intensity by the three layers of aluminum foil covering hole #1 and hole #2, respectively. In figure 16, where four layers of aluminum foil are over hole #2, the impedance peaks are reduced even further. Fig. 17 shows preliminary results of the detection of a second-layer crack using the current pulse technique.
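
The reduction of the peaks with added foil layers can be put in perspective with the standard skin-depth formula, $\delta = \sqrt{2 / (\omega \mu \sigma)}$. The sketch below is an illustration only: the conductivity of 5.7E7 S/m is the value assigned to CITA in the Appendix 3 listing and is assumed here to apply to the foil, and the function name `skin_depth` is hypothetical.

```python
import math

MU_0 = 4.0e-7 * math.pi  # permeability of free space, H/m


def skin_depth(frequency_hz, conductivity, relative_permeability=1.0):
    """Standard skin depth in metres: sqrt(2 / (omega * mu * sigma))."""
    omega = 2.0 * math.pi * frequency_hz
    return math.sqrt(2.0 / (omega * MU_0 * relative_permeability * conductivity))


# Three foils of 0.2 mm each at the 10 kHz operating frequency.
foil_stack = 3 * 0.2e-3
delta = skin_depth(10.0e3, 5.7e7)
print("skin depth (m):", delta)
print("foil stack in skin depths:", foil_stack / delta)
```

Under these assumptions the skin depth is roughly 0.67 mm, so a stack of three 0.2 mm foils is a little under one skin depth thick, which is consistent with peaks that are reduced but still visible.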

The results presented in this report were obtained with a current pulse of 0.5 amp. Longer current pulses are, of course, needed for detection of defects or flaws at greater depth in high-conductivity materials. Current pulses of 50 amp are under development, and the results will be forthcoming early next year.


4.  MICROSTRUCTURAL  STATE  OF  ALUMINA  CERAMICS  BY  THE 
TECHNIQUE  OF  TIME-DOMAIN  DIELECTRIC  SPECTROSCOPY 
( TDDS ) 

Approximately a month was spent on this aspect of the grant. Sintered alumina samples were measured in the frequency range 1 to 10 kHz at a temperature of 60 °C. The experimentally measured parameter was the capacitance, measured in a guarded-disk geometry by TDDS. The samples showed a fairly large change in $\varepsilon'$, the real part of the permittivity, with densification. Broad dispersion regions were observed, which are indicative of multiple dielectric relaxation processes. We plan to continue this study next year at different temperatures and also at frequencies lower than 1 kHz, for both pure alumina and glass-containing samples. Detailed interpretation of the results will lead to a characterization of the different phases and interfaces present in the samples.
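
For orientation, the conversion from a measured capacitance to the real part of the relative permittivity can be sketched as follows. The sketch assumes an ideal parallel-plate relation, which is what a guarded-disk geometry is intended to approximate; the function name and the sample dimensions in the usage lines are hypothetical and are not taken from the measurements reported here.

```python
EPSILON_0 = 8.854e-12  # permittivity of free space, F/m


def relative_permittivity(capacitance, thickness, electrode_area):
    """Real relative permittivity from C = eps0 * eps_r * A / d (fringing neglected)."""
    return capacitance * thickness / (EPSILON_0 * electrode_area)


# Hypothetical sample: 1 mm thick disk with a 1 cm^2 guarded electrode.
thickness = 1.0e-3
area = 1.0e-4
capacitance = 8.7e-12  # farads, illustrative value only
print(relative_permittivity(capacitance, thickness, area))
```

Tracking this quantity against sample density is one simple way to express the change with densification noted above.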

CONCLUSIONS


During the tenure of the mini-grant, a rather sophisticated pulse eddy-current test set-up has been developed. It has been demonstrated that second-layer cracks can be delineated to an accuracy of about 2 percent in aluminum metal plates. A pulse transmitter with a 50 amp pulse current is almost complete, and it is planned to continue this study next year for detection of deeper flaws in non-magnetic as well as magnetic materials. The eddy-current test problem has been analyzed using the method of moments, and a detailed FORTRAN program specific to the present investigation has been developed in-house.

Time-Domain  Dielectric  Spectroscopy  (TDDS)  has  been  used  in  a  brief  study  of 
the  microstructural  state  of  alumina  ceramics.  Preliminary  results  are  promising  and 
the  study  will  be  continued  next  year  extending  to  glass-containing  samples. 
Fig. 1 Electrical connections for reflection measurement

Fig. 2 Electrical connections for through-transmission measurements

Fig. 4 Block diagram for pulse circuit

Fig. 5 Block diagram of eddy-current test system

Fig. 6 Configurations for a practical problem

Fig. 7 The problem to be analyzed

Fig. 8 (a) Magnetic currents at Sf
(b) Electric fields on the c
produced by magnetic

Fig. 9 FORTRAN program flow chart for calculating
the coil impedance using the Method of Moments

Fig. 11 (a) the eddy-current test set-up
(b) the detector probe
(c) the current pulse waveform

Fig. 13 No aluminum foil is placed on the hole #1
(a) under low resolution
(b) under high resolution

Fig. 14 Three layers of aluminum foil,
each of thickness 0.2 mm,
are placed on the hole #1.

Fig. 15 Three layers of aluminum foil,
each of thickness 0.2 mm,
are placed on the hole #2.

Fig. 16 Four layers of aluminum foil,
each of thickness 0.2 mm,
are placed on the hole #2.

Fig. 17 Detection of a second-layer crack
using the current pulse technique
Appendix 1

Schematic of the Current Pulse Generator

Appendix 2

The Program Used to Control the Mechanical Scanner

Appendix 3

The FORTRAN Program
Used to Calculate the Receiving Coil Impedance Variations
Due to Defects or Flaws in Metal Plates
Using the Method of Moments
C     RETURN WITH THE VALUE OF H(RUO)
C     RETURN WITH THE VALUE OF H(Z)
      PARAMETER (KKK=152)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,XX,WW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      COMMON /WORKSP/ RWKSP
      COMPLEX HFIA(KKK),HZ(KKK),AZZ(KKK,KKK),AFF(KKK,KKK)
      COMPLEX BFF(KKK,KKK),BZZ(KKK,KKK),Y12,Z12,C(KKK,KKK),D(KKK,KKK)
      COMPLEX E(KKK,KKK),EFIA,SEFIA,COST1,COST2,RESULT1,RESULT2,A1
      REAL RWKSP(13944)
      REAL K0,K1
      CALL IWKIN(13944)
      NCOEF=21
      NNF=11
      NNZ=3
      CALL PARAM(NNF,NNZ,HFIA,HZ,AZZ,AFF,BZZ,BFF)
C     PRINT THE INCIDENT FIELD SAMPLES
      DO 051 I=0,NNZ
      DO 052 J=1,NNF
      IJ=I*NNF+J
      WRITE(6,*) HFIA(IJ),' - HFIA'
      WRITE(6,*) HZ(IJ),' - HZ',IJ
  052 CONTINUE
  051 CONTINUE
C     PRINT THE COUPLING MATRICES
      DO 061 I=0,NNZ
      DO 062 J=1,NNF
      IJ=I*NNF+J
      DO 063 II=0,NNZ
      DO 064 JJ=1,NNF
      IIJ=II*NNF+JJ
      WRITE(6,*) BZZ(IJ,IIJ),' - BZZ',IJ,IIJ
      WRITE(6,*) BFF(IJ,IIJ),' - BFF',IJ,IIJ
  064 CONTINUE
  063 CONTINUE
  062 CONTINUE
  061 CONTINUE
      LDA=NNF*(NNZ+1)
C
C     NOTE: THE NEXT NINE STATEMENTS LOAD 2 X 2 TEST VALUES; AS LISTED
C     THEY OVERRIDE LDA AND THE BFF, BZZ ENTRIES RETURNED BY PARAM
      LDA=2
      BFF(1,1)=CMPLX(1.,1.)
      BFF(1,2)=CMPLX(1.,-1.)
      BFF(2,1)=CMPLX(0.,-1.)
      BFF(2,2)=CMPLX(0.,1.)
      BZZ(1,1)=BFF(1,1)
      BZZ(1,2)=BFF(1,2)
      BZZ(2,1)=BFF(2,1)
      BZZ(2,2)=BFF(2,2)
      CALL CGEMUL(BFF,KKK,'C',BFF,KKK,'N',C,KKK,LDA,LDA,LDA)
      CALL LINCG(LDA,C,KKK,C,KKK)
      DO 069 KJ=1,LDA
      DO 068 KL=1,LDA
      WRITE(6,*) C(KJ,KL),KJ,KL
  068 CONTINUE
  069 CONTINUE
      CALL CGEMUL(C,KKK,'N',BFF,KKK,'C',E,KKK,LDA,LDA,LDA)
      CALL CGEMM('N','N',LDA,LDA,LDA,(-1,0),E,KKK,BZZ,KKK,(0,0),C,KKK)
      A1=CMPLX(-NNF*NNZ/BB/HHZ*2.,0.)
      CALL CGEMM('N','N',LDA,1,LDA,A1,E,KKK,HFIA,KKK,(0,0),D,KKK)
      CALL CGEMM('N','N',LDA,1,LDA,(-1,0),AFF,KKK,D,KKK,A1,HZ,KKK)
      CALL CGEMM('N','N',LDA,LDA,LDA,(1,0),AFF,KKK,C,KKK,(1,0),AZZ,KKK)
      CALL LSACG(LDA,AZZ,KKK,HZ,1,HFIA)
      CALL CGEMM('N','N',LDA,1,LDA,(1,0),C,KKK,HFIA,KKK,(1,0),D,KKK)
C     MZ IS IN HFIA, MFIA IS IN D
      DO 025 II=1,LDA
C     WRITE(6,*) HFIA(II),' - MZ !'
  025 CONTINUE
      DO 026 II=1,LDA
C     WRITE(6,*) D(II,1),' - MFIA !'
  026 CONTINUE
C     END
C     CALCULATE VOLTAGE
      SEFIA=CMPLX(0.,0.)
      DO 081 I3=1,NNF
      EFIA=CMPLX(0.,0.)
      FIA0=2*3.14159265*I3/NNF
      DO 071 I1=0,NNZ
      Z=DD+HHZ*I1/NNZ
      Z=-Z
      DO 072 I2=1,NNF
      JJ=I1*NNF+I2
      COST1=D(JJ,1)/4./3.14159265
      COST2=HFIA(JJ)/4./3.14159265
      FIA=2*3.14159265*I2/NNF
      XXI=BB*COS(FIA)
      YYI=BB*SIN(FIA)
      RUO2=RUO0**2+BB**2+2.*RUO0*BB*COS(FIA)
      RUO=RUO2+AA**2+2*AA*SQRT(RUO2)*COS(FIA0)
C     ERUO, EFIA
      RUO=SQRT(RUO)
      RUO2=SQRT(RUO2)
      CAA=AA
      CRUO=RUO
      CONDT=1.
      Z0=K0**2
      CALL HRUOZ(HR1,HX1,HR2,HX2,CONDT)
      RUO=CRUO
      AA=CAA
      RESULT1=CMPLX(HR1,HX1)
      RESULT2=CMPLX(HR2,HX2)
C     WRITE(6,*) RESULT1,RESULT2,' - R1,R2',JJ
      EFIA=EFIA-RESULT1*COST1*(RUO2/RUO)*SIN(FIA0)
      EFIA=+EFIA+RESULT2*COST2*(AA+(RUO2)*COS(FIA0))/(RUO)
  072 CONTINUE
  071 CONTINUE
C     ADD THE CONTRIBUTION OF ONE COIL SUB-SECTION OF LENGTH 2*PI*AA/NNF
      SEFIA=SEFIA+EFIA*2*3.14159265*AA/NNF
  081 CONTINUE
      WRITE(6,*) SEFIA,' - IMPEDANCE'
      END

C     CALCULATE THE VALUE OF FIELD
      SUBROUTINE HRUOZ(HR1,HX1,HR2,HX2,CONDT)
      INTEGER INTERV,NOUT,IRULE
      REAL A,ABS,B,ALOG,ATAN,BOUND,ERRABS,ERREST,ERROR,ERRREL,EXACT
      REAL RESULT,CONST,K1,K0
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      INTRINSIC ABS,ALOG,SQRT,COS,SIN
      EXTERNAL F,G,DQDAG,UMACH,CONST,QDAG,FN1,FN2,FM1,FM2,FP,FP2,FA,FB
      EXTERNAL DBSJ1,FX,GX,FP1,FQ,DBSJ0,FPP,FQQ,FAA,FBB,FF,GG,FFX,GGX
      EXTERNAL EGX,EG,ERUOX,ERUO,PF,QF,AF,BF
      CALL UMACH(2,NOUT)
      Z1=Z0
      IF (CONDT) 468,468,469
  468 Z1=-Z0
  469 TEST2=ABS(K0*Z*SQRT(CIT/2.))
      IF (TEST2-70.) 001,001,599
  599 HR1=0.
      HX1=0.
      HR2=0.
      HX2=0.
      GOTO 600
C
  001 TEST1=1.
  401 SUM1=0.0
      SUM2=0.0
      TERV=2.
      IRULE=1
      A=0.0
      B=TERV
  110 ERRABS=0.0
      ERRREL=1.E-2
      IF (CONDT) 441,441,443
  443 IF (TEST1) 451,451,453
  451 CALL INTEG(ERUOX,RESULT)
      GOTO 404
  453 CALL INTEG(ERUO,RESULT)
      GOTO 404
  441 IF (TEST1) 402,402,403
  402 CALL INTEG(FX,RESULT)
      GOTO 404
  403 CALL INTEG(F,RESULT)
  404 SUM1=SUM1+RESULT*EXP(K0*Z*FN1((A+B)/2./K0/HH))
      IF (ABS(SUM1-SUM2)-1.E-20) 111,111,112
  112 A=B
      B=B+TERV
      SUM2=SUM1
      GOTO 110
  111 IF (TEST1) 405,405,406
  406 TEST1=-1.
      HR1=SUM1*Z1/K0/HH
      GOTO 401
  405 HX1=SUM1*Z1/K0/HH
      TEST1=1.
      A=0.0
      B=1.
      IRULE=1
  410 ERRABS=0.0
      ERRREL=1.E-2
      IF (CONDT) 461,461,463
  463 IF (TEST1) 471,471,473
  471 CALL INTEG(EGX,RESULT)
      GOTO 413
  473 CALL INTEG(EG,RESULT)
      GOTO 480
  461 IF (TEST1) 411,411,412
  411 CALL INTEG(GX,RESULT)
      GOTO 413
  412 CALL INTEG(G,RESULT)
  480 TEST1=-1.
      HR1=HR1+RESULT*Z1
      GOTO 410



File:  EDDY 2 


FORTRAN  A1  University  of  Kentucky 


  413 HX1=HX1+RESULT*Z1
C
      IF (CONDT) 490,490,491
  491 AA=RUO
      RUO=0.
  490 TEST1=1.
  501 SUM1=0.0
      SUM2=0.0
      TERV=2.
      IRULE=1
      A=0.
      B=TERV
  310 ERRABS=0.
      ERRREL=1.E-2
      IF (TEST1) 502,502,503
  502 CALL INTEG(FFX,RESULT)
      GOTO 504
  503 CALL INTEG(FF,RESULT)
  504 SUM1=SUM1+RESULT*EXP(K0*Z*FN1((A+B)/2./K0/HH))
      IF (ABS(SUM1-SUM2)-1.E-30) 311,311,312
  312 A=B
      B=B+TERV
      SUM2=SUM1
      GOTO 310
  311 IF (TEST1) 505,505,506
  506 TEST1=-1.
      HR2=SUM1*Z0/K0/HH
      GOTO 501
  505 HX2=SUM1*Z0/K0/HH
      TEST1=1.
      A=1.0
      B=0.
      IRULE=1
  510 ERRABS=0.
      ERRREL=1.E-2
      IF (TEST1) 511,511,512
  511 CALL INTEG(GGX,RESULT)
      GOTO 513
  512 CALL INTEG(GG,RESULT)
      TEST1=-1.
      HR2=HR2+RESULT*Z0
      GOTO 510
  513 HX2=HX2+RESULT*Z0
  600 RETURN
      END

C
      REAL FUNCTION F(XT)
      REAL X,K0,FN1,FN2,XT
      REAL ALOG,EXP
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      INTRINSIC ALOG,EXP,SQRT
      X=XT/K0/HH
      F1=(FP(X)*COS(K0*FN2(X)*Z)-FQ(X)*SIN(K0*FN2(X)*Z))
      F1=F1*EXP(-K0*HH*X+K0*(FN1(X)-FN1((A+B)/2./K0/HH))*Z)
      F=F1*DBSJ1(K0*RUO*SQRT(1.+X**2))*DBSJ1(K0*AA*SQRT(1.+X**2))
      RETURN
      END
      REAL FUNCTION FX(XT)
      REAL X,K0,FN1,FN2,XT
      REAL ALOG,EXP
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      X=XT/K0/HH
      ZKN=K0*Z*FN2(X)
      F1=FP(X)*SIN(ZKN)+FQ(X)*COS(ZKN)
      F1=F1*EXP(-K0*HH*X+K0*(FN1(X)-FN1((A+B)/2./K0/HH))*Z)
      ZKN=K0*RUO*SQRT(1.+X**2)
      FX=F1*DBSJ1(ZKN)*DBSJ1(K0*AA*SQRT(1.+X**2))
      RETURN
      END
C     FUNCTION FP(X)
      REAL FUNCTION FP(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      F1=UR*X+FN1(X)
      FP=2*UR*X*(FN1(X)*F1+FN2(X)**2)/(F1**2+FN2(X)**2)
      RETURN
      END
C     FUNCTION FQ(X)
      REAL FUNCTION FQ(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      FQ=FN2(X)*UR*X/((UR*X+FN1(X))**2+FN2(X)**2)
      RETURN
      END
C     FUNCTION FOR INTEGRAL (0.0,1.0)
      REAL FUNCTION G(X)
      REAL X,ALOG,EXP,K0,FA,FB
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      INTRINSIC ALOG,EXP,SQRT
      ZKN=K0*(HH*X+FM1(X)*Z)
      F1=FA(X)*COS(ZKN)+FB(X)*SIN(ZKN)
C     F1=F1*EXP(K0*(FM2(X)-FM2((A+B)/2.))*Z)
      F1=F1*EXP(K0*(FM2(X)*Z))
      ZKN=K0*RUO*SQRT(1.-X**2)
      G=F1*DBSJ1(ZKN)*DBSJ1(K0*AA*SQRT(1.-X**2))
      RETURN
      END
      REAL FUNCTION GX(X)
      REAL ALOG,BX,EXP,K0,FA,FB
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      ZKN=K0*(HH*X+FM1(X)*Z)
      F1=(FA(X)*SIN(ZKN)-FB(X)*COS(ZKN))*EXP(K0*FM2(X)*Z)
      GX=F1*DBSJ1(K0*RUO*SQRT(1.-X**2))*DBSJ1(K0*AA*SQRT(1.-X**2))
      RETURN
      END
      REAL FUNCTION FA(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL FM1,FM2
      F1=UR*X+FM1(X)
      FA=2*UR*X*(FM1(X)*F1+FM2(X)**2)/(F1**2+FM2(X)**2)
      RETURN
      END
      REAL FUNCTION FB(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL FM1,FM2
      F1=UR*X+FM1(X)
      FB=FM2(X)*UR*X/(F1**2+FM2(X)**2)
      END
      REAL FUNCTION FM1(X)
      REAL FP1
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      FM1=SQRT(0.5*(SQRT(FP1(X)**2+CIT**2)+FP1(X)))
      RETURN
      END
      REAL FUNCTION FM2(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL FP1
C     FOR FP1 MUCH LARGER THAN CIT A SERIES IN Y1 REPLACES THE DIFFERENCE
      IF (FP1(X)-CIT*1.E2) 0042,0042,0043
 0043 Y1=CIT/FP1(X)
      FM2=.5*SQRT(CIT*(Y1-.5*Y1**3+.75*Y1**5))
      GOTO 0044
 0042 FM2=SQRT(.5*(SQRT(FP1(X)**2+CIT**2)-FP1(X)))
 0044 RETURN
      END
      REAL FUNCTION FN1(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL FP2
      FN1=SQRT(.5*(SQRT(FP2(X)**2+CIT**2)+FP2(X)))
      RETURN
      END
      REAL FUNCTION FN2(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      INTRINSIC SQRT
      REAL FP2,K0,SQRT,CIT,APCR,UR
C     WRITE(6,*) FP2(X),' - FP2'
      IF (FP2(X)-CIT*1.E2) 0011,0011,0022
 0022 Y1=CIT/FP2(X)
      FN2=.5*SQRT(CIT*(Y1-.5*Y1**3+3./4.*Y1**5))
      GOTO 0033
 0011 FN2=SQRT(.5*(SQRT(FP2(X)**2+CIT**2)-FP2(X)))
 0033 RETURN
      END
      REAL FUNCTION FP2(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL X,K0,UR,APCR
      FP2=X**2+1-UR*APCR
      RETURN
      END
      REAL FUNCTION FP1(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      FP1=X**2-1+UR*APCR
      RETURN
      END
C     FUNCTION FOR CALCULATING H(Z)
      REAL FUNCTION FF(XT)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL K0
      X=XT/K0/HH
      ZKN=K0*FN2(X)*Z
      F1=EXP(-K0*HH*X+K0*Z*(FN1(X)-FN1((A+B)/2./K0/HH)))
      F1=(FPP(X)*COS(ZKN)+FQQ(X)*SIN(ZKN))*F1
      FF=F1*DBSJ1(K0*AA*SQRT(1.+X**2))*DBSJ0(K0*RUO*SQRT(1.+X**2))
      RETURN
      END
      REAL FUNCTION FFX(XT)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL K0
      X=XT/K0/HH
      ZKN=-K0*FN2(X)*Z
      F1=FPP(X)*SIN(ZKN)-FQQ(X)*COS(ZKN)
      F1=F1*EXP(-K0*HH*X+K0*(FN1(X)-FN1((A+B)/2./K0/HH))*Z)
      FFX=F1*DBSJ1(K0*AA*SQRT(1.+X**2))*DBSJ0(K0*RUO*SQRT(1.+X**2))
      RETURN
      END
      REAL FUNCTION GG(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL K0
      ZKN=-K0*(HH*X+FM1(X)*Z)
      F1=(FAA(X)*SIN(ZKN)+FBB(X)*COS(ZKN))*EXP(K0*FM2(X)*Z)
      GG=F1*DBSJ1(K0*AA*SQRT(1.-X**2))*DBSJ0(K0*RUO*SQRT(1.-X**2))
      RETURN
      END
      REAL FUNCTION GGX(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      REAL K0
      ZKN=K0*(HH*X+FM1(X)*Z)
      F1=(-FAA(X)*COS(ZKN)+FBB(X)*SIN(ZKN))*EXP(K0*FM2(X)*Z)
      GGX=F1*DBSJ1(K0*AA*SQRT(1.-X**2))*DBSJ0(K0*RUO*SQRT(1.-X**2))
      RETURN
      END
      REAL FUNCTION FPP(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      ZKN=UR*X+FN1(X)
      FPP=2*UR*X*SQRT(1.+X**2)*ZKN/(ZKN**2+FN2(X)**2)
      RETURN
      END
      REAL FUNCTION FQQ(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      FQQ=2*UR*X*SQRT(1.+X**2)*FN2(X)/((UR*X+FN1(X))**2+FN2(X)**2)
      RETURN
      END
      REAL FUNCTION FAA(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      ZKN=UR*X+FM1(X)
      FAA=2*UR*X*SQRT(1.-X**2)*ZKN/(ZKN**2+FM2(X)**2)
      RETURN
      END
      REAL FUNCTION FBB(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUO0,DD
      FBB=2.*UR*X*FM2(X)*SQRT(1.-X**2)/((UR*X+FM1(X))**2+FM2(X)**2)
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RETURN 

END 

-  PARAMETERS  !  - 

SUBROUTINE  PARAM (NNF, NNZ, HFIA, HZ, AZZ,  AFF,  BZZ,BFF> 

DIMENSION  HFIA (NNF/ NNZ) , HZ (NNF,  NNZ) 

COMPLEX  HFIA (100) , HZ  (100) ,AZZ(100,  100)  ,  AFF ( 100,  100) 

COMPLEX  BFF (100,100) ,BZ2(100,100) , Y12, Z12 

COMMON  Z0,  WFREQ,  U0,  APC0,  UR,  APCR,  CIT,  AA,  HH,  K0,  RUO,  Z,  A,B,QX,QW, 

COMMON  BB,  HHZ, RUOO, DD 

INTRINSIC  SQRT, SIN, COS, ACOS 

REAL  K1,K0,  SQRT, SIN, COS, ACOS 

PI-AC0S<-1.) 

U0-4.*PI*l.£-7 

UR-1. 

WRITE (6,*)  '  INPUT  F(HZ)' 

READ (5, * ) FREQ 
APCO-8. 854*1. E-12 
APCR-1. 

WFREQ-2 . *PI*FREQ 
K0-WFREQ*SQRT (UO*APC0) 

CITA-5. 7E7 

CIT-CITA*UR/ WFREQ/ APC0 
AA- . IE- 1 
HH-.1E-3 
Z0-K0**2*AA/2. 

Zl— ZO 

Y12-CMPLX(CITA,WFREQ*APC0*(H-APCR)  ) 

212-CMPLX  (0  . ,  -1 .  /WFREQ/UO*  <  1 .  *1 .  /UR) ) 

BB-l.E-2 
RUOO-2 . E-3 
DD— 0 . 

HHZ-2.E-2 

DO  012  JIZ-O.NNZ 

Z—  (DD+HHZ* JIZ/NNZ) 

DO  Oil  JIF-1 , NNF 
JI-JIZ*NNF+JIF 
FIA-2 . *PI*JIF /NNF 

RUO-SQRT ( RUOO *  *  2  +BB  *  *  2 + 2 . »RUOO*BB*COS (FIA) ) 

CONDT—  1. 

CALL  HRUOZ (HR1, HX1, HR2 . HX2 , CONDT) 

CCNS-RUO0*SIN (FIA) /RUO 

HFIA(JI)— - (CMP LX ( HR1 , HX1 ) ) * CMP LX ( CCNS ,  0  . ) 

HZ ( JI ) -CMP LX (HR2, HX2) 

XXI-BB*COS (FIA) 

YYI-BB*SIN (FIA) 

DO  014  JZ-O.NNZ 
ZZ—  (DD-*-HHZ*JZ/NNZ) 

DO  015  JF-l.NNF 
JII-JZ*NNF*JF 
FIA-2*PI * JF /NNF 
XXJ-BB*COS (FIA) 

YYJ-B8*SIN (FIA) 

RI J- (XXI-XXJ) **2f(YYI-YYJ)  **2-HZ-ZZ)  **2 

IF  (RIJ-l.E-10)  017,017,018 

RI J-SQRT ( (2*PI*BB/NNF) **2* (HHZ/NNZ) **2) /2 . 

GOTO  019 
RI J-SQRT (RIJ) 

AZZ(JI, JII)-Y12/RIJ-Z12*(1 . /RIJ**3*3. * (Z-ZZ)  **2/RIJ**5) 

AFF  ( JI,  JII)  — -3  .  *  (Z-ZZ)  /RIJ**S*  (Z12)  •  <XXJ*YYI-XXI*YYJ) 

ALP0- (XXI*XXJ+YYI*YYJ> /RIJ* *3 *3 . * (XXJ* YYI-XXI *YYJ) **2/RIJ**5 


      BFF(JI,JII)=Y12/RIJ-Z12*((ALP0))
      BZZ(JI,JII)=Z12*3.*(XXJ*YYI-XXI*YYJ)*(Z-ZZ)/RIJ**5
      AZZ(JII,JI)=AZZ(JI,JII)
      AFF(JII,JI)=AFF(JI,JII)
      BFF(JII,JI)=BFF(JI,JII)
      BZZ(JII,JI)=BZZ(JI,JII)
015   CONTINUE
014   CONTINUE
011   CONTINUE
012   CONTINUE
      WRITE(6,*) K0,RUO,' - K0 RUO'
      RETURN
      END

C     - FUNCTIONS FOR CALCULATING ERUO -
      REAL FUNCTION ERUO(XT)
      REAL X,K0,FN1,FN2,XT
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      X=XT/K0/HH
      F1=(PF(X)*COS(K0*FN2(X)*Z)-QF(X)*SIN(K0*FN2(X)*Z))
      F1=F1*EXP(-K0*HH*X+K0*(FN1(X)-FN1((A+B)/2./K0/HH))*Z)
      ERUO=F1*DBSJ0(K0*RUO*SQRT(1.+X**2))
      RETURN
      END

      REAL FUNCTION ERUOX(XT)
      REAL X,K0,FN1,FN2,XT
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      X=XT/K0/HH
      ZKN=K0*Z*FN2(X)
      F1=PF(X)*SIN(ZKN)+QF(X)*COS(ZKN)
      F1=F1*EXP(-K0*HH*X+K0*(FN1(X)-FN1((A+B)/2./K0/HH))*Z)
      ERUOX=F1*DBSJ0(K0*RUO*SQRT(1.+X**2))
      RETURN
      END

      REAL FUNCTION EGX(X)
      REAL K0
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      ZKN=K0*(HH*X+FM1(X)*Z)
      F1=(AF(X)*SIN(ZKN)+BF(X)*COS(ZKN))*EXP(K0*FM2(X)*Z)
      EGX=F1*DBSJ0(K0*RUO*SQRT(1.-X**2))
      RETURN
      END

      REAL FUNCTION EG(X)
      REAL K0
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      ZKN=K0*(HH*X+FM1(X)*Z)
      F1=AF(X)*COS(ZKN)-BF(X)*SIN(ZKN)
      F1=F1*EXP(K0*FM2(X)*Z)
      EG=F1*DBSJ0(RUO*K0*SQRT(1.-X**2))
      RETURN
      END

      REAL FUNCTION PF(X)
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,QX,QW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      F1=FN1(X)+APCR*X
      PF=2*X**2*F1/(F1**2+FN2(X)**2)
      RETURN


      END

      REAL FUNCTION QF(X)
      QF=2.*FN2(X)*X**2/((FN1(X)+APCR*X)**2+FN2(X)**2)
      RETURN
      END

      REAL FUNCTION AF(X)
      F1=FM1(X)+APCR*X
      AF=2.*X**2*F1/(F1**2+FM2(X)**2)
      RETURN
      END

      REAL FUNCTION BF(X)
      BF=2.*X**2*FM2(X)/((FM1(X)+APCR*X)**2+FM2(X)**2)
      RETURN
      END

C     - CALCULATE THE INTEGRAL COEFFICIENTS -
      SUBROUTINE COEF(ICA,QX,QW)
      INTEGER I,IWEIGH,NFIX,NOUT
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,XX,WW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      REAL ALPHA,BETA,QXFIX(2),SUMF
      REAL QX(21),QW(21)
      EXTERNAL GQRUL,UMACH,BAS
      CALL UMACH(2,NOUT)
      NCOEF=21
      ICA=NCOEF
      IWEIGH=1
      ALPHA=0.
      BETA=0.
      NFIX=0
      CALL GQRUL(NCOEF,IWEIGH,ALPHA,BETA,NFIX,QXFIX,QX,QW)
      RETURN
      END

C     - CALCULATE INTEGRAL FROM A TO B -
      SUBROUTINE INTEG(BAS,SUMF)
      EXTERNAL BAS,COEF
      COMMON Z0,WFREQ,U0,APC0,UR,APCR,CIT,AA,HH,K0,RUO,Z,A,B,XX,WW,NCOEF
      COMMON BB,HHZ,RUOO,DD
      REAL QX(21),QW(21)
      CALL COEF(ICA,QX,QW)
      SUMF=0.
      DO 020 II=1,ICA
      XINT=(B-A)/2.*QX(II)+(B+A)/2.
      SUMF=SUMF+BAS(XINT)*QW(II)
020   CONTINUE
      RETURN
      END
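
The quadrature used by COEF and INTEG above can be sketched in Python. This is only an illustration, under two assumptions that the listing does not confirm: that the library call with IWEIGH=1 returns Gauss-Legendre nodes and weights on [-1, 1], and that the interval scale factor (B-A)/2, which the printed loop does not apply, is supplied elsewhere in the original program. The names gauss_legendre and integrate are hypothetical.

```python
import math


def gauss_legendre(n):
    """Nodes and weights of the n-point Gauss-Legendre rule on [-1, 1]."""
    nodes, weights = [], []
    for i in range(1, n + 1):
        # Common starting guess for the i-th root of P_n, refined by Newton steps.
        x = math.cos(math.pi * (i - 0.25) / (n + 0.5))
        for _ in range(100):
            p0, p1 = 1.0, x  # P_0(x) and P_1(x)
            for k in range(2, n + 1):
                # Three-term recurrence: after the loop p1 = P_n, p0 = P_(n-1).
                p0, p1 = p1, ((2 * k - 1) * x * p1 - (k - 1) * p0) / k
            dp = n * (x * p1 - p0) / (x * x - 1.0)  # derivative P_n'(x)
            dx = p1 / dp
            x -= dx
            if abs(dx) < 1e-15:
                break
        nodes.append(x)
        weights.append(2.0 / ((1.0 - x * x) * dp * dp))
    return nodes, weights


def integrate(f, a, b, n=21):
    """Integrate f over [a, b] with an n-point rule (21 points, as NCOEF=21)."""
    nodes, weights = gauss_legendre(n)
    half, mid = 0.5 * (b - a), 0.5 * (b + a)
    # Same node mapping as XINT in the listing; the factor `half` is the
    # interval scale that this sketch applies explicitly (an assumption).
    return half * sum(w * f(half * x + mid) for x, w in zip(nodes, weights))


if __name__ == "__main__":
    print(integrate(math.sin, 0.0, math.pi))
```

The mapping half * x + mid is the same affine change of variable as the XINT statement; only the explicit scale factor differs from the printed loop.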
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Title:  Silicon/Tin  Polymers  for  Enhanced  Third  Order  Nonlinear  Optical  Properties 
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Institution:  Northwestern  University 

Evanston,  IL  60208 

PI  Department:  Chemistry 

Amount:  $20,000 

Duration:  12  months  (November  1,  1990-October  31,  1991) 

As described in the original proposal, polysilanes have very favorable
nonlinear optical properties. In an effort to enhance these properties and
possibly obtain values of χ(3) in a range suitable for practical applications,
we have prepared polysilanes doped with the more highly polarizable tin atom.
During the grant period we succeeded in obtaining polysilylstannanes, which we
now plan to examine for enhanced NLO properties.

The synthesis of polysilanes usually involves condensation of dichlorosilanes
with sodium metal. Success of the polymerization reaction often depends on the
nature of the solvent, the reaction conditions, and the purity of the
reactants. Similar procedures have been developed to prepare polystannanes,
but chain lengths are much shorter than with polysilanes, presumably because
of the much lower bond strength of Sn-Sn. A copolymer of silicon and tin would
contain Si-Sn rather than Sn-Sn bonds and could possibly attain reasonably
long chain lengths. Copolymers of Si with Ge are known, but not of Si with Sn.

To make poly-Si/Sn we selected the Wurtz-type coupling that has been very
successful with polysilanes. We used phenylmethyldichlorosilane (PhMeSiCl2)
and dibutyldichlorostannane (Bu2SnCl2) as our starting materials. Numerous
variants were explored without successful isolation of polymers. Through
control reactions, we learned that the Si-Sn bond is unstable to most reaction
conditions using metals (typical Wurtz conditions). We finally determined that
treating the mixed dichlorides with lithium metal, with 12-crown-4 as a
chelating agent and diethyl ether as solvent, resulted in polymerization.

The resulting polymer has been characterized by proton, carbon-13, silicon-29,
and tin-119 NMR spectroscopy, and by UV spectroscopy. Optimization of reaction
conditions with respect to solvent, concentrations, time, and temperature has
been carried out, and we have determined that the Si-Sn bond is stable under
the reaction conditions. We also have explored changing the ratio of the Si
and Sn dichlorides in order to control the Si/Sn ratio in the product.
Molecular weights have been examined by mass spectrometry and gel permeation
chromatography.

Work is continuing on the preparation, purification, and characterization of
these novel materials. In particular, we plan to measure χ(3) by the third
harmonic generation method, which is available in the laboratory of a physics
colleague.
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CONFORMATIONAL  STRUCTURE  AND  DYNAMICS 
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Martin  Schwartz 
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University  of  North  Texas 


In order to establish a methodology for future quantum mechanical
investigations of PFPAE model compounds, the geometries, energies and
vibrational frequencies of 1,1,2-trichloro-1,2,2-trifluoroethane [TCTFE]
were studied by ab initio calculations using various basis sets. Bond
angles calculated with the 6-31G(d) and 6-311G(d) bases were in close
agreement with each other and with experimental results. Comparable
energies of the equilibrium conformers and torsional barriers were
obtained by MP2 calculations using the 6-31G(d), 6-311G(d) and
6-311G(2df) bases. The calculated equilibrium energy difference is in
qualitative agreement with experimental results. It was concluded that
the 6-31G(d) basis set provides satisfactory results, comparable to
those obtained using larger polarized bases.

Fluorine-19 NMR spin-lattice (T1) relaxation times were measured
as a function of temperature and frequency for several PFPAE's. Derived
correlation times and activation energies revealed a substantially
greater barrier to rotation about C-C than about C-O bonds in these
systems. There is a significant variation in the values of Ea obtained
at the two experimental frequencies (84.7 and 282.2 MHz), indicating
that overall and internal polymer chain rotation affect the two
measurements differently.
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CONFORMATIONAL  STRUCTURE  AND  DYNAMICS 
OF  PERFLUOROPOLYALKYLETHER  LUBRICANTS 

Martin  Schwartz 

I.  INTRODUCTION 

Perfluoropolyalkyl ether (PFPAE) fluids possess the viscoelastic,
thermal and lubricity properties necessary to serve as effective, stable
liquid phase lubricants [1,2]. No currently available commercial PFPAE
lubricants, however, are capable of operation at the temperature
extremes and oxidative conditions required for lubrication of high
performance gas turbine engines.

The viscoelastic properties of polymer fluids such as the PFPAE's
are, of course, intimately connected to the chain flexibility in these
systems which is, in turn, dependent upon the potential energy barriers
to internal rotation about single bonds in the polymer. This year, I
have been engaged in two research projects designed to obtain a better
understanding of chain mobility in PFPAE's. (1) In collaboration with
Dr. Harvey L. Paige at the Materials Directorate, Wright Laboratory,
Wright-Patterson AFB, I have begun ab initio and semi-empirical quantum
mechanical modelling studies of rotational barriers in molecules related
to perfluoroethers. (2) Here at the University of North Texas, we have
been obtaining data on the frequency and temperature dependence of
fluorine-19 NMR spin-lattice relaxation times (T1) in model PFPAE
compounds. This latter technique affords a direct measurement of the
flexibility at various points along the polymer chain.

In  the  future,  Dr.  Paige  and  I  plan  to  use  the  results  of  the 
quantum  mechanical  modelling  studies  in  molecular  dynamics  simulations 
of  the  internal  rotation  in  PFPAE  chains.  The  results  of  these 
simulations will be compared to the experimental NMR measurements, which
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will  provide  an  assessment  of  the  accuracy  of  the  calculations. 
Ultimately,  we  expect  to  be  able  to  use  molecular  modelling  to  predict 
the  viscoelastic  and  conformational  properties  of  these  fluids,  thus 
aiding  in  the  molecular  design  of  new  PFPAE  lubricants. 

II.  AB INITIO MOLECULAR ORBITAL STUDIES OF ROTATIONAL BARRIERS
Introduction  The barriers to internal rotation about the C-O and C-C
bonds and, hence, chain flexibility in PFPAE's depend upon both the
electronic and steric properties of groups attached to the bonded atoms.
Prior to beginning our investigations of the perfluoroethers, Dr. Paige
and I decided to establish the methodology via studies of two
halofluoroethanes, 1,1,2-trichloro-1,2,2-trifluoroethane (CCl2F-CClF2)
[TCTFE] and 1,2-dichloro-1,1,2-trifluoro-2-iodoethane (CClF2-CClFI)
[DCTFIE]. The chlorine and iodine atoms in these molecules possess very
different steric and electronic properties and provide us with the
opportunity to assess the accuracy of the calculational procedures
without the added complication of multiple internal rotations. The work
on TCTFE has been completed, and an article has been written and
accepted for publication in the Journal of Physical Chemistry [3]. The
computations have been completed on DCTFIE and the analysis is almost
complete. In addition, we have measured the Raman vibrational spectra
and Dr. James Liang of the Materials Directorate has obtained
experimental IR frequency and intensity data on the vibrations in
DCTFIE, which will be compared with theoretical predictions. Results on
the latter molecule will be presented in an article to be submitted this
Spring [4].

Calculations  Ab initio molecular orbital calculations were
performed using the Gaussian-90 [5] program on a Cray X-MP/216 computer.
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In TCTFE, the two equilibrium geometries, C1 and Cs, are termed gauche
(G) and trans (T), respectively, to denote the relative positions of the
lone fluorine (F1) on the first carbon atom and the single chlorine
(Cl1) on the second carbon; the transition state structures are called
GT and GG'. The equilibrium and saddle point geometries were optimized
with the following basis sets: 3-21G [6], 6-31G [7], D95 [8],
6-31G(d) [7,9] and 6-311G(d) [9,10]. Calculations on the four conformers
were also performed with the 6-311G(2df) [9,10] basis set using the
6-311G(d) geometries.

Single point second order Møller-Plesset [11] (MP2) correlation energy
calculations were performed with several of the largest basis sets.

For comparison with the ab initio results, MNDO [12], AM1 [13] and
PM3 [14] semi-empirical energy calculations were performed with the
program MOPAC [15]. As above, all structural parameters were optimized
for both equilibrium and transition state conformations.

Results  and  Discussion 

Geometries  Tabulated in columns 3-7 of Table 1 are the structural
parameters for the trans conformer calculated with the various basis
sets used in this study. The results from the D95 (double-zeta) basis
set are not included since, not surprisingly, they are quite similar to
those obtained with the 6-31G (doubly-split valence) basis.

One observes that structural parameters determined with the two
polarized basis sets, 6-31G(d) and 6-311G(d), are generally quite close;
calculated bond lengths are the same to within 0.002-0.006 Å and bond
angles differ by an average of <0.4°. It is seen further from the table
that R(CF2), R(CF3) < R(CF1), in agreement with the experimental
observation [16] that C-F bond lengths shorten with increasing fluorine
substitution on a carbon.
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Iwasaki [17] has determined the structure of TCTFE by electron
diffraction [18], and his results are given in the second column of
Table 1. One finds that C-C and C-Cl bond lengths obtained with the
6-31G(d) and 6-311G(d) basis sets agree closely with his values,
whereas calculated C-F bond lengths are significantly shorter than
experiment; the latter discrepancy is common to all large basis set
SCF calculations on fluorocarbons [19]. Considering the approximations
employed in the experimental structure determination, the bond angles,
too, are in quite satisfactory agreement with measured values.

Shown in the last four columns of the table are the structures of
both equilibrium and saddle point configurations, determined with the
6-311G(d) basis set. As found in earlier ab initio studies of
substituted ethanes [19], all bond lengths with the exception of C-C
remain approximately constant, whereas the latter increases by
0.045-0.050 Å in the saddle point structures. One observes, also, that
apparently equivalent angles vary within a given conformation and between
conformers. These variations can, in all cases, be explained by steric
interactions. For instance, the observation that ∠CCF2 > ∠CCF3 in the G
rotamer is due to the fact that fluorine F2 has two gauche interactions
with chlorine atoms compared to only one for F3. Similarly, ∠CCF1 and
∠CCF2 are greatest in the GG' transition state, which is the only
conformation in which they eclipse chlorine atoms.

Energies  Displayed in Table 2 are the energies and energy
differences (relative to the T conformer) calculated at the SCF and MP2
levels using the various basis sets. One finds that all ab initio
calculations yield negative values for the energy difference, ΔE(G-T),
which, at the SCF level, ranges from -0.7 to -0.8 kcal/mol for the two
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largest basis sets. The correlation energy correction is small,
reducing the magnitude of ΔE by 0.0 to 0.2 kcal/mol. There is no further
correction to the equilibrium energy difference arising from zero point
vibrational energy (ZPVE) and thermal contributions to the enthalpy
(discussed in ref. 3). Thus the final calculated range is
ΔE(G-T) = -0.6 to -0.7 kcal/mol, which is in qualitative agreement with
experimental estimates; ΔH(exp) ≈ ΔE(exp) = -0.25 [20] to -0.35 [21] kcal/mol.
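
The relative energies quoted from Table 2 follow from the total energies by a unit conversion. The short Python sketch below illustrates this for the HF/6-31G(d) row, assuming the usual factor of 627.5095 kcal/mol per hartree (the report does not state which value was used); the function name relative_energies is hypothetical.

```python
HARTREE_TO_KCAL_PER_MOL = 627.5095  # assumed conversion factor

# HF/6-31G(d) total energies in hartrees, copied from Table 2, part A.
hf_631gd = {
    "T": -1752.45857,
    "G": -1752.46012,
    "GT": -1752.44356,
    "GG'": -1752.44707,
}


def relative_energies(total, reference="T"):
    """Energy of each conformer relative to `reference`, in kcal/mol."""
    ref = total[reference]
    return {name: (e - ref) * HARTREE_TO_KCAL_PER_MOL for name, e in total.items()}


rel = relative_energies(hf_631gd)
for name, value in rel.items():
    print(f"{name:>3s}: {value:+.2f} kcal/mol")
```

For this row the converted values round to the entries in part B of Table 2 (-0.97, +9.42 and +7.22 kcal/mol). For some other rows the last digit can differ by 0.01 because the tabulated totals are themselves rounded.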

One sees also from the table that both transition state energies,
ΔE(GT-T) and ΔE(GG'-T), generally increase with size of the basis set.
The correlation energy correction to ΔE(GT-T) is -0.0 to -0.3 kcal/mol.
Together with a -0.4 kcal/mol correction from Δ[ZPVE] and Δ[H(T)-H(0)]
(Ref. 3), one obtains a net energy barrier,
ΔE(GT-T) = 8.7-9.1 kcal/mol (with the three largest basis sets). This
value lies within the range determined from ultrasonic relaxation
measurements [22], ΔE(GT-T) ≈ 5.0-10.0 kcal/mol. It is somewhat higher
than the estimate from the infrared torsional frequency [22]; however,
this latter measurement was subject to potentially large errors.

It is found that the second transition state, GG', has a
significantly lower energy than GT. With the correlation energy,
Δ[ZPVE] and Δ[H(T)-H(0)] corrections, ΔE(GG'-T) ≈ 6.8-7.2 kcal/mol.
The higher energy of the GT saddle point seems quite reasonable since
there is a repulsive interaction between two eclipsed chlorine atoms
which is not present in the GG' conformation.

Finally, we note that all three semi-empirical methods [12-14] predict
that ΔE(G-T) > 0, which disagrees with both experimental and ab initio
results. They also predict that ΔE(GT-T) < ΔE(GG'-T), in contrast to the
ab initio calculations and to chemical intuition.




Vibrations  Although not shown (see Ref. 3), vibrational frequencies
were calculated for the G and T conformers using the 6-31G(d) basis set,
and were multiplied by the standard scale factor, 0.90, to account for
corrections due to vibrational anharmonicity and electron correlation.
The scaled frequencies were in quite satisfactory agreement with
experimental data on the two rotamers, with errors of 1.2-1.8% for modes
below 1000 cm⁻¹; the error was somewhat greater (3.5%-4.2%) for the C-F
stretching vibrations (above 1000 cm⁻¹).

Conclusions  The agreement between theoretical and experimental
geometries, energies and vibrational frequencies was quite satisfactory
when using the 6-31G(d) basis set, which provides evidence that this
basis should provide an adequate characterization of the structures and
energies in our planned computational investigations of model compounds
for the perfluoropolyalkylethers.

Planned Investigations  Dr. Paige and I have just begun a comparative
ab initio study of perfluorobutane [CF3CF2CF2CF3] and
perfluoroethyl methyl ether [CF3CF2OCF3] in order to determine the effect
of replacing CF2 groups by oxygen atoms on the conformational energies
and rotational barriers. We also plan to investigate whether the
3-21G(d) basis set yields satisfactory geometric structural parameters.
If it does, we will be able to perform more computationally efficient
calculations on larger PFPAE model compounds. We also plan to begin an
investigation of the torsional potential energy surface in
perfluorodimethoxymethane [CF3OCF2OCF3] later this coming Spring.
This molecule is of interest since one expects the torsional barriers to
rotation about the two CF2-O bonds to be mutually dependent upon both
dihedral angles.
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III.  NMR RELAXATION TIME STUDIES OF PFPAE CHAIN FLEXIBILITY


Introduction  The measurement of NMR spin-lattice (T1) relaxation
times is a well established technique for probing both the rates and
mechanisms of molecular reorientation in liquids and solutions [23]. The
method has also been used quite profitably to characterize the
conformational mobility of flexible chain polymers [24,25]. To date,
however, NMR relaxation has not been applied to the polymer chain
dynamics of perfluoropolyalkylethers.

As applied to the PFPAE's, the fluorine-19 relaxation time
[T1(19F)] is a function of the rotational correlation time (τc) of the
vector connecting the two fluorine atoms in a CF2 group; i.e.
$T_1^{-1} \propto \tau_c / R^6$, where R is the distance between the two
fluorine nuclei. Qualitatively, τc is the time it takes for the 19F-19F
vector to rotate by one radian (≈60°). The value of the correlation time,
then, provides a direct measure of the polymer's flexibility in the region
immediately surrounding a given perfluoromethylene group. For polymeric
species such as the PFPAE's, τc may be a function of the frequency of the
NMR experiment. The acquisition of data at two or more frequencies permits
a determination of the mechanism of the internal rotation dynamics in
the polymer chain.

During the past year, we have extended investigations begun in
Summer 1990 to study the temperature and frequency dependence of NMR
relaxation in a number of perfluoropolyalkylethers. Data acquisition is
still in progress; below, we describe the preliminary results obtained
to date.

Experimental  Fluorine-19 NMR measurements were performed by students
in my research group at the University of North Texas on (a) a JEOL
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FX90Q FT-NMR spectrometer operating at B0 = 21.1 kG [ν0(19F) = 84.7 MHz], and
(b) a Varian VXR-300 FT-NMR operating at B0 = 70.5 kG [ν0(19F) = 282.2 MHz].
Spin-lattice relaxation times were determined with the standard
Inversion Recovery Fourier Transform (IRFT) pulse sequence [26]
(180°-τ-90°-Acq.), with 10-12 τ values plus τ = ∞. T1 was calculated
from the peak intensities by a non-linear fit to the three parameter
magnetization equation [27].
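
A three-parameter inversion-recovery fit of the kind described above can be sketched in Python as follows. The exact parameterization of the magnetization equation in ref. 27 is not reproduced in this report, so the sketch assumes the generic form M(τ) = A + B exp(-τ/T1); the function names, the search bounds, and the synthetic data are all hypothetical. For each trial T1 the amplitudes A and B follow from linear least squares, and T1 itself is found by a grid scan refined with a golden-section search.

```python
import math


def solve_amplitudes(taus, signal, t1):
    """For fixed T1, least-squares A, B in M = A + B*exp(-tau/T1), plus the SSE."""
    e = [math.exp(-t / t1) for t in taus]
    n = len(taus)
    se, see = sum(e), sum(x * x for x in e)
    sm, sme = sum(signal), sum(m * x for m, x in zip(signal, e))
    b = (n * sme - se * sm) / (n * see - se * se)
    a = (sm - b * se) / n
    sse = sum((m - a - b * x) ** 2 for m, x in zip(signal, e))
    return a, b, sse


def fit_t1(taus, signal, t1_min, t1_max, grid=200):
    """Return (A, B, T1) minimizing the squared residual over [t1_min, t1_max]."""
    def cost(u):
        return solve_amplitudes(taus, signal, math.exp(u))[2]

    lo, hi = math.log(t1_min), math.log(t1_max)
    us = [lo + (hi - lo) * i / (grid - 1) for i in range(grid)]
    best = min(range(grid), key=lambda i: cost(us[i]))
    # Refine between the grid neighbours of the best point.
    lo, hi = us[max(best - 1, 0)], us[min(best + 1, grid - 1)]
    g = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(60):
        c, d = hi - g * (hi - lo), lo + g * (hi - lo)
        if cost(c) < cost(d):
            hi = d
        else:
            lo = c
    t1 = math.exp(0.5 * (lo + hi))
    a, b, _ = solve_amplitudes(taus, signal, t1)
    return a, b, t1


# Synthetic, noise-free recovery curve (hypothetical values, arbitrary time units);
# the longest delay stands in for the "tau = infinity" point.
taus = [0.05, 0.1, 0.2, 0.4, 0.7, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0, 15.0]
signal = [1.0 - 1.9 * math.exp(-t / 1.5) for t in taus]
a_fit, b_fit, t1_fit = fit_t1(taus, signal, 0.1, 20.0)
print(a_fit, b_fit, t1_fit)
```

Separating the linear amplitudes from the single non-linear parameter keeps the search one-dimensional; a general non-linear least-squares routine would serve equally well.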

The following experiments were performed during the past year:

(a) Temperature dependence of relaxation times in
perfluoropoly(tetraethylene glycol), RfO[(CF2CF2O)4CF2O]nCF3, Rf = CF3,
CF3CF2 (ML088-131) [Table 3] [28];
(b) Temperature dependence of T1(19F) in Fomblin-Z,
RfO[CF2O]m[C2F4O]p[C3F6O]q, Rf = CF3, C2F5 (ML078-80) [Table 4];
(c) Temperature and frequency dependence of relaxation times in
perfluoropoly(ethylene oxide), CF3O[CF2CF2O]nCF3 (ML088-50) [Table 5].

Correlation times reported in Tables 3-5 were calculated using standard
formulae [29].

Results and Discussion  The temperature dependent rotational
correlation times of perfluoropoly(tetraethylene glycol) are displayed
in Table 3. One notes first that the τc's of the perfluoromethyl (CF3) end
groups [peaks 4 and 5] are lower than the rotational times of
perfluoromethylene (CF2) groups in the middle of the polymer chain.
This is to be expected since the internal rotation of the latter groups
requires cooperative reorientation about several adjacent bonds. One
sees, however, that the two CF3 groups are not equivalent; the activation
energy for rotation in the CF3CF2O- unit [peak 4] is substantially
higher than in the CF3O- group [peak 5] (4.5 versus 2.5 kcal/mol), which
provides evidence that, as is intuitively reasonable, the barrier to
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rotation about a CF3-CF2 bond is substantially greater than for a CF3-O
bond. Further evidence for a lower barrier to rotation about C-O bonds
is provided by a comparison of peaks 2, 3 and 6. The first two
resonances result from 19F nuclei in CF2 groups with one C-C and one C-O
bond, whereas the third peak represents a group with two C-O bonds. The
activation energy for rotation of the latter CF2 unit is substantially
lower (2.9 kcal/mol) than found for the two former peaks (3.8 and 4.1
kcal/mol) which, again, provides evidence for a markedly lower barrier
to rotation about the C-O bonds in perfluoropolyalkylethers.
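
The activation energies discussed above can be checked against the tabulated correlation times with a short Arrhenius calculation. The Python sketch below assumes the simple form τc = τ0 exp(Ea/RT) and an unweighted least-squares fit of ln τc against 1/T; the report does not state how its Ea values were derived, so this is an illustration and the function name activation_energy is hypothetical. The input values are the peak 4 and peak 5 entries of Table 3.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def activation_energy(temps_c, tau_ps):
    """Ea in kcal/mol from the least-squares slope of ln(tau_c) versus 1/T."""
    x = [1.0 / (t + 273.15) for t in temps_c]
    y = [math.log(tau) for tau in tau_ps]
    n = len(x)
    xm, ym = sum(x) / n, sum(y) / n
    sxy = sum((xi - xm) * (yi - ym) for xi, yi in zip(x, y))
    sxx = sum((xi - xm) ** 2 for xi in x)
    return R_KCAL * sxy / sxx


temps_c = [23.0, 49.0, 73.0]  # measurement temperatures in Table 3
ea_peak4 = activation_energy(temps_c, [31.0, 20.0, 10.0])  # CF3CF2O- end group
ea_peak5 = activation_energy(temps_c, [35.0, 23.0, 19.0])  # CF3O- end group
print(f"peak 4: {ea_peak4:.1f} kcal/mol, peak 5: {ea_peak5:.1f} kcal/mol")
```

Under these assumptions the fitted values come out close to the 4.5 and 2.5 kcal/mol listed in Table 3 for the two end groups.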

Table 4 contains correlation times obtained for the various
resonances of Fomblin-Z as a function of temperature in the neat fluid.
One observes the same trend found above in the simpler perfluoroether.
Specifically, the activation energies for rotation of CF2 groups
containing both a C-C and a C-O bond [peaks 1-4] are uniformly greater
than those for perfluoromethylene units bonded to two oxygens
[bands 7-9], which likely results from the higher barrier to internal
rotation about the former bonds. Due to its greater molecular weight,
with consequently greater correlation times, Fomblin-Z is almost
assuredly not in the motional narrowing limit. Hence, a complete
analysis of the reorientation in this molecule will require measurements
of the frequency dependence of its relaxation times, which are currently
in progress.

Displayed in Table 5 are the reorientational correlation times of
perfluoropoly(ethylene oxide) as a function of temperature at two
different fluorine-19 resonance frequencies (84.7 and 282.2 MHz). One
observes that, as in perfluoropoly(tetraethylene glycol) and Fomblin-Z,
both the values of τc and Ea for reorientation of the CF3O- end group
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are  lower  than  for  other  groups  in  the  middle  of  the  polymer  chain. 

Most significantly, the activation energies of all resonances are
markedly less in the high frequency (282.2 MHz) experiments. If all
motions in the polymer were at a frequency greater than ~300 MHz, one
would expect results of the two experiments to be identical. Hence, it
is clear that the low frequency measurements reflect two separate
reorientational mechanisms, overall rotation superposed on the slower
internal rotations about C-C and C-O bonds in the chain. This coming
Spring, we plan to fit the data with several theoretical models of chain
reorientation [30,31], which will permit us to obtain a quantitative
separation of the internal and overall polymer dynamics.

Future Studies  Work is currently underway to determine the
relaxation and correlation times of Fomblin-Z at a second 19F resonance
frequency and to study relaxation in two other Fomblins of differing
average molecular weights. As for perfluoropoly(ethylene oxide), the
data will be fit to models which will permit separation of the rates and
mechanisms of internal and overall molecular rotation and will,
hopefully, provide semi-quantitative estimates of the barriers to
internal rotation; these results will be correlated with the
experimental viscosity indices. It is expected that, eventually, these
experiments will yield an indication of the effects of structure (e.g.
number of CF2, C2F4, C3F6 units and their arrangement) on the
viscosity-temperature characteristics of perfluoroethers and aid in the
design of new PFPAE lubricants.
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Table  1.  Calculated  Structural  Parameters 
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a)  Bond  lengths  in  Angstroms  and  angles  in  degrees. 

b)  Reference  17. 


Table  2.  Calculated  Conformational  Energies 


Basis  set 

T 

G 

GT 

GG' 

A.  Total  Energies  (Hartrees) 

HF/3-21G 

-1743.842  44 

-1743.848  03 

-1743.831  07 

-1743.839  02 

HF/6-31G 

-1752.247  87 

-1752.250  68 

-1752.234  13 

-1752.238  64 

HF/D95 

-1752.310  81 

-1752.312  21 

-1752.296  76 

-1752.300  86 

HF/6-31G(d) 

-1752.458  57 

-1752.460  12 

-1752.443  56 

-1752.447  07 

HF/6-31 lG(d) 

-1752.630  64 

-1752.631  75 

-1752.615  42 

-1752.618  83 

HF/6-31 1G( 2df ) 
//6-311G(d) 

-1752.673  92 

-1752.675  17 

-1752.658  76 

-1752.661  81 

MP2/6-31G(d) 

-1753.624  56 

-1753.626  06 

-1753.610  05 

-1753.613  10 

MP2/6-311G(d) 

-1753.933  15 

-1753.934  24 

-1753.917  97 

-1753.921  11 

MP2/6-31 lG(2df ) 
//6-31 lG(d) 

-1754.277  79 

-1754.278  75 

-1754.263  01 

-1754.265  66 

B.  Relative  Energies  (kcal/mol) 

HF/3-21G 

0.00 

-3.50 

+7.14 

+2.15 

HF/6-31G 

0.00 

-1.76 

+8.62 

+5.79 

HF/D95 

0.00 

-0.88 

+8.82 

+6.25 

HF/6-31G(d) 

0.00 

-0.97 

+9.42 

+7.22 

HF/6-31 lG(d) 

0.00 

-0.69 

+9.55 

+7.41 

HF/6-31 lG(2df ) 
//6-311G(d) 

0.00 

-0.79 

+9.51 

+7.60 

MP2/6-31G (d) 

0.00 

-0.94 

+9.11 

+7.19 

MP2/6-311G(d) 

0.00 

-0.68 

+9.52 

+7.55 

HP2/6-31 lG(2df ) 
//6-31 IG(d) 

0.00 

-0.61 

+9.27 

+7.61 

MNDO 

0.00 

+0.82 

+5.17 

+7.23 

AMI 

0.00 

+1.13 

+5.15 

+9.21 

PM3 

0.00 

+0.81 

+2.86 

+3.44 

Experiment 

0.00 

-0.25" 

+5.9b 

-0.35d 

+5- 10c 

a)  Reference  20. 

b)  Reference  22.  From  torsional  frequency  (IR). 

c)  Reference  22.  From  ultrasonic  relaxation. 

d)  Reference  21. 
Table 3.  NMR Correlation Times in Perfluoropoly(tetraethylene glycol),
RfO[(CF2CF2O)4CF2O]nCF3,  Rf = CF3, CF3CF2  (HL088-131)

Bulk Fluid.  B0 = 21.1 kG  [v0(19F) = 84.7 MHz]

Peak   Delta(a), ppm   Assignment            23°C, ps   49°C, ps   73°C, ps   Ea, kcal/mol

1      73.6            CF3OCF2CF2O-          72         41         32         3.3
2      74.0            -OCF2CF2OCF2O-        115        63         46         3.8
3      75.7            -OCF2CF2OCF2CF2O-     125        68         46         4.1
4      76.8            CF3CF2O-              31         20         10         4.5
5      108.2           CF3OCF2CF2O-          35         23         19         2.5
6      112.6           -OC2F4OCF2OC2F4O-     92         67         45         2.9

a)  Chemical shifts are measured in ppm downfield from hexafluorobenzene.
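
The last column of Table 3 appears to be an activation energy derived from the temperature dependence of the correlation times. The sketch below assumes an Arrhenius form, tau = tau0 exp(Ea/RT), and fits ln(tau) against 1/T by least squares. The Arrhenius assumption, the gas constant value, and the function name `activation_energy` are introduced here for illustration; the report does not state how the tabulated energies were obtained.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # kcal/(mol K)

# Temperatures (deg C) and correlation times (ps) for peaks 1 and 2 of Table 3.
TEMPERATURES_C = (23.0, 49.0, 73.0)
CORRELATION_TIMES_PS = {
    1: (72.0, 41.0, 32.0),
    2: (115.0, 63.0, 46.0),
}


def activation_energy(temps_c, taus_ps):
    """Least-squares slope of ln(tau) versus 1/T, multiplied by R (kcal/mol)."""
    x = [1.0 / (t + 273.15) for t in temps_c]
    y = [math.log(tau) for tau in taus_ps]
    x_mean = sum(x) / len(x)
    y_mean = sum(y) / len(y)
    s_xy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_mean) ** 2 for xi in x)
    return GAS_CONSTANT_KCAL * s_xy / s_xx


if __name__ == "__main__":
    for peak, taus in CORRELATION_TIMES_PS.items():
        print(f"peak {peak}: Ea = {activation_energy(TEMPERATURES_C, taus):.2f} kcal/mol")
```

Under this assumption the fitted values for peaks 1 and 2 fall close to the tabulated 3.3 and 3.8 kcal/mol, which supports reading the column as an Arrhenius activation energy.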
Table 4.  NMR Correlation Times in Fomblin-Z
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ABSTRACT 

A mathematical model for predicting the formation of macrosegregation in castings has
been developed. The model accounts for the fluid flow caused by temperature gradients,
solutal gradients, and shrinkage, as well as the domain change during solidification.
A benchmark problem of unidirectional solidification chilled from the bottom was solved,
for the first time, with the comprehensive mathematical model, including the calculation
of heat transfer, fluid flow, and mass transfer. The predicted results for the
solidification of Al-Cu alloys were consistent with experimental results reported in the
1960s. The model also revealed transient behavior in the formation of macrosegregation
that cannot be obtained from experiments. The present study found that the formation of
macrosegregation in castings can be significantly influenced by the fluid flow caused by
shrinkage. It is expected that the results of the sponsored research will lead to two
journal publications, which are currently in preparation.
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I.  INTRODUCTION 


Large-scale nonuniformity in composition, i.e., macrosegregation, is frequently found in
large solidified ingots [1-2]. An ingot of uniform composition is necessary to guarantee
the quality of the cast parts subsequently made from it. Even for small cast parts,
high-quality applications such as aircraft components cannot tolerate nonuniform
composition, because macrosegregation can greatly influence the mechanical properties of
the part. Hence, understanding the fundamental mechanisms responsible for the formation
of macrosegregation, and finding methods to prevent or reduce it, have been long-standing
goals of the foundry industry.

The formation of macrosegregation in castings is understood to be caused by fluid flow
in the casting [3-4]. This flow can be driven by the density difference between the solid
and liquid phases (shrinkage-induced flow) as well as by thermal and solutal gradients
(natural convection). Modeling fluid flow and heat transfer in a casting is inherently
difficult because the solid phase, mushy zone, and liquid phase coexist, and because
latent heat is released at solid-liquid interfaces whose positions are not known in
advance. Recently, the development of the continuum model, or volume-averaging method
[5], has made it possible to model casting solidification with fluid flow. In the
continuum model, a single set of governing equations is required for the entire domain,
including the solid phase, mushy zone, and liquid phase.

Previous studies on the formation of macrosegregation in castings concentrated primarily
on experimental work. The first attempt to model the formation of macrosegregation is
described in the pioneering papers of Flemings and co-workers [6-8], who computed
interdendritic flows by treating the mushy zone as a porous medium whose porosity varied
with solid fraction. Solutal segregation in the mushy region was predicted subject to a
prescribed flow that was assumed to be known. A major extension of the model was made by
Mehrabian et al. [9], who included the calculation of fluid flow in the mushy zone.
Darcy's law was used to describe the two-dimensional velocity and pressure fields,
although the temperature gradients and solidification rate were prescribed and the
average concentration of the solid and liquid phases was assumed to be constant.
Although their model allowed only for liquid density variations with temperature, Szekely
and Jassal [10] appear to have been the first to solve the two-dimensional momentum and
energy equations for both the mushy zone and the bulk liquid. The important conclusion of
their research was that convection in the bulk liquid had a marked effect on the velocity
field in the mushy zone. Bennon and Incropera [11] predicted channel segregation in an
NH4Cl-H2O system by solving the fully coupled momentum, energy, and species equations. A
similar model was established by Beckermann and Viskanta [12]. In their models, the fluid
flow was caused by temperature and solutal gradients, but the shrinkage effect was
neglected and a constant density was assumed throughout the entire domain.

For many alloys, the density of the solid phase is greater than that of the liquid
phase. Hence, most alloys shrink by 2-8% in volume during solidification. The shrinkage
inevitably results in fluid flow and a change of the global domain, which are naturally
present in phase-change systems and usually cannot be avoided. In practical foundry
operations, risers or hot tops are used to feed the shrinkage and thus to reduce possible
casting defects. Although the shrinkage-induced fluid flow is expected to be small
compared with a typical flow in forced convection, Chiang and Tsai [13-14] have found
that, depending upon solidification conditions, it can be of the same order of magnitude
as the natural convection caused by thermal and/or solutal gradients. This is especially
true for the fluid flow in the mushy zone. As the fluid flow in the mushy zone has been
recognized as the major cause of macrosegregation, accounting for the shrinkage effect is
expected to be essential for the prediction of macrosegregation in castings. In the
present study, a mathematical model to predict the formation of macrosegregation in
castings is established, which includes the natural convection caused by temperature and
solutal gradients as well as the fluid flow caused by shrinkage.

II.  FORMULATION OF THE PROBLEM

The formulation is based on the continuum model developed by Chiang and Tsai [13-14].
In deriving the governing equations, the following assumptions are made: 1) the solid and
liquid in the mushy zone are in local thermal and phase equilibrium; 2) shrinkage is
caused by the phase change from the liquid to the solid phase; 3) the Boussinesq
approximation can be employed for natural convection; and 4) feeding of shrinkage by the
gravitational force is complete, so that there are no pores in the solid or liquid
phases. Two solidification conditions are studied: the first is unidirectional
solidification chilled from the bottom (Figure 1), and the second is a solidification
process starting from the side-wall (Figure 2). In the first case, for the solidification
of Al-Cu alloys, there is no flow in the casting if the shrinkage effect is neglected. In
the second case, the fluid flow is caused by natural convection, shrinkage, and their
interaction. In the following, the continuum properties are defined first and the
governing equations are then given. The meaning of each variable is provided in the
Nomenclature.


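Definitions:

$$
\rho = g_s \rho_s + g_l \rho_l; \quad c = f_s c_s + f_l c_l; \quad k = g_s k_s + g_l k_l
$$

$$
D = f_s D_s + f_l D_l; \quad f_s = \frac{g_s \rho_s}{\rho}; \quad f_l = \frac{g_l \rho_l}{\rho}
$$

$$
\mathbf{V} = f_s \mathbf{V}_s + f_l \mathbf{V}_l; \quad h = f_s h_s + f_l h_l; \quad f^\alpha = f_s f_s^\alpha + f_l f_l^\alpha \tag{1}
$$

Continuity:

$$
\frac{\partial \rho}{\partial t} + \nabla \cdot (\rho \mathbf{V}) = 0 \tag{2}
$$

Momentum:

$$
\frac{\partial}{\partial t}(\rho u) + \nabla \cdot (\rho \mathbf{V} u) = \nabla \cdot \left( \mu_l \frac{\rho}{\rho_l} \nabla u \right) - \frac{\partial p}{\partial x} - \frac{\mu_l}{K} \frac{\rho}{\rho_l} (u - u_s) - \frac{C \rho^2}{K^{1/2} \rho_l} |u - u_s| (u - u_s) - \nabla \cdot (\rho f_s f_l \mathbf{V}_r u_r) + \nabla \cdot \left( \mu_l u \nabla \left( \frac{\rho}{\rho_l} \right) \right) \tag{3}
$$

$$
\frac{\partial}{\partial t}(\rho v) + \nabla \cdot (\rho \mathbf{V} v) = \nabla \cdot \left( \mu_l \frac{\rho}{\rho_l} \nabla v \right) - \frac{\partial p}{\partial y} - \frac{\mu_l}{K} \frac{\rho}{\rho_l} (v - v_s) - \frac{C \rho^2}{K^{1/2} \rho_l} |v - v_s| (v - v_s) - \nabla \cdot (\rho f_s f_l \mathbf{V}_r v_r) + \nabla \cdot \left( \mu_l v \nabla \left( \frac{\rho}{\rho_l} \right) \right) + \rho g \left( \beta_T (T - T_o) + \beta_s (f_l^\alpha - f_{l,o}^\alpha) \right) \tag{4}
$$

Energy:

$$
\frac{\partial}{\partial t}(\rho h) + \nabla \cdot (\rho \mathbf{V} h) = \nabla \cdot \left( \frac{k}{c_s} \nabla h \right) + \nabla \cdot \left( \frac{k}{c_s} \nabla (h_s - h) \right) - \nabla \cdot \left( \rho (\mathbf{V} - \mathbf{V}_s)(h_l - h) \right) \tag{5}
$$

Species:

$$
\frac{\partial}{\partial t}(\rho f^\alpha) + \nabla \cdot (\rho \mathbf{V} f^\alpha) = \nabla \cdot (\rho D \nabla f^\alpha) + \nabla \cdot \left( \rho D \nabla (f_l^\alpha - f^\alpha) \right) - \nabla \cdot \left( \rho (\mathbf{V} - \mathbf{V}_s)(f_l^\alpha - f^\alpha) \right) \tag{6}
$$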
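
The mixture definitions in (1) can be evaluated directly once the volume fraction of solid is known. The sketch below is a minimal illustration that assumes a saturated mixture (g_s + g_l = 1, consistent with the no-pore assumption) and uses made-up density and velocity values. The function name `mixture_properties` and all numbers are hypothetical and are not taken from the report.

```python
def mixture_properties(g_s, rho_s, rho_l, v_s, v_l):
    """Mixture density, mass fractions, and mixture velocity from the definitions (1).

    g_s          volume fraction of solid (liquid fraction taken as 1 - g_s)
    rho_s, rho_l phase densities
    v_s, v_l     one velocity component of the solid and liquid phases
    """
    g_l = 1.0 - g_s
    rho = g_s * rho_s + g_l * rho_l       # mixture density
    f_s = g_s * rho_s / rho               # mass fraction of solid
    f_l = g_l * rho_l / rho               # mass fraction of liquid
    velocity = f_s * v_s + f_l * v_l      # mass-averaged mixture velocity
    return rho, f_s, f_l, velocity


if __name__ == "__main__":
    # Illustrative values only: stationary solid, slowly moving liquid.
    rho, f_s, f_l, v = mixture_properties(0.4, 2600.0, 2400.0, 0.0, -1.0e-3)
    print(rho, f_s, f_l, v)
```

With a stationary solid, the mixture mass flux rho*V reduces to g_l*rho_l*V_l, which is the quantity the continuity equation (2) conserves.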

The  above  governing  equations  are  "closed"  by  providing  the  relationships  between  the 
enthalpy  and  the  temperature,  between  the  solutal  concentration  and  the  temperature 
through  the  phase  diagram,  as  well  as  the  permeability  function  in  the  mushy  zone.  These 
auxiliary  equations  are  given  as  follows: 


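$$
h_s = c_s T; \quad h_l = c_l T + (c_s - c_l) T_e + H; \quad f_s = \frac{1}{1 - k_p} \, \frac{T - T_l}{T - T_m}
$$

$$
f_s^\alpha = \left[ \frac{k_p}{1 + f_s (k_p - 1)} \right] f^\alpha; \quad f_l^\alpha = \left[ \frac{1}{1 + f_s (k_p - 1)} \right] f^\alpha
$$

$$
K = \frac{g_l^3}{C_1 (1 - g_l)^2}; \quad C_1 = \frac{180}{d^2}; \quad C = 0.13 \, g_l^{-3/2} \tag{7}
$$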
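
To make the closure relations concrete, the following sketch evaluates a lever-rule solid fraction, the solute partition between the phases, and a Kozeny-Carman-type permeability, K = g_l^3 / [C_1 (1 - g_l)^2] with C_1 = 180/d^2. These forms are this sketch's reading of the auxiliary relations (7), and the numerical constant 180 should be treated as an assumption. All function names and input values (liquidus temperature, partition ratio, dendrite arm spacing) are hypothetical and chosen only for illustration, and the clamp of the solid fraction to the range [0, 1] is an addition of the sketch.

```python
def solid_mass_fraction(temp, temp_liquidus, temp_fusion, k_p):
    """Lever-rule solid mass fraction; clamped to [0, 1] (clamp added here)."""
    if temp >= temp_liquidus:
        return 0.0
    f_s = (temp - temp_liquidus) / ((1.0 - k_p) * (temp - temp_fusion))
    return min(f_s, 1.0)


def phase_concentrations(f_mix, f_s, k_p):
    """Solute mass fractions in the solid and liquid for mixture concentration f_mix."""
    denominator = 1.0 + f_s * (k_p - 1.0)
    return k_p * f_mix / denominator, f_mix / denominator


def permeability(g_l, arm_spacing):
    """Kozeny-Carman-type permeability of the mushy zone (requires g_l < 1)."""
    c_1 = 180.0 / arm_spacing ** 2
    return g_l ** 3 / (c_1 * (1.0 - g_l) ** 2)


if __name__ == "__main__":
    # Illustrative inputs only (temperatures in deg C, spacing in m).
    f_s = solid_mass_fraction(640.0, 650.0, 660.0, 0.17)
    conc_solid, conc_liquid = phase_concentrations(0.041, f_s, 0.17)
    print(f_s, conc_solid, conc_liquid, permeability(0.5, 1.0e-4))
```

Note that the two phase concentrations recombine to the mixture value, f_s f_s^a + f_l f_l^a = f^a, and that their ratio equals the partition ratio k_p, as the definitions require.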


The  corresponding  boundary  conditions  should  be  consistent  with  the  conditions  given 
in  either  Figure  1  or  Figure  2.  The  initial  casting  temperature  is  assumed  uniform. 
III.  COMPUTATIONAL CONSIDERATIONS


The above governing equations are in the general format suggested by Patankar [15] for
the numerical solution of heat transfer and fluid flow problems; i.e., they contain a
transient term, a convective term, a diffusive term, and source terms. Hence, any
established numerical procedure for solving coupled elliptic partial differential
equations can be used, with slight modifications for the different source terms. In the
present study, the equations were solved iteratively at each time step using the
control-volume-based finite difference procedure described by Patankar. A fully implicit
formulation was used for the time-dependent terms, and the combined convection/diffusion
coefficients were evaluated using an upwind scheme. The SIMPLEC algorithm [16] was
applied to solve the momentum and continuity equations for the velocity field.
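
As an illustration of how an upwind scheme combines convection and diffusion, the sketch below gives the neighbor coefficients for a single one-dimensional, steady-state control volume in the usual control-volume notation (face flow rate F, diffusion conductance D). This is a generic textbook form shown under those simplifying assumptions; the report gives no discretization details, and the function and argument names are hypothetical.

```python
def upwind_coefficients(flow_e, flow_w, diff_e, diff_w):
    """Neighbor and center coefficients for a 1-D control volume (upwind scheme).

    flow_e, flow_w  mass flow rates through the east and west faces,
                    positive in the +x direction
    diff_e, diff_w  diffusion conductances at the east and west faces
    """
    # Convection contributes only from the upstream side of each face.
    a_east = diff_e + max(-flow_e, 0.0)
    a_west = diff_w + max(flow_w, 0.0)
    # Center coefficient; the last term vanishes when the face flows balance.
    a_center = a_east + a_west + (flow_e - flow_w)
    return a_east, a_west, a_center


if __name__ == "__main__":
    print(upwind_coefficients(2.0, 2.0, 1.0, 1.0))    # flow in +x direction
    print(upwind_coefficients(-2.0, -2.0, 1.0, 1.0))  # flow in -x direction
```

For flow in the +x direction the west (upstream) neighbor receives the convective weight, and for flow in the -x direction the east neighbor does, which keeps all coefficients non-negative.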

At each time step, the momentum equations were solved first in the iteration process,
using the estimated volume fractions of solid and liquid for the mixture density, the
permeability, and the mass fractions of solid and liquid. The energy equation was then
solved for the enthalpy, from which the temperature was calculated. Similarly, the
species equation was solved for the solutal concentration. Next, the volume fractions of
solid and liquid, the permeability, and the mass fractions of solid and liquid were
updated, and the process was repeated for each iteration. For each time step, iterations
were terminated when the maximum residual source of mass, momentum, energy, and
concentration was less than 1 × 10^-5. A line-by-line solver based on the tridiagonal
matrix algorithm (TDMA) was used to solve the algebraic discretization equations
iteratively. The last five terms on the right-hand side of equation (3), the last seven
terms on the right-hand side of equation (4), and the last two terms on the right-hand
side of equations (5) and (6) represent source terms, and they were treated according to
the procedure outlined in Patankar's book [15]. The last two terms on the right-hand side
of equation (3), the fifth and sixth terms on the right-hand side of equation (4), and
the last term on the right-hand side of equations (5) and (6) were calculated via the
upwind scheme.
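
The tridiagonal matrix algorithm at the core of a line-by-line solver can be sketched as follows. This is the standard Thomas algorithm for a single grid line, written here for illustration; the storage convention (three coefficient lists plus a right-hand side) and the function name `tdma` are choices of this sketch, not details from the report. In a line-by-line solver this kernel would be applied repeatedly along grid lines, with off-line neighbors moved to the right-hand side.

```python
def tdma(lower, diag, upper, rhs):
    """Solve a tridiagonal system with the Thomas algorithm.

    Row i reads: lower[i]*x[i-1] + diag[i]*x[i] + upper[i]*x[i+1] = rhs[i].
    lower[0] and upper[-1] are ignored.
    """
    n = len(diag)
    c_prime = [0.0] * n
    d_prime = [0.0] * n

    # Forward elimination.
    c_prime[0] = upper[0] / diag[0]
    d_prime[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c_prime[i - 1]
        c_prime[i] = upper[i] / denom if i < n - 1 else 0.0
        d_prime[i] = (rhs[i] - lower[i] * d_prime[i - 1]) / denom

    # Back substitution.
    x = [0.0] * n
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x


if __name__ == "__main__":
    # Small diagonally dominant test system whose exact solution is [1, 1, 1].
    solution = tdma([0.0, -1.0, -1.0], [2.0, 2.0, 2.0], [-1.0, -1.0, 0.0], [1.0, 0.0, 1.0])
    print(solution)
```

The algorithm needs no pivoting when the system is diagonally dominant, which is the situation the non-negative upwind coefficients are designed to produce.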

Because the domain changes during solidification, special attention must be paid to the
moving boundary at the free surface on top of the riser. Moving nodes were placed at the
free surface to track the physical domain of the riser. As the free surface was assumed
to be flat, its displacement in each step was taken as the average velocity of the moving
nodes multiplied by the time-step size. A detailed description of the free-surface
treatment can be found in Reference [17].
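
The flat free-surface update described above amounts to a single explicit step. A minimal sketch, assuming the vertical velocities of the moving nodes are available as a list and that the surface height is measured upward (the function name and the sample numbers are hypothetical):

```python
def update_free_surface(height, node_velocities, dt):
    """Move a flat free surface by the mean vertical velocity of its nodes times dt."""
    mean_velocity = sum(node_velocities) / len(node_velocities)
    return height + mean_velocity * dt


if __name__ == "__main__":
    # Illustrative values: the surface sinks as liquid feeds the shrinkage below.
    print(update_free_surface(0.1, [-0.001, -0.003], 0.05))
```

Averaging the nodal velocities is what keeps the surface flat; negative velocities lower the riser level as it feeds the casting.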

Numerical experimentation showed that a larger time-step size decreased the convergence
rate at the beginning of the computation. This is understandable, as the heat flux at the
walls is very large at the beginning of solidification. To obtain good solution accuracy
and to maintain numerical stability, a variable time-step size was adopted: the initial
time-step size was 0.0001 seconds and the maximum time-step size was 0.05 seconds. The
numerical algorithm used in the present study has been verified by applying it to several
well-known problems for which analytical and/or numerical solutions are available for
comparison [17]. All the calculations were executed on Apollo 10,000 workstations.
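
One simple way to realize a variable time-step size between the stated limits is geometric growth with a cap. The sketch below uses the initial and maximum step sizes quoted above; the growth factor of 1.2 and the geometric rule itself are assumptions of this sketch, since the report does not say how the step size was increased.

```python
def time_step_sizes(n_steps, dt_initial=1.0e-4, dt_max=0.05, growth=1.2):
    """Return n_steps time-step sizes that grow geometrically up to dt_max."""
    sizes = []
    dt = dt_initial
    for _ in range(n_steps):
        sizes.append(dt)
        dt = min(dt * growth, dt_max)  # grow, but never beyond the cap
    return sizes


if __name__ == "__main__":
    sizes = time_step_sizes(50)
    print(sizes[0], sizes[-1], sum(sizes))
```

Small initial steps resolve the period of very large wall heat flux, and the cap limits the step once solidification has slowed.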


IV.  RESULTS  AND  DISCUSSION 

The results for the two solidification conditions described above are presented
separately below.
Case I: Unidirectional Solidification Chilling from Bottom

Figure 3 shows the flow field in the solidifying casting at time t = 16.2 seconds. For
Al-Cu alloys, because the density of copper is greater than that of aluminum and the
cooling is from the bottom, both the temperature and concentration fields are stable. In
other words, there is no fluid flow if the shrinkage effect is neglected. Hence, the
fluid flow shown in Figure 3 is caused solely by shrinkage. The corresponding isotherms
are given in Figure 4; they indicate that the temperature is uniform in the horizontal
direction. Similarly, the distribution of solute (copper) is uniform in the horizontal
direction. Another set of velocity profiles and temperature distributions is given in
Figures 5 and 6, respectively, at time t = 136.2 seconds. The velocity profile in the
pure liquid phase is parabolic in shape, and it is distorted as the flow approaches the
liquidus line, where the solid phase starts to appear. The flow in the mushy zone tends
to become uniform in the horizontal direction, and its magnitude decreases as the solid
fraction increases toward the bottom surface.
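
The origin of the shrinkage-driven flow in this case can be read from the continuity equation (2). The following is a simplified sketch, not the full two-dimensional model: it assumes constant phase densities, g_s + g_l = 1, a horizontally averaged vertical velocity v(y, t), and an impermeable chill at y = 0.

$$
\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v)}{\partial y} = 0, \qquad \rho = g_s \rho_s + (1 - g_s) \rho_l \;\Rightarrow\; \frac{\partial \rho}{\partial t} = (\rho_s - \rho_l) \frac{\partial g_s}{\partial t}
$$

$$
\rho v (y, t) = -(\rho_s - \rho_l) \int_0^y \frac{\partial g_s}{\partial t} \, dy'
$$

With \rho_s > \rho_l and a growing solid fraction, the mass flux is negative, i.e., directed toward the chill. It is uniform above the liquidus line, where the solid fraction does not change, and its magnitude falls toward the bottom surface as less solidifying material lies below, which is consistent with the trend described for the mushy zone.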

The copper concentration distribution along the y-direction at time t = 16.2 seconds is
shown in Figure 7. The maximum concentration occurs at the bottom wall, and the
concentration above the liquidus line is uniform and equal to the initial copper
concentration, 4.1%. The concentration profile propagates in the y-direction with time,
as shown in Figure 8, and the copper concentration near the bottom wall increases with
time. In both Figures 7 and 8, the regions with concentrations below and above the
initial value compensate each other, so that the overall conservation of mass (copper) is
maintained. The dynamic evolution of the copper concentration in the y-direction is shown
in Figure 9. The concentration in the mushy zone increases with time until the alloy is
completely solidified, after which the concentration in the solidified alloy remains
constant. The predicted final concentration distribution in the solidified alloy, along
with published experimental results, is given in Figure 10. For comparison, previously
calculated results are also given in the figure [18-19]. Although similar experimental
and theoretical results were reported in the 1960s by Flemings and co-workers [6-8], a
direct comparison was not possible because the exact casting conditions were not given in
References [6-8]. Nevertheless, all the previous experiments showed the formation of
"inverse segregation" at the bottom of the casting. The previously calculated results
were obtained under very simplified conditions, i.e., assuming that the fluid flow and
the velocity of the isotherms were known [6-8,19]. Figure 10 indicates that the model
established in the present study correctly predicts the formation of macrosegregation.
Furthermore, the present study shows that the macrosegregation in unidirectional
solidification of Al-Cu alloys cooled from the bottom is created solely by the
shrinkage-induced fluid flow.

Case II: Solidification Starting from the Side-Wall

Figure 11 shows the flow pattern in the casting at time t = 9.1 seconds. A clockwise
circulation vortex is formed by the natural convection caused by temperature gradients.
The two lines in the casting represent the liquidus line and the solidus line,
respectively. The riser in the figure provides liquid metal to feed shrinkage in the
casting. The corresponding isotherms are given in Figure 12, which shows the isotherms
distorted by the fluid flow. However, the isotherms closer to the side-wall are not
distorted as much as those farther from it. Figure 12 also shows that at time t = 9.1
seconds, a large portion of the casting domain is not yet affected and remains at the
initial temperature, although a large circulation vortex exists. The corresponding
concentration profile at time t = 9.1 seconds is given in Figure 13. In the solid phase
(which can be seen in Figure 11), the copper concentration is greatest near the wall and
decreases in the direction away from the wall. However, the copper concentration in the
mushy zone becomes irregular and a concentration "cell" appears.

Another set of flow pattern, isotherms, and concentration profile is given in Figures
14, 15, and 16, respectively, at time t = 57.1 seconds. The circulation vortex has become
weaker and the mushy zone has grown. The isotherms in Figure 15 are nearly vertical,
which implies that the fluid flow does not significantly affect the temperature field.
This is because of the high thermal conductivity of the Al-Cu alloys and the formation of
the solid phase, which increases the thermal resistance in the casting. A very complex
concentration profile, involving several cells, is found in Figure 16. Comparing Figures
13 and 16 shows that the concentration profile near the side-wall is similar to that
found in unidirectional solidification, in which the flow is caused by shrinkage only. At
the beginning of solidification, the fluid flow caused by shrinkage is strongest because
of the large heat flux, while the natural convection is weak. Hence, the concentration
near the side-wall in Figures 13 and 16 resembles that observed in Figure 10. At later
times, however, natural convection becomes dominant and overcomes the shrinkage-induced
fluid flow; as a result, the concentration profile away from the side-wall is quite
different. A large concentration cell is revealed in each of the solid phase and the
mushy zone. As no experimental results are available for comparison, it is not clear
whether the calculated concentration profile is correct. However, the flow pattern and
the isotherms in the casting given in Figures 14 and 15 appear correct. As the velocity,
temperature, and concentration are coupled in the mathematical model, if the temperature
and velocity fields are correct, it is reasonable to assume that the concentration
profile predicted by the present model is correct, at least in its trend. Nevertheless,
corresponding experiments should be conducted to verify the theoretical predictions.




V.  CONCLUSIONS 


A mathematical model based on the continuum approach was developed for the prediction
of macrosegregation in castings during alloy solidification. The model considers the
fluid flow caused by temperature gradients, solutal gradients, and shrinkage, as well as
the domain change. The set of governing equations is valid for the entire domain,
including the solid phase, mushy zone, and liquid phase. Two solidification cases were
studied: the first is unidirectional solidification chilled from below, and the second is
a solidification process starting from the side-wall. For the first case, experimental
results for Al-Cu alloys were reported in the 1960s, and the present study was able to
predict these results for the first time. This is because, for solidification cooled from
the bottom, the flow is caused only by shrinkage, and the existing models have neglected
the shrinkage effect.

When a chill is placed at the side-wall, both natural convection and shrinkage-induced
fluid flow exist, and they interact with each other. The final solute distribution in the
solidified casting depends on the relative strength of these two flow mechanisms. As
accurate values of several parameters, such as the solutal and thermal expansion
coefficients, are not available, numerical experiments were conducted. It was found that
under different conditions the predicted temperature and velocity fields are similar,
while the concentration profiles differ considerably. This implies that the distribution
of solute in the casting is very sensitive to the casting conditions. Corresponding
experiments should be conducted to verify the theoretical predictions of the present
model. It is expected that the results of the sponsored research will lead to two journal
publications, which are currently in preparation.
NOMENCLATURE

c       specific heat
C       coefficient in momentum equation
C1      permeability coefficient
d       dendrite arm spacing
D       mass diffusion coefficient
f       mass fraction
F       fractional volume of the cell
g       volume fraction or gravitational acceleration
h       enthalpy
H       latent heat
k       thermal conductivity
K       permeability
kp      equilibrium partition ratio
p       pressure
T       temperature
t       time
u, v    x-, y-direction velocity components
V       velocity vector
x, y    Cartesian coordinates

Greek Symbols

β_s     solutal expansion coefficient
β_T     thermal expansion coefficient
μ       viscosity
ρ       density

Subscripts

e       eutectic
l       liquid
m       fusion temperature, f^α = 0
o       reference
r       solid & liquid relative velocity
s       solid

Superscript

α       constituent
Fig. 2  Schematic Representation of a Solidification Process Chilling from Side-Wall.

Fig. 3  Flow Pattern and Mushy Zone at Time t = 16.2 Seconds.

Fig. 4  Isotherms in the Casting at Time t = 16.2 Seconds.

Fig. 5  Flow Pattern and Mushy Zone at Time t = 136.2 Seconds.

Fig. 7  Copper Concentration in the y-direction at Time t = 16.2 Seconds.

Fig. 8  Copper Concentration in the y-direction at Time t = 136.2 Seconds.

Fig. 9  Time Evolution of Copper Concentration in the Casting.

Fig. 10  Copper Concentration Distribution in the Solidified Casting.

Fig. 11  Flow Pattern and Mushy Zone at Time t = 9.1 Seconds.

Fig. 14  Flow Pattern and Mushy Zone at Time t = 57.1 Seconds.

Fig. 16  Copper Concentration Profile in the Casting at Time t = 57.1 Seconds.
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